[Java] Integrating variadicBufferCounts to RecordBatch in Java #41729

Open
2 of 37 tasks
vibhatha opened this issue May 20, 2024 · 2 comments

vibhatha commented May 20, 2024

Describe the enhancement requested

With the introduction of Utf8View and BinaryView, a new attribute, variadicBufferCounts, was added to the RecordBatch spec. Once this change is integrated, we need to update the existing interfaces that use RecordBatch, along with the modules that depend on those interfaces. It would be better to introduce the change component by component so that each part can be tested in a streamlined manner.
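
For context, here is a minimal sketch of what the new attribute carries. Per the Arrow columnar format, each view-typed field (Utf8View or BinaryView) contributes one entry stating how many variadic data buffers it owns in the current batch; other fields contribute nothing. The `Column` class and the `variadicBufferCounts` helper are hypothetical names used only for illustration and are not Arrow Java APIs.

```java
import java.util.ArrayList;
import java.util.List;

public class VariadicCountsSketch {
    // Hypothetical stand-in for a column; not an Arrow Java class.
    public static final class Column {
        final String name;
        final boolean isView;   // true for Utf8View / BinaryView columns
        final int dataBuffers;  // number of variadic data buffers held by the column

        public Column(String name, boolean isView, int dataBuffers) {
            this.name = name;
            this.isView = isView;
            this.dataBuffers = dataBuffers;
        }
    }

    // Emits one entry per view-typed column, in schema order.
    // Non-view columns do not add an entry.
    public static List<Long> variadicBufferCounts(List<Column> columns) {
        List<Long> counts = new ArrayList<>();
        for (Column c : columns) {
            if (c.isView) {
                counts.add((long) c.dataBuffers);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<Column> columns = List.of(
            new Column("id", false, 0),      // e.g. an Int32 column
            new Column("name", true, 2),     // view column with two data buffers
            new Column("payload", true, 0)); // view column whose values are all inlined
        System.out.println(variadicBufferCounts(columns)); // prints [2, 0]
    }
}
```

Note that a view column with no data buffers still gets an entry (of 0), which is why every reader and writer of RecordBatch has to be aware of the attribute.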

The following is a list of modules and their classes that are affected by the RecordBatch specification update.

Vector

  • JsonFileReader
  • JsonFileWriter
  • ArrowReader
  • TestValueVector
  • TestVarcharViewVector
  • TestVectorUnloadLoad
  • TestArrowReaderWriter
  • TestRoundtrip
  • TestJsonFile
  • ArrowRecordBatchBenchmarks
  • MessageSerializerTest
  • ArrowDictionaryBatch
  • ArrowStreamReader
  • TestValidateVector

Performance

  • VectorLoaderBenchmark
  • VectorUnloaderBenchmarks

C

  • RoundtripTest
  • StreamTest

Dataset

  • TestDataset
  • TestFileSystemDataset
  • ArrowScannerReader
  • NativeScanner

Flight

  • TestFlightClient
  • TestDoExchange
  • IntegrationTestClient
  • FlightStream
  • IntegrationProducer
  • ExpirationTimeDoGetScenario
  • VectorSchemaRootTransformerTest
  • FlightSQLExample
  • TestBasicOperation

Compression

  • TestCompressionCodec

Gandiva

  • ProjectorTest
  • ProjectorDecimalTest
  • FilterTest
  • FilterProjectTest
  • BaseEvaluatorTest

Component(s)

Java

@vibhatha
Collaborator Author

@lidavidm I organized the above list of affected components after the realization I mentioned here.

FiV0 added a commit to FiV0/arrow that referenced this issue May 20, 2024
vibhatha commented May 20, 2024

The evaluation was done as part of PR #41676, but I believe that PR should be split into a few smaller components.

With the goal of adding C Data Interface support for StringView, I am thinking of the following first two steps at the moment.

Introduce variadicBufferCounts to RecordBatch

  • TestVectorUnloadLoad
  • TestVarcharViewVector
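
To illustrate why the load side needs the counts for an unload/load round trip, here is a rough sketch, not the Arrow Java implementation. Buffers in a record batch are stored as one flat list, so a loader has to know how many buffers each field consumes; for a view-typed field that is its fixed buffers (validity and views) plus the matching variadic count. The `buffersPerField` helper and its parameters are hypothetical.

```java
import java.util.Arrays;

public class BufferSliceSketch {
    // Returns how many buffers of the flat buffer list each field consumes.
    // fixedBufferCounts[i]: fixed buffers of field i (2 for a view column: validity + views).
    // isView[i]: whether field i is Utf8View / BinaryView.
    // variadicBufferCounts: one entry per view field, in schema order.
    public static int[] buffersPerField(
            int[] fixedBufferCounts, boolean[] isView, long[] variadicBufferCounts) {
        int[] totals = new int[fixedBufferCounts.length];
        int next = 0; // index of the next unread variadic count
        for (int i = 0; i < fixedBufferCounts.length; i++) {
            totals[i] = fixedBufferCounts[i];
            if (isView[i]) {
                if (next >= variadicBufferCounts.length) {
                    throw new IllegalArgumentException(
                        "missing variadicBufferCounts entry for field " + i);
                }
                totals[i] += Math.toIntExact(variadicBufferCounts[next++]);
            }
        }
        if (next != variadicBufferCounts.length) {
            throw new IllegalArgumentException("unused variadicBufferCounts entries");
        }
        return totals;
    }

    public static void main(String[] args) {
        // An Int32 column (validity + data) followed by a view column with 3 data buffers.
        int[] totals = buffersPerField(
            new int[] {2, 2}, new boolean[] {false, true}, new long[] {3});
        System.out.println(Arrays.toString(totals)); // prints [2, 5]
    }
}
```

Without the counts, the loader could not tell where the view column's buffers end, which is the kind of mismatch a round-trip test such as TestVectorUnloadLoad would surface.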

Add the TypeLayout features.

What has been done here
