clee704 changed the title from "Support reading/writing key-value metadata from/to ColumnMetaData" to "[C++][Python] Support reading/writing key-value metadata from/to ColumnMetaData" on May 8, 2024
clee704 changed the title from "[C++][Python] Support reading/writing key-value metadata from/to ColumnMetaData" to "[C++][Python] Support reading/writing key-value metadata from/to ColumnChunkMetaData" on May 8, 2024
…tadata from/to ColumnChunkMetaData (#41580)
### Rationale for this change
The Parquet standard allows key-value metadata to be read from and written to ColumnChunkMetaData, but Parquet C++ provides no way to do so.
### What changes are included in this PR?
Support reading/writing key-value metadata from/to ColumnChunkMetaData with Parquet C++ reader/writer. Support reading key-value metadata from ColumnChunkMetaData with pyarrow.parquet.
### Are these changes tested?
Yes, unit tests are added.
### Are there any user-facing changes?
Yes.
- Users can read or write key-value metadata for column chunks with Parquet C++.
- Users can read key-value metadata for column chunks with PyArrow.
- The `parquet-reader` tool prints key-value metadata in column chunks when the `--print-key-value-metadata` option is used.
* GitHub Issue: #41579
Lead-authored-by: Chungmin Lee <[email protected]>
Co-authored-by: mwish <[email protected]>
Signed-off-by: mwish <[email protected]>
### Describe the enhancement requested
According to the Parquet standard, `ColumnMetaData` can have `key_value_metadata`, which is a list of `KeyValue` entries.
https://parquet.apache.org/docs/file-format/metadata/
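For illustration, the shape of that field can be sketched in plain Python. This is only a model of the structure described in the format specification (a `KeyValue` has a required `key` and an optional `value`), not the Parquet C++ or PyArrow API; the names `ColumnMetaDataSketch` and `to_mapping` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class KeyValue:
    # Models the format's KeyValue struct: key is required, value is optional.
    key: str
    value: Optional[str] = None


@dataclass
class ColumnMetaDataSketch:
    # Hypothetical stand-in that keeps only the fields relevant here.
    path_in_schema: List[str]
    key_value_metadata: Optional[List[KeyValue]] = None


def to_mapping(meta: ColumnMetaDataSketch) -> Optional[Dict[str, Optional[str]]]:
    """Flatten the KeyValue list into a dict.

    Returns None when the optional field is absent. If a key repeats,
    the last occurrence wins (an assumption made for this sketch).
    """
    if meta.key_value_metadata is None:
        return None
    return {kv.key: kv.value for kv in meta.key_value_metadata}


# Example: one entry with a value, one entry with the value omitted.
chunk = ColumnMetaDataSketch(
    path_in_schema=["a"],
    key_value_metadata=[KeyValue("unit", "ms"), KeyValue("flag")],
)
mapping = to_mapping(chunk)
```

Because the field itself is optional, the sketch distinguishes "no metadata present" (`None`) from "metadata present but empty" (`{}`); a reader API would likely need to make the same distinction.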
### Component(s)
C++