Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support all the codecs supported by Avro #7718

Merged
merged 2 commits into from
Oct 2, 2023

Conversation

sarutak
Copy link
Member

@sarutak sarutak commented Oct 1, 2023

Which issue does this PR close?

Closes #7717

Rationale for this change

Avro supports deflate, snappy, bzip2, xz and zstandard but currently Datafusion only enables snappy.

What changes are included in this PR?

Change common/Cargo.toml to enable the codecs supported by Avro.

Are these changes tested?

Added new tests.

Are there any user-facing changes?

Yes. But this change doesn't break compatibility.

@sarutak
Copy link
Member Author

sarutak commented Oct 1, 2023

Waiting for the PR in arrow-testing merged.

@sarutak sarutak changed the title Support all the codes supported by Avro Support all the codecs supported by Avro Oct 1, 2023
alamb added a commit to apache/arrow-testing that referenced this pull request Oct 1, 2023
This PR proposes to add xz, zstd, bzip2 and snappy variant of
`alltypes_plain.avro`.
This change is necessary for [this
PR](apache/datafusion#7718).

The contents is the same as existing `alltypes_plain.avro`.
The content represented as JSON is as follows.
```
{"bigint_col":0,"bool_col":true,"date_string_col":[48,51,47,48,49,47,48,57],"double_col":0.0,"float_col":0.0,"id":4,"int_col":0,"smallint_col":0,"string_col":[48],"timestamp_col":1235865600000000,"tinyint_col":0}
{"bigint_col":10,"bool_col":false,"date_string_col":[48,51,47,48,49,47,48,57],"double_col":10.1,"float_col":1.100000023841858,"id":5,"int_col":1,"smallint_col":1,"string_col":[49],"timestamp_col":1235865660000000,"tinyint_col":1}
{"bigint_col":0,"bool_col":true,"date_string_col":[48,52,47,48,49,47,48,57],"double_col":0.0,"float_col":0.0,"id":6,"int_col":0,"smallint_col":0,"string_col":[48],"timestamp_col":1238544000000000,"tinyint_col":0}
{"bigint_col":10,"bool_col":false,"date_string_col":[48,52,47,48,49,47,48,57],"double_col":10.1,"float_col":1.100000023841858,"id":7,"int_col":1,"smallint_col":1,"string_col":[49],"timestamp_col":1238544060000000,"tinyint_col":1}
{"bigint_col":0,"bool_col":true,"date_string_col":[48,50,47,48,49,47,48,57],"double_col":0.0,"float_col":0.0,"id":2,"int_col":0,"smallint_col":0,"string_col":[48],"timestamp_col":1233446400000000,"tinyint_col":0}
{"bigint_col":10,"bool_col":false,"date_string_col":[48,50,47,48,49,47,48,57],"double_col":10.1,"float_col":1.100000023841858,"id":3,"int_col":1,"smallint_col":1,"string_col":[49],"timestamp_col":1233446460000000,"tinyint_col":1}
{"bigint_col":0,"bool_col":true,"date_string_col":[48,49,47,48,49,47,48,57],"double_col":0.0,"float_col":0.0,"id":0,"int_col":0,"smallint_col":0,"string_col":[48],"timestamp_col":1230768000000000,"tinyint_col":0}
{"bigint_col":10,"bool_col":false,"date_string_col":[48,49,47,48,49,47,48,57],"double_col":10.1,"float_col":1.100000023841858,"id":1,"int_col":1,"smallint_col":1,"string_col":[49],"timestamp_col":1230768060000000,"tinyint_col":1}
```
@sarutak sarutak marked this pull request as ready for review October 1, 2023 14:19
@Dandandan Dandandan merged commit e97ed66 into apache:main Oct 2, 2023
23 checks passed
Ted-Jiang pushed a commit to Ted-Jiang/arrow-datafusion that referenced this pull request Oct 7, 2023
* Support all the codes supported by Avro

* Update testing dir
@andygrove andygrove added the enhancement New feature or request label Oct 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request sqllogictest SQL Logic Tests (.slt)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support all the codecs supported by Avro
4 participants