Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] fixed_length_byte_array.parquet loads corrupted data instead of error #9236

Open
jlowe opened this issue Sep 13, 2023 · 0 comments
Open
Labels
bug Something isn't working cudf_dependency An issue or PR with this label depends on a new feature in cudf reliability Features to improve reliability or bugs that severly impact the reliability of the plugin

Comments

@jlowe
Copy link
Member

jlowe commented Sep 13, 2023

Related to #9233 and #9235. Loading https://github.com/apache/parquet-testing/blob/master/data/fixed_length_byte_array.parquet produces corrupted data rather than an error. Spark 3.5, parquet-mr 1.13.1, and pyarrow 13 all report an unexpected EOF error on this file. When rapidsai/cudf#14104 is fixed, we should update parquet_testing_test.py accordingly.

@jlowe jlowe added bug Something isn't working ? - Needs Triage Need team to review and classify cudf_dependency An issue or PR with this label depends on a new feature in cudf labels Sep 13, 2023
@mattahrens mattahrens added reliability Features to improve reliability or bugs that severly impact the reliability of the plugin and removed ? - Needs Triage Need team to review and classify labels Sep 19, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working cudf_dependency An issue or PR with this label depends on a new feature in cudf reliability Features to improve reliability or bugs that severly impact the reliability of the plugin
Projects
None yet
Development

No branches or pull requests

2 participants