Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: ensure DataFusion SessionState Parquet options are applied to DeltaScan #2702

Merged
merged 5 commits into from
Jul 24, 2024

Conversation

alexwilcoxson-rel
Copy link
Contributor

Description

When creating the ParquetExec plan in DeltaScanBuilder. The parquet options on the ParquetExec are left with their default values. This PR uses the SessionState on the builder to clone the ParquetOptions from.

This allows you to create your SessionContext/SessionState with additional Parquet reader options enabled (row filter pushdown, page index, rog group bloom filter pruning, etc).

Related Issue(s)

Documentation

HawaiianSpork and others added 3 commits June 20, 2024 22:35
By casting the read record batch to the delta schema datafusion can read tables where the underlying parquet files can be cast to the desired schema.
@github-actions github-actions bot added the binding/rust Issues for the Rust crate label Jul 24, 2024
@ion-elgreco ion-elgreco enabled auto-merge (squash) July 24, 2024 16:57
Copy link
Collaborator

@ion-elgreco ion-elgreco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good 👍

@ion-elgreco ion-elgreco merged commit 292e3c2 into delta-io:main Jul 24, 2024
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
binding/rust Issues for the Rust crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants