Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow Setting Column Specific Parquet Write Options via SQL StatementOptions #7442

Closed
devinjdangelo opened this issue Aug 29, 2023 · 1 comment · Fixed by #7466
Closed

Allow Setting Column Specific Parquet Write Options via SQL StatementOptions #7442

devinjdangelo opened this issue Aug 29, 2023 · 1 comment · Fixed by #7466
Labels
enhancement New feature or request

Comments

@devinjdangelo
Copy link
Contributor

Is your feature request related to a problem or challenge?

Currently StatementOptions only enables setting parquet writer options for all columns at once (the default value). In many cases, we would want to set column level encodings or other settings based on the data type and properties of that specific column.

Describe the solution you'd like

Implement a syntax to allow specifying column level parquet settings. A possible syntax for this would be:

COPY my_table
TO ./path
(format parquet,
encoding_default plain,
encoding '(colA RLE, nested.col.path.colB dictionary, ...)',
)

Describe alternatives you've considered

DuckDB supports a different syntax than described above for setting field_id for each column. See https://duckdb.org/docs/sql/statements/copy.html

Additional context

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
2 participants