write_parquet(..., pyarrow_options={'partition_cols': [..., ]}) munges partition column #17619
Open
2 tasks done
Labels: bug (something isn't working), needs triage (awaiting prioritization by a maintainer), python (related to Python Polars)
Checks
Reproducible example
Log output
Issue description
Use of pyarrow_options={'partition_cols': [..., ]} appears to mangle the column being partitioned. As far as I can tell, the example is consistent with the documentation.
Not sure if relevant, but it appears that the partitioned column here is regarded as a large_string even though each value is a single character.
Thank you for your time and effort. Please let me know if additional information would be helpful, or if there is a suggested direction I could follow to open a PR (if one is warranted).
Expected behavior
I would expect the output directory names to correspond to the values on write, and I would expect the column values to be unchanged on read.
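The first expectation can be checked without going through a Parquet reader by listing the partition directories. This is a minimal sketch, not part of the original report. It assumes the dataset is laid out hive-style (one `column=value` directory per partition value) and that values may be percent-encoded in the directory name. The helper name `partition_values` is hypothetical.

```python
from pathlib import Path
from urllib.parse import unquote


def partition_values(root, column):
    """Return the sorted values found in hive-style `column=value` directory names.

    Assumptions (not confirmed by the report): partitions sit directly
    under `root`, and special characters in a value may be percent-encoded.
    """
    prefix = f"{column}="
    return sorted(
        unquote(entry.name[len(prefix):])   # undo any percent-encoding
        for entry in Path(root).iterdir()
        if entry.is_dir() and entry.name.startswith(prefix)
    )
```

Comparing the returned list with the sorted unique values of the partitioned column in the original frame would show whether the values were altered on write or only on read.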
Installed versions