Z-order is no-op for strings with identical prefix of length >= 14 #2844
Labels
binding/python
Issues for the Python package
binding/rust
Issues for the Rust crate
bug
Something isn't working
Environment
Delta-rs version:
0.19.1
Binding:
Python and Rust
Environment:
Bug
What happened:
Apply z-order to a Delta Table on a column that contains strings with identical prefixes of at least 14 characters. The records in the new Parquet files retain their original order.
I initially witnessed this when z-ordering a large partition on ISO 8601 timestamps using delta-rs in Rust. I've since reproduced this with Python bindings and a small data frame using strings containing zero-padded integers (see repro below).
What you expected to happen:
The resulting Parquet files are ordered by the column specified for z-ordering.
How to reproduce it:
Run this with uv:
Output:
More details:
N/A
The text was updated successfully, but these errors were encountered: