Commit
PARQUET-2352: Allow truncation of row group min_values/max_value statistics (#216)

This updates the spec to allow truncation of row group min_value/max_value statistics so that readers can use row group pruning for predicates on columns containing long strings. PARQUET-1685 (https://issues.apache.org/jira/browse/PARQUET-1685) already introduced a feature in parquet-mr that lets users deviate from the current spec and configure truncation of row group statistics.

This change also adds is_max_value_exact/is_min_value_exact so that writers can indicate whether max_value/min_value are the actual maximum and minimum values found in the column chunk.
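As an illustration of why truncated bounds still permit safe pruning, here is a minimal Python sketch. It is not taken from the spec or from parquet-mr: the helper names `truncate_min`, `truncate_max`, and `may_contain` are hypothetical, and the sketch assumes unsigned byte-wise ordering on raw bytes and ignores UTF-8 character boundaries, which a real writer would likely need to respect. Each helper returns the bound together with a flag that plays the role of is_min_value_exact/is_max_value_exact.

```python
from typing import Tuple


def truncate_min(value: bytes, length: int) -> Tuple[bytes, bool]:
    """Return (lower_bound, is_exact) for a column chunk minimum.

    A prefix always sorts less than or equal to the full value,
    so plain truncation yields a valid lower bound.
    """
    if len(value) <= length:
        return value, True
    return value[:length], False


def truncate_max(value: bytes, length: int) -> Tuple[bytes, bool]:
    """Return (upper_bound, is_exact) for a column chunk maximum.

    A plain prefix would sort below the real maximum, so the last
    byte of the prefix is incremented to obtain a valid upper bound.
    """
    if len(value) <= length:
        return value, True
    prefix = bytearray(value[:length])
    # Trailing 0xFF bytes cannot be incremented; drop them first.
    while prefix and prefix[-1] == 0xFF:
        prefix.pop()
    if not prefix:
        # No shorter upper bound exists; keep the exact value.
        return value, True
    prefix[-1] += 1
    return bytes(prefix), False


def may_contain(target: bytes, min_value: bytes, max_value: bytes) -> bool:
    """Reader-side check: a row group can be skipped when this is False."""
    return min_value <= target <= max_value


lo, lo_exact = truncate_min(b"aardvark and friends", 4)
hi, hi_exact = truncate_max(b"zebra crossing", 3)
print(lo, lo_exact, hi, hi_exact)
print(may_contain(b"mango", lo, hi), may_contain(b"zed", lo, hi))
```

Because the truncated bounds are only ever wider than the true range, a reader may keep a row group it could have skipped, but it never skips one that holds matching rows. The exactness flags matter for readers that want to use the statistics as actual values rather than as bounds.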