feat: add z-order optimize #1429
Conversation
Took the liberty of resolving the merge conflicts I caused :)
Very nice ❤️ - folks are going to be quite excited about this ...
Left a few minor comments, mainly to also better understand the inner workings :).
rust/src/operations/optimize.rs
Outdated
read_stream: impl Future<
        Output = Result<
            BoxStream<'static, Result<RecordBatch, ParquetError>>,
            DeltaTableError,
        >,
    > + Send
    + 'static,
nit: should this be defined as a separate type?
LMK what you think of the type I added.
rust/src/operations/optimize.rs
Outdated
@@ -414,6 +439,56 @@ impl MergePlan {
        Ok((partial_actions, partial_metrics))
    }

    /// Creates a stream of batches that are zordered
    ///
    /// Currently requires loading all the data into memory.
Sounds a bit like loading all data from the table. For a partitioned table, this would be loading a single partition?
Yeah, if it's non-partitioned, it's everything. If it's partitioned, then only max_concurrent_tasks partitions are loaded at once.
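The idea of capping memory by capping concurrency can be sketched with scoped threads. This is only an illustration (the PR itself works with async streams, and process_in_waves is a hypothetical name, not code from the PR): at most max_concurrent_tasks partitions are in flight, and therefore in memory, at a time.

```rust
use std::thread;

// Hypothetical sketch: process partitions in waves of at most
// `max_concurrent_tasks`, so only that many are resident at once.
// The actual PR uses async streams with bounded concurrency instead.
fn process_in_waves<T, R>(
    partitions: Vec<T>,
    max_concurrent_tasks: usize,
    f: impl Fn(T) -> R + Sync,
) -> Vec<R>
where
    T: Send,
    R: Send,
{
    let mut results = Vec::with_capacity(partitions.len());
    let mut iter = partitions.into_iter();
    loop {
        // Take the next wave of at most `max_concurrent_tasks` partitions.
        let wave: Vec<T> = iter.by_ref().take(max_concurrent_tasks).collect();
        if wave.is_empty() {
            break;
        }
        thread::scope(|s| {
            let f = &f;
            let handles: Vec<_> = wave
                .into_iter()
                .map(|p| s.spawn(move || f(p)))
                .collect();
            // Joining in spawn order keeps results in input order.
            for handle in handles {
                results.push(handle.join().unwrap());
            }
        });
    }
    results
}

fn main() {
    let doubled = process_in_waves(vec![1, 2, 3, 4, 5], 2, |x| x * 2);
    assert_eq!(doubled, vec![2, 4, 6, 8, 10]);
}
```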
Added a comment for this.
rust/src/operations/optimize.rs
Outdated
        .as_slice(),
    )
    .unwrap();
    let indices = arrow::compute::lexsort_to_indices(
I may be completely off here, but we may just get away with sorting an enumerate()'ed vector of values here. IIUC, we should not have null values at this point, since this is already the encoded array, and the byte values can be compared using the native methods. If that is true, we could avoid the non-trivial internals of the lexsort_to_indices function.
I guess the optimizations on the CPU level mentioned in the arrow row blog posts should take effect either way, since we are comparing bytes nonetheless. So I leave it to your expertise whether this is worthwhile :)
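The suggestion amounts to a plain argsort over the already-encoded byte values. A minimal sketch (argsort_bytes is a hypothetical name; in the PR the values are arrow Row encodings, which are designed to sort correctly under byte comparison):

```rust
// Hypothetical sketch of the suggestion above: instead of
// arrow::compute::lexsort_to_indices, sort an enumerated vector of
// indices by the encoded byte values, which compare correctly as
// plain byte slices.
fn argsort_bytes(values: &[&[u8]]) -> Vec<usize> {
    let mut indices: Vec<usize> = (0..values.len()).collect();
    // Stable sort preserves the original order of equal keys.
    indices.sort_by(|&a, &b| values[a].cmp(values[b]));
    indices
}

fn main() {
    let rows: Vec<&[u8]> = vec![&b"bb"[..], &b"aa"[..], &b"ab"[..]];
    assert_eq!(argsort_bytes(&rows), vec![1, 2, 0]);
}
```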
Oh I think you are right! I'll give that a shot.
Nicely simplified :)
Great Work!
Description
Implements Z-order in Rust. This is a very basic version that requires loading the whole partition into memory for sorting. In the future, we can implement a DataFusion-based code path that allows sorting with spilling to disk for even larger optimize jobs.
The Z-order function here is based on Arrow's row format. We truncate to take the first 16 bytes of data (padding with zeros at the end as necessary). So for variable-width columns like strings, this means that we are only using the first 15 bytes of the string (the first byte is used to differentiate null and empty strings, see the row format docs). If a user has a string column where all values share the same prefix, this z-order function won't work well for them. But in many common cases it will work.
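A rough sketch of the key construction described above, under stated assumptions: the function names are hypothetical, and whether the actual implementation interleaves bits or bytes of the row encoding is not covered here; this sketch interleaves bytes for simplicity. Each column's row-encoded value is truncated (or zero-padded) to 16 bytes, then the columns are interleaved so all of them contribute to the most significant positions of the sort key.

```rust
const KEY_LEN: usize = 16;

// Truncate or zero-pad a row-encoded value to a fixed 16-byte key.
fn truncated_key(encoded: &[u8]) -> [u8; KEY_LEN] {
    let mut key = [0u8; KEY_LEN];
    let n = encoded.len().min(KEY_LEN);
    key[..n].copy_from_slice(&encoded[..n]);
    key
}

// Hypothetical sketch: interleave the truncated per-column keys one
// byte at a time so every column contributes equally to the high-order
// positions of the combined z-order key.
fn zorder_key(columns: &[&[u8]]) -> Vec<u8> {
    let keys: Vec<[u8; KEY_LEN]> =
        columns.iter().map(|c| truncated_key(c)).collect();
    let mut out = Vec::with_capacity(KEY_LEN * keys.len());
    for i in 0..KEY_LEN {
        for key in &keys {
            out.push(key[i]);
        }
    }
    out
}

fn main() {
    let key = zorder_key(&[&[0xAA; 16], &[0xBB; 16]]);
    assert_eq!(key.len(), 32);
    // Bytes alternate between the two columns.
    assert_eq!(&key[..4], &[0xAA, 0xBB, 0xAA, 0xBB]);
}
```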
We'll also expose this in Python as a follow-up.
Related Issue(s)
Documentation