forked from facebookincubator/velox
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Summary: facebookincubator#8920 Pushdown of random sampling into file readers. Changes include: 1. Recognize random sampling from remaining filter and extract sample rate from it. 2. Construct the stateful `RandomSkipTracker` that converts Bernoulli distribution into geometric distribution. 3. For DWRF, we add extra stripe level skipping, to avoid loading huge stripe level metadata with flat map columns. 4. In struct column reader, we generate the row numbers from geometric distribution directly, instead of doing Bernoulli trial for every row. The majority of the benefit so far comes from (avoid) reading flat map columns. For a query that was using more than 33.85 CPU days (then time out), we can now finish it using 6.49 CPU days (Presto Java is using 15.96 days). Differential Revision: D54331932
- Loading branch information
1 parent
93b2544
commit 7e9ea67
Showing
29 changed files
with
508 additions
and
115 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.