-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
opt: make lookup join more expensive
This change adds a constant factor to the cost of fetching a row for a lookup join scan. This constant was picked somewhat arbitrarily and can be made more principled going forward. This change was motivated by this query: ``` SELECT count(*) FROM lineitem JOIN supplier ON l_suppkey = s_suppkey ``` This query runs an approximately 1:600 join, supplier having around 10,000 rows and lineitem having around 6,000,000. Previously a lookup-join was chosen. After this change, a merge-join is chosen, which is roughly 2x as fast. Run locally, the times for each join strategy are (recall `INNER HASH JOIN` and `INNER LOOKUP JOIN` are not commutative): ``` SELECT count(*) FROM lineitem INNER HASH JOIN supplier ON l_suppkey = s_suppkey => 4.67s => cost: 6,446,371 SELECT count(*) FROM supplier INNER HASH JOIN lineitem ON l_suppkey = s_suppkey => 4.48s => cost: 6,476,327 SELECT count(*) FROM lineitem INNER MERGE JOIN supplier ON l_suppkey = s_suppkey => 3.68s => cost: 6,431,793 SELECT count(*) FROM lineitem INNER LOOKUP JOIN supplier ON l_suppkey = s_suppkey => 10.16s => old cost: 36,765,282 => new cost: 42,746,162 SELECT count(*) FROM supplier INNER LOOKUP JOIN lineitem ON l_suppkey = s_suppkey => 6.53s => old cost: 6,330,224 => new cost: 12,311,104 ``` This was validated experimentally by running the following three parameterized queries via exprgen: Merge Join ---------- ``` (MergeJoin (Scan [ (Table "lineitem") (Cols "l_suppkey") (Index "lineitem@l_sk") (HardLimit $lineitem_rows) ] ) (Scan [ (Table "supplier") (Cols "s_suppkey") (HardLimit $supplier_rows) ] ) [ ] [ (JoinType "inner-join") (LeftEq "+l_suppkey") (RightEq "+s_suppkey") (LeftOrdering "+l_suppkey") (RightOrdering "+s_suppkey") ] ) ``` Hash Join --------- ``` (InnerJoin (Scan [ (Table "supplier") (Cols "s_suppkey") (Index "supplier@s_nk") (HardLimit $supplier_rows) ] ) (Scan [ (Table "lineitem") (Cols "l_suppkey") (Index "lineitem@l_sk") (HardLimit $lineitem_rows) ] ) [ (Eq (Var "l_suppkey") (Var "s_suppkey")) ] [ ] ) ``` Lookup Join ----------- ``` (MakeLookupJoin (Scan [ (Table "supplier") (Index "supplier@s_nk") (Cols "s_suppkey") (HardLimit $supplier_rows) ] ) [ (JoinType "inner-join") (Table "lineitem") (Index "lineitem@l_sk") (KeyCols "s_suppkey") (Cols "l_suppkey") ] [ ] ) ``` Release note (sql change): the cost-based optimizer will now pick lookup-joins less frequently.
- Loading branch information
Justin Jaffray
committed
Mar 12, 2019
1 parent
6be06f2
commit 97ad0e9
Showing
12 changed files
with
123 additions
and
77 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.