-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Aggregate push down for incremental scan #10538
Conversation
In this PR, I have basically extracted the code for building an |
cc @szehon-ho Could you please take a look when you have a moment? Thanks a lot! |
|
||
assertThat(explain1).contains("LocalTableScan", "min(data)", "max(data)", "count(data)"); | ||
|
||
Dataset<Row> noPushdownResult = |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Naming nit: Could we suffix the Dataset variables with ds
so this would be noPushdownDs
and the other one would be pushdownDs
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also I don't think there's too much additional value in verifying the noPushdown case here. I think the test should just enable agg pushdown, and explicitly verify the expected results. But not super opinionated.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @amogh-jahagirdar for the review! I have changed the variable names and removed the noPushdown cases.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @huaxingao the change looks great to me. I'll wait in case @szehon-ho or others have comments before merging.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm, thanks!
Merged, thanks @huaxingao and @amogh-jahagirdar for review, @RussellSpitzer for original review |
Thanks @amogh-jahagirdar, @szehon-ho, and of course, @RussellSpitzer |
I will have a follow-up PR to port the changes to Spark 3.4. @szehon-ho |
Enable Aggregate push down for incremental scan. Here is the original PR