[EPIC] A list of performance improvement tickets #5546
Comments
Thanks @jaylmiller!
I don't think #846 is being worked on, but given that the GroupByHash now uses the row format, I am not sure how relevant it is. Please do feel free to comment on any ticket that is interesting -- no need to have it assigned to work on something! Thanks for all the help so far on making Sort faster.
Sounds good! #846 was kind of arbitrary to be honest 😅, I'll read thru them more closely and pick one that seems interesting.
Awesome -- thanks @jaylmiller. I think in general the "make aggregation faster" #4973 and high cardinality groups #5547 are the most pressing things from a performance perspective. However, they are also the ones with the most active thought / work on them, so they probably need some more coordination, which you may or may not be interested in doing.
This has a list of performance improvements:
Aggregator, grouping, aggregation #4973
FilterExec to avoid unnecessary output creation #5436
HashJoinExec to avoid unnecessary output creation #6768
CoalesceBatchesExec to improve performance #7957