-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spilled data size in CLI and WebUI #426
base: cw/spill-space-manager/2
Are you sure you want to change the base?
Conversation
@pnowojski I think we should remove the Driver and Pipeline level spill stats collecting and maintaining. They're not used anywhere. |
38cc4d1
to
bc88476
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Generally looks good, however is there no tests for explain analyze output?
@@ -323,6 +331,11 @@ private static String formatFragment(Metadata metadata, Session session, PlanFra | |||
return builder.toString(); | |||
} | |||
|
|||
private static boolean isNonZero(DataSize dataSize) | |||
{ | |||
return dataSize != null && dataSize.getValue() != 0; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why it can be null?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it can't, it was a reflex to implement a static helper method as null safe, I can change that if you want
bc88476
to
8a47a00
Compare
Add tracking of spilled data size on various levels (from operator to query) and expose it in JSON.
Data size added to EXPLAIN ANALYZE output, at Fragment and Operator levels. This information is added to the "Cost" line and displayed only if the value is non-zero.
When CLI is started with --debug then spilled data size for the entire query is being displayed (both in the running total, and the final summary). The information is added in a separate line below "Parallelism".
8a47a00
to
c6747df
Compare
@ilfrin for completeness, let's print total data spilled in the Presto UI on the query details page under Resource Utilization Summary (next to the CPU & Memory usage) |
lgtm |
Added spilled data size to the Query Details page in the Resource Utilization Summary section and to the Live Plan page in the Web UI just below the "Splits" line (per stage).
c6747df
to
7c1d6d5
Compare
@KBP-TDC done |
@pnowojski @sopel39