[BUG] Investigate multiple calls to cudf::rolling_window() from GpuWindowExec #1931

mythrocks · 2021-03-14T02:30:16Z

This was noticed tangentially, when investigating cudf/pull/7568.

The minimal test-case that reproduced the libcudf issue above consisted of a 0.5KB Parquet dataset containing 15 records across 5 groups.

grouped_rolling_window() seemed invoked once per group, instead of once for the entire column, likely in a bid to keep groups whole. If groups could be packed into larger inputs for grouped_rolling_window(), the performance should be far better.

The text was updated successfully, but these errors were encountered:

jlowe · 2022-01-13T23:46:53Z

@mythrocks Is this still relevant?

revans2 · 2022-01-14T16:45:16Z

@mythrocks Is this still relevant?

We could save some time in building the full offsets that are passed to the underlying window operations. But we have not even measure how much of the time that is taking up.

mythrocks · 2022-01-28T07:39:32Z

Sorry I missed this.
I didn't actually see a slowdown, just that there seemed to be multiple calls coming through.

I can close this issue for now, and reopen if we find a slow path here.

revans2 · 2022-01-28T14:23:07Z

I want to keep it open because we know that there is duplicate code being called. It may be small, but I want to preserve this because at some point we are going to want to go through the backlog and start fixing things there. Just unassigned yourself from this for now.

mythrocks added bug Something isn't working ? - Needs Triage Need team to review and classify labels Mar 14, 2021

mythrocks self-assigned this Mar 14, 2021

mythrocks added performance A performance related task/issue and removed ? - Needs Triage Need team to review and classify bug Something isn't working labels Mar 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Investigate multiple calls to cudf::rolling_window() from GpuWindowExec #1931

[BUG] Investigate multiple calls to cudf::rolling_window() from GpuWindowExec #1931

mythrocks commented Mar 14, 2021

jlowe commented Jan 13, 2022

revans2 commented Jan 14, 2022

mythrocks commented Jan 28, 2022

revans2 commented Jan 28, 2022

[BUG] Investigate multiple calls to cudf::rolling_window() from GpuWindowExec #1931

[BUG] Investigate multiple calls to cudf::rolling_window() from GpuWindowExec #1931

Comments

mythrocks commented Mar 14, 2021

jlowe commented Jan 13, 2022

revans2 commented Jan 14, 2022

mythrocks commented Jan 28, 2022

revans2 commented Jan 28, 2022