Refactor of rolling_window implementation. #8158

nvdbaranec · 2021-05-04T19:58:24Z

This is an attempt to significantly reduce the complexity of the logic of the SFINAE and various functors/functions inside of rolling_detail.cuh. There are 2 major components:

It introduces the idea of device "rolling operators". These operators are essentially just the implementations of what were formerly the process_rolling_window() functtions. However, they provide they key mechanism for removing the complex SFINAE out of the core logic. They do this by providing their own logic that can throw for invalid aggregation/type pairs at construction time, internally.
It refactors the type and aggregation-dispatched functors to use the collector/finalize paradigm used by groupby. Specifically, the rolling operation is broken down into three parts. 1.) Preprocess incoming aggregation/type pairs, potentially transforming them into different operations. 2.) Perform the rolling window operation on the transformed inputs. 3.) Postprocess the output from the rolling rolling window operation to obtain the final result.

Combined, these two changes dramatically reduce the amount of dispatch and gpu rolling implementation code one has to read through.

The implementation of the collect list rolling operation has been moved into rolling_collect_list.cuh

There are a couple of other things worth mentioning:

Each device rolling operator implements an is_supported() constexpr function which are stripped down, type-specific versions of the old is_rolling_supported() global function. It might be possible to eliminate this with further fundamental type traits. Looking for opinions here.
is_rolling_supported() has been removed from the code, however the various tests relied on it pretty heavily. So for now I just transplanted it into the test code in a common place. It's definitely not an ideal solution, but maybe ok for now.
It might be worth moving the device rolling operators into their own module to further shrink rolling_detail.cuh. Also looking for opinions here.

…e rolling_window interface.

…ad of unique_ptrs.

…o the finalizer.

…ter the finalizer. Refactor groupby to use this mechanism. Change finalizer to not be an abstract class in the interest of keeping it extensible for rolling window and other users.

… groupby.

…g_operator functor.

cpp/src/rolling/rolling_detail.cuh

mythrocks

I think I've gotten my head around this change now. Sorry, it took a little time.

I had a couple of nitpicks, around DeviceRolling*::is_supported() and if constexpr.
I haven't fully grokked where the dictionary specific code in cudf::detail::rolling_window() might be moved. I'm looking forward to reviewing that change.

cpp/src/rolling/rolling_detail.cuh

nvdbaranec · 2021-05-20T15:01:38Z

rerun tests

cpp/src/rolling/rolling_detail.cuh

…ded a note about removing some code once is_valid_aggregation<> gets cleaned up a bit.

nvdbaranec · 2021-05-24T20:10:19Z

@gpucibot merge

Fixes the rolling-window part of #7611. All the rolling window functions return empty results when the input aggregation column is empty, just as they should. But the column types are incorrectly set to match the input type. While this is alright for `[MIN(), MAX(), LEAD(), LAG()]`, it is incorrect for some aggregations: Aggregation | Input Types | Output Type | --------------|----------------------|-----------------------------------| COUNT_VALID | All types | INT32 | COUNT_ALL | All types | INT32 | ROW_NUMBER | All types | INT32 | SUM | Numerics (e.g. INT8) | 64-bit promoted type (e.g. INT64) | SUM | Chrono | Same as input type | SUM | All else | Unsupported | MEAN | Numerics | FLOAT64 | MEAN | Chrono | FLOAT64 | MEAN | All else | Unsupported | COLLECT_LIST | All types T | LIST with child of type T | This mapping is congruent with `cudf::target_type_t` from `<cudf/detail/aggregation/aggregation.hpp>`. This commit corrects the type of the output column that results from an empty input. It adds test for all the combinations listed above. Note: This is dependent on #8158, and should be merged after that is committed. Authors: - MithunR (https://github.com/mythrocks) Approvers: - Nghia Truong (https://github.com/ttnghia) - https://github.com/nvdbaranec - Vyas Ramasubramani (https://github.com/vyasr) URL: #8274

nvdbaranec added 18 commits April 23, 2021 17:13

First pass. Aggregation hierarchy modified. Still need to apply to th…

6cf25ab

…e rolling_window interface.

Enforce rolling_aggregation types for all rolling interfaces.

5e38bef

Merge branch 'branch-0.20' into derived_aggregations

08bf0c2

Change various rolling interfaces to take aggregations directly inste…

18ba94f

…ad of unique_ptrs.

Refactor get_simple_aggregations to use the visitor pattern similar t…

6426541

…o the finalizer.

Merge branch 'branch-0.20' into derived_aggregations

80f47f9

Rough, incomplete draft. Merged with derived aggregations refactor PR.

e98a74f

Add a simple aggregations collector visitor pattern class modelled af…

6e2e3d6

…ter the finalizer. Refactor groupby to use this mechanism. Change finalizer to not be an abstract class in the interest of keeping it extensible for rolling window and other users.

Merge branch 'derived_aggregations' into rolling_refactor

ee4cd4b

Update python bindings for aggregation / rolling_aggregation changes.

bc5030a

Merge branch 'branch-0.20' into derived_aggregations

da0eea2

Python formatting.

a7f38b6

More python formatting.

000f459

Merge branch 'derived_aggregations' into rolling_refactor

2b5e620

Make the finalize() function const.

307cce9

Refactor the code to use the collector -> finalizer method similar to…

d585a9f

… groupby.

Merge branch 'derived_aggregations' into rolling_refactor

fb417c7

Documentation pass.

beab7c6

nvdbaranec added libcudf Affects libcudf (C++/CUDA) code. improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels May 4, 2021

nvdbaranec requested review from a team as code owners May 4, 2021 19:58

nvdbaranec requested review from cwharris, hyperbolic2346 and galipremsagar May 4, 2021 19:58

github-actions bot added the Python Affects Python cuDF API. label May 4, 2021

nvdbaranec requested review from jrhemstad and mythrocks May 4, 2021 19:58

nvdbaranec removed the Python Affects Python cuDF API. label May 4, 2021

nvdbaranec added 2 commits May 13, 2021 10:39

Minor review comments.

d553680

Simplify SFINAE and specialization logic using a corresponding_rollin…

9da7619

…g_operator functor.

mythrocks reviewed May 13, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Outdated Show resolved Hide resolved

mythrocks reviewed May 13, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Outdated Show resolved Hide resolved

mythrocks requested changes May 14, 2021

View reviewed changes

Grammar change.

1bad5ea

nvdbaranec requested review from hyperbolic2346 and mythrocks May 14, 2021 22:31

rwlee mentioned this pull request May 17, 2021

Add window rank and dense rank functionality #8138

Closed

hyperbolic2346 approved these changes May 17, 2021

View reviewed changes

Merge branch 'branch-21.06' into rolling_refactor

b4a5f15

harrism assigned nvdbaranec May 18, 2021

nvdbaranec requested a review from jrhemstad May 18, 2021 14:56

mythrocks reviewed May 18, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Show resolved Hide resolved

mythrocks reviewed May 18, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Show resolved Hide resolved

mythrocks mentioned this pull request May 18, 2021

Fix result column types for empty inputs to rolling window #8274

Merged

mythrocks approved these changes May 18, 2021

View reviewed changes

jrhemstad reviewed May 20, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Outdated Show resolved Hide resolved

mythrocks reviewed May 21, 2021

View reviewed changes

cpp/src/rolling/rolling_detail.cuh Outdated Show resolved Hide resolved

Add some extra parentheses to DeviceRolling::is_supported() logic. Ad…

1b22c91

…ded a note about removing some code once is_valid_aggregation<> gets cleaned up a bit.

jrhemstad approved these changes May 24, 2021

View reviewed changes

rapids-bot bot merged commit 691dd11 into rapidsai:branch-21.06 May 24, 2021

sameerz mentioned this pull request May 28, 2021

[FEA] Support rank as window function NVIDIA/spark-rapids#1584

Closed

GregoryKimball mentioned this pull request Jul 1, 2022

[BUG] cleanup template code of cpp/src/rolling/rolling.cu #5466

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor of rolling_window implementation. #8158

Refactor of rolling_window implementation. #8158

nvdbaranec commented May 4, 2021 •

edited

Loading

mythrocks left a comment

nvdbaranec commented May 20, 2021

nvdbaranec commented May 24, 2021

Refactor of rolling_window implementation. #8158

Refactor of rolling_window implementation. #8158

Conversation

nvdbaranec commented May 4, 2021 • edited Loading

mythrocks left a comment

Choose a reason for hiding this comment

nvdbaranec commented May 20, 2021

nvdbaranec commented May 24, 2021

nvdbaranec commented May 4, 2021 •

edited

Loading