`fixed_point` + `cudf::binary_operation` API Changes #7435

codereport · 2021-02-24T14:09:50Z

This resolves #7442

Recently while working with @razajafri on fixed_point binary ops, it became clear that the cudf::binary_operation is breaking the "easy to use, hard to misuse" # 1 design guideline. I knew about this but I slotted it as technical debt to be cleaned up later. Long story short, after discussions with both @razajafri, @jrhemstad and comments on the #7442, we will implement the following:

For fixed_point + cudf::binary_operation + DIV always use the cudf::data_type output_type parameter
~~For fixed_point + cudf::binary_operation + TRUE_DIV, require that the columns/scalars provided as arguments (lhs and rhs) will result in the specified data_type/scale~~
Provide a convenience function (something like binary_operation_fixed_point_scale()) that will compute the "expected" scale given two input columns/scalars and a binary_operator
Remove TRUE_DIV
Add unit tests for different output data_types
Update Python/Cython

This will be a breaking change for all fixed_point + cudf::binary_operation.

razajafri · 2021-02-25T06:11:15Z

Thanks for the quick turn around

codereport · 2021-02-25T06:19:01Z

Thanks for the quick turn around

Well this PR is going to be modified based on the proposal laid out here: #7442. Post any comments if you have them.

This reverts commit 0f528de.

This reverts commit d07c35f.

codecov · 2021-02-25T23:31:26Z

Codecov Report

Merging #7435 (43c2def) into branch-0.19 (53929eb) will increase coverage by 0.42%.
The diff coverage is 92.85%.

@@               Coverage Diff               @@
##           branch-0.19    #7435      +/-   ##
===============================================
+ Coverage        81.88%   82.30%   +0.42%     
===============================================
  Files              101      101              
  Lines            16900    17273     +373     
===============================================
+ Hits             13838    14216     +378     
+ Misses            3062     3057       -5

Impacted Files	Coverage Δ
python/cudf/cudf/core/column/decimal.py	`94.73% <90.00%> (-1.10%)`	⬇️
python/cudf/cudf/core/dtypes.py	`91.13% <100.00%> (+0.86%)`	⬆️
python/cudf/cudf/utils/dtypes.py	`89.51% <100.00%> (ø)`
python/cudf/cudf/io/feather.py	`100.00% <0.00%> (ø)`
python/cudf/cudf/comm/serialize.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/_fuzz_testing/io.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/core/column/struct.py	`100.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/_version.py	`0.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/io/tests/test_csv.py	`100.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/io/tests/test_orc.py	`100.00% <0.00%> (ø)`
... and 40 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 53929eb...43c2def. Read the comment docs.

codereport · 2021-03-08T18:04:26Z

I am not able to get a BOOL8 column when comparing Decimal columns with a scale of 0. Other scales are working as expected.

dec64_c1.binaryOp(EQUAL, dec64_c2, BOOL8) => Decimal64 column

I have tried other predicates and they are also returning Decimal64 col. Can you please check if this is a regression in this PR?

Fixed! This was an awesome catch! Thank you!

cpp/src/binaryop/binaryop.cpp

hyperbolic2346

Looks good overall here. I think this cleans things up.

cpp/src/binaryop/binaryop.cpp

cpp/tests/binaryop/binop-integration-test.cpp

python/cudf/cudf/core/dtypes.py

codereport · 2021-03-10T03:46:37Z

rerun tests

razajafri · 2021-03-12T23:35:37Z

@shwina @trxcllnt @brandon-b-miller @jrhemstad Can we get this going as my other PRs are blocked by this.

shwina · 2021-03-12T23:44:48Z

@gpucibot merge

@codereport

@codereport is making changes to the way `DIV` will behave for fixed-point types #7435. This PR contains Java changes to support those changes. Note: This is a draft until #7435 is merged Authors: - Raza Jafri (@razajafri) Approvers: - MithunR (@mythrocks) - Jason Lowe (@jlowe) - Gera Shegalov (@gerashegalov) URL: #7527

@razajafri

This resolves rapidsai#7442 Recently while working with @razajafri on `fixed_point` binary ops, it became clear that the `cudf::binary_operation` is breaking the "easy to use, **hard to misuse**" # 1 design guideline. I knew about this but I slotted it as technical debt to be cleaned up later. Long story short, after discussions with both @razajafri, @jrhemstad and comments on the rapidsai#7442, we will implement the following: * [x] For `fixed_point` + `cudf::binary_operation` + `DIV` always **use** the `cudf::data_type output_type` parameter * [x] ~~For `fixed_point` + `cudf::binary_operation` + `TRUE_DIV`, require that the columns/scalars provided as arguments (`lhs` and `rhs`) will result in the specified `data_type`/`scale`~~ * [x] Provide a convenience function (something like `binary_operation_fixed_point_scale()`) that will compute the "expected" scale given two input columns/scalars and a `binary_operator` * [x] Remove `TRUE_DIV` * [x] Add unit tests for different output data_types * [x] Update Python/Cython **This will be a breaking change for all `fixed_point` + `cudf::binary_operation`.** Authors: - Conor Hoekstra (@codereport) Approvers: - Keith Kraus (@kkraus14) - Mike Wilson (@hyperbolic2346) URL: rapidsai#7435

Initial changes for binary_v_v

d07c35f

codereport added 2 - In Progress Currently a work in progress libcudf Affects libcudf (C++/CUDA) code. tech debt improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 24, 2021

codereport self-assigned this Feb 24, 2021

Fix

0f528de

codereport mentioned this pull request Feb 24, 2021

[IMPR] fixed_point + cudf::binary_operation API Changes #7442

Closed

codereport added 3 commits February 25, 2021 13:54

Revert "Fix"

223fe40

This reverts commit 0f528de.

Revert "Initial changes for binary_v_v"

f8ec02b

This reverts commit d07c35f.

Require TRUE_DIV scale to be == to lhs - rhs scales

e0bf2c6

codereport changed the title ~~Add error for fixed_point cudf::binary_operation specified output_type parameter~~ fixed_point + cudf::binary_operation API Changes Feb 25, 2021

codereport added 8 commits February 25, 2021 22:02

Use output_type

2476eab

Remove TRUE_DIV

ba38c80

Don't hardcode BOOL

f1a62b3

Add unit tests + CUDF_EXPECTS

c0769f1

Remove dead code

65d071f

Merge branch 'branch-0.19' into binaryop-error

0c8b7e0

Fix tests

83070e8

Fix rescale

4860f22

razajafri mentioned this pull request Mar 3, 2021

Remove support for TRUE_DIV in Java bindings #7506

Closed

Python changes (incomplete)

aa2b5fe

github-actions bot added the Python Affects Python cuDF API. label Mar 4, 2021

codereport added 2 commits March 4, 2021 10:41

Python fix

e76979b

Black python formatting

0f9a043

Fix + unit test for same scale comparison op

f86f720

Use MAX_PRECISION

ff540c3

hyperbolic2346 reviewed Mar 8, 2021

View reviewed changes

cpp/src/binaryop/binaryop.cpp Show resolved Hide resolved

hyperbolic2346 requested changes Mar 8, 2021

View reviewed changes

cpp/src/binaryop/binaryop.cpp Outdated Show resolved Hide resolved

cpp/src/binaryop/binaryop.cpp Outdated Show resolved Hide resolved

cpp/src/binaryop/binaryop.cpp Outdated Show resolved Hide resolved

cpp/tests/binaryop/binop-integration-test.cpp Show resolved Hide resolved

Use more declarative ternary operator

98f4411

codereport requested a review from hyperbolic2346 March 9, 2021 00:09

Use absolute path and remove local import

8178320

codereport requested review from shwina and brandon-b-miller March 9, 2021 00:43

black / flake8 fix

c638707

codereport commented Mar 9, 2021

View reviewed changes

python/cudf/cudf/core/dtypes.py Show resolved Hide resolved

Use MAX_PRECISION

efe92a3

kkraus14 approved these changes Mar 9, 2021

View reviewed changes

codereport added 3 commits March 8, 2021 23:30

Unit tests

a1e3887

Merge branch 'branch-0.19' into binaryop-error

440bf52

Merge branch 'branch-0.19' into binaryop-error

43c2def

hyperbolic2346 approved these changes Mar 10, 2021

View reviewed changes

codereport added breaking Breaking change and removed non-breaking Non-breaking change labels Mar 11, 2021

rapids-bot bot merged commit 04f9021 into rapidsai:branch-0.19 Mar 12, 2021

vyasr removed the 4 - Needs cuDF (Python) Reviewer label Feb 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`fixed_point` + `cudf::binary_operation` API Changes #7435

`fixed_point` + `cudf::binary_operation` API Changes #7435

codereport commented Feb 24, 2021 •

edited

Loading

razajafri commented Feb 25, 2021

codereport commented Feb 25, 2021

codecov bot commented Feb 25, 2021 •

edited

Loading

codereport commented Mar 8, 2021

hyperbolic2346 left a comment

codereport commented Mar 10, 2021

razajafri commented Mar 12, 2021

shwina commented Mar 12, 2021

fixed_point + cudf::binary_operation API Changes #7435

fixed_point + cudf::binary_operation API Changes #7435

Conversation

codereport commented Feb 24, 2021 • edited Loading

razajafri commented Feb 25, 2021

codereport commented Feb 25, 2021

codecov bot commented Feb 25, 2021 • edited Loading

Codecov Report

codereport commented Mar 8, 2021

hyperbolic2346 left a comment

Choose a reason for hiding this comment

codereport commented Mar 10, 2021

razajafri commented Mar 12, 2021

shwina commented Mar 12, 2021

`fixed_point` + `cudf::binary_operation` API Changes #7435

`fixed_point` + `cudf::binary_operation` API Changes #7435

codereport commented Feb 24, 2021 •

edited

Loading

codecov bot commented Feb 25, 2021 •

edited

Loading