Support equally comparing two different tables in the newly experimental row comparator #10667

ttnghia (Contributor) commented Apr 14, 2022

The new row comparator (#10164) only added a self_comparator utility, which compares rows within the same table. This PR adds equality::table_comparator, which allows comparing rows from two different tables for equality. The immediate use for this comparator is a linear search for a struct scalar in a structs column.

This PR also fixes a bug in the comparator that produced wrong results when the input is sliced.

This PR is required for #10548 and #10656.
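
To make the idea concrete, here is a minimal host-only C++ sketch of a two-table equality comparison used for a linear search, including the offset handling that sliced input requires. It is not the libcudf implementation: column_slice, table_slice, two_table_equality, and find_row are hypothetical names invented for illustration, the columns hold plain ints with no nulls or nesting, and the real comparator runs on the device.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a (possibly sliced) column: a non-owning window
// over a shared buffer. Not a libcudf type.
struct column_slice {
  const std::vector<int>* data;  // underlying buffer shared with the parent column
  std::size_t offset;            // position of the slice's first row in *data
  std::size_t size;              // number of rows in the slice

  // Row i of the slice lives at data[offset + i]. Reading data[i] instead
  // would silently work for unsliced input and fail for sliced input.
  int element(std::size_t i) const {
    assert(i < size);
    return (*data)[offset + i];
  }
};

// Hypothetical table: a list of columns with the same number of rows.
using table_slice = std::vector<column_slice>;

// Equality comparator over rows taken from two different tables.
struct two_table_equality {
  const table_slice& lhs;
  const table_slice& rhs;

  // True when row lhs_row of lhs equals row rhs_row of rhs in every column.
  bool operator()(std::size_t lhs_row, std::size_t rhs_row) const {
    if (lhs.size() != rhs.size()) return false;
    for (std::size_t c = 0; c < lhs.size(); ++c) {
      if (lhs[c].element(lhs_row) != rhs[c].element(rhs_row)) return false;
    }
    return true;
  }
};

// Linear search: index of the first haystack row equal to row 0 of needle,
// or -1 if there is none. The needle plays the role of the struct scalar.
std::ptrdiff_t find_row(const table_slice& haystack, const table_slice& needle) {
  if (haystack.empty() || needle.size() != haystack.size()) return -1;
  if (needle[0].size == 0) return -1;
  two_table_equality eq{haystack, needle};
  for (std::size_t r = 0; r < haystack[0].size; ++r) {
    if (eq(r, 0)) return static_cast<std::ptrdiff_t>(r);
  }
  return -1;
}
```

In this sketch, a haystack built from slices with non-zero offsets is searched by calling find_row(haystack, needle); the returned index is relative to the slice, not to the underlying buffer.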

@ttnghia ttnghia added the bug, 2 - In Progress, libcudf, Spark, improvement, and non-breaking labels Apr 14, 2022
@ttnghia ttnghia requested a review from a team as a code owner April 14, 2022 21:47
@ttnghia ttnghia self-assigned this Apr 14, 2022
ttnghia (Contributor Author) commented Apr 14, 2022

@jrhemstad @devavret The new equality comparator seems to produce incorrect results for sliced input. I still haven't figured out what's wrong. If you uncover anything, please let me know.

@ttnghia ttnghia removed the improvement label Apr 14, 2022
@ttnghia ttnghia changed the title from "Support comparing two different tables in the newly experimental row comparator" to "Support equally comparing two different tables in the newly experimental row comparator" Apr 14, 2022
jrhemstad (Contributor) commented

@ttnghia this work is actually already under way. See #10508

CC @bdice

ttnghia (Contributor Author) commented Apr 15, 2022

@bdice If you are working on something similar, please apply this patch: https://github.com/rapidsai/cudf/pull/10667/files#diff-0743b70282a92785dee8e55adc4099f501b4ed6c20c21e39fe187b2f4f0664a0L560-R562. It fixes the bug that occurs when the input column is sliced.

ttnghia (Contributor Author) commented Apr 15, 2022

In addition, I'm still not sure whether this (https://github.com/rapidsai/cudf/pull/10667/files#diff-0743b70282a92785dee8e55adc4099f501b4ed6c20c21e39fe187b2f4f0664a0R573-R574) also needs to be fixed in the same way. It needs to be tested too.

@github-actions github-actions bot added the CMake label Apr 18, 2022
This reverts commit a8a2788.
@github-actions github-actions bot removed the CMake label Apr 19, 2022
@ttnghia ttnghia closed this May 12, 2022
@ttnghia ttnghia deleted the adopt_new_row_comparator_for_struct_search branch May 13, 2022 23:32