[FEA] Add libcudf any() and all() reductions #1874

harrism · 2019-05-29T00:38:16Z

Is your feature request related to a problem? Please describe.

Currently cuDF Python implements any() and all() for a column by calling libcudf's min() and max() (see NVIDIA/thrust#1863). libcudf should provide any() and all() instead. They can be made faster and also give us better control over semantics.

Describe the solution you'd like
It would be faster to provide an optimized any() and all() that first check that the values are non-zero to get a boolean per element and then use bitwise reductions in CUDA for a faster reduction.

Describe alternatives you've considered
It currently works, so this is just a potential optimization.

harrism · 2019-05-29T00:38:51Z

@kovaltan FYI in case you have thoughts on this.

kovaltan · 2019-05-29T03:30:28Z

I agree to have any and all function in cudf for better control over semantics.
But I suspect about the potential optimization.

It would be faster to provide an optimized any() and all() that first check that the values are non-zero to get a boolean per element and then use bitwise reductions in CUDA for a faster reduction.

The current implementation of reduction has already output dtype option.
For non boolean column, if we set output_dtype = bool8 and do reduction min (for all), or max (for any),
the reducntion kernel cast a element into cudf::bool and then do reduction for rows.
The current implementation uses 1 byte atomicMin for cudf::bool reduction, there would be a room for optimization.
But in future, I'm thinking to re-implement reduction using null supported iterator given by PR NVIDIA/thrust#1833, then the implementation will use cub and it would be well optimised.

Btw, I've found a bug about all and any of integer column, and fixed it, I will file it later.

harrism · 2019-05-30T04:37:44Z

In my experience even CUB doesn't have an optimized 1-bit-per-boolean reduction (i.e. using __popc() or __syncthreads_and()/__syncthreads_or().

rgsl888prabhu · 2019-10-14T15:51:54Z

In my experience even CUB doesn't have an optimized 1-bit-per-boolean reduction (i.e. using __popc() or __syncthreads_and()/__syncthreads_or().

@harrism Why not use thrust::all_of and thrust::any_of ?

jrhemstad · 2019-10-14T15:58:21Z

In my experience even CUB doesn't have an optimized 1-bit-per-boolean reduction (i.e. using __popc() or __syncthreads_and()/__syncthreads_or().

@harrism Why not use thrust::all_of and thrust::any_of ?

thrust::all_of/any_of are slow and should not be used. See: https://github.com/thrust/thrust/issues/1016

karthikeyann · 2019-10-16T20:02:21Z

faster early out for any_of and all_of
https://github.com/thrust/thrust/issues/1016#issuecomment-542867313

jrhemstad · 2019-10-16T20:16:54Z

faster early out for any_of and all_of
thrust/thrust#1016 (comment)

@karthikeyann Your results in that issue are fishy/impossible.

karthikeyann · 2019-10-16T20:29:22Z

Yes. looked into it. it had a bug. updating it now.

harrism added feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. labels May 29, 2019

kovaltan mentioned this issue May 29, 2019

[REVIEW] Bug fix: add dtype for any, all #1876

Merged

rgsl888prabhu self-assigned this Oct 7, 2019

rgsl888prabhu mentioned this issue Oct 15, 2019

[REVIEW] Adding any and all support from libcudf #3094

Merged

shwina closed this as completed in #3094 Oct 16, 2019

karthikeyann reopened this Oct 16, 2019

karthikeyann self-assigned this Oct 16, 2019

karthikeyann closed this as completed Oct 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Add libcudf any() and all() reductions #1874

[FEA] Add libcudf any() and all() reductions #1874

harrism commented May 29, 2019 •

edited

Loading

harrism commented May 29, 2019

kovaltan commented May 29, 2019

harrism commented May 30, 2019 •

edited

Loading

rgsl888prabhu commented Oct 14, 2019 •

edited

Loading

jrhemstad commented Oct 14, 2019

karthikeyann commented Oct 16, 2019

jrhemstad commented Oct 16, 2019

karthikeyann commented Oct 16, 2019

[FEA] Add libcudf any() and all() reductions #1874

[FEA] Add libcudf any() and all() reductions #1874

Comments

harrism commented May 29, 2019 • edited Loading

harrism commented May 29, 2019

kovaltan commented May 29, 2019

harrism commented May 30, 2019 • edited Loading

rgsl888prabhu commented Oct 14, 2019 • edited Loading

jrhemstad commented Oct 14, 2019

karthikeyann commented Oct 16, 2019

jrhemstad commented Oct 16, 2019

karthikeyann commented Oct 16, 2019

harrism commented May 29, 2019 •

edited

Loading

harrism commented May 30, 2019 •

edited

Loading

rgsl888prabhu commented Oct 14, 2019 •

edited

Loading