[C++] Add element-wise power() compute function #27714

asfimport · 2021-03-05T11:16:33Z

It would be nice to have an element-wise power() compute function.

I.e. in analogy to numpy.power().

Reporter: ARF / @ARF1

Related issues:

[C++] Arithmetic kernels for numeric arrays (is a child of)
[C++] Implement power / exponentiation compute kernel (is duplicated by)

_{Note: This issue was originally created as ARROW-11871. Please see the migration documentation for further details.}

The text was updated successfully, but these errors were encountered:

asfimport · 2021-03-08T13:39:36Z

Joris Van den Bossche / @jorisvandenbossche:
Adding a power kernel would indeed be nice.

One behavioural aspect that has come up in pandas is the question about what to do with nulls in case of power(null, 0) or power(1, null: propagate the null value (as is otherwise always done for element-wise arithmetic operations), or in this case return an actual result (1 in both cases). Reference to the pandas issue: pandas-dev/pandas#29997

asfimport · 2021-03-08T15:19:40Z

ARF / @ARF1:
@jorisvandenbossche For what it's worth, in my book null != 0. To me null has absolutely nothing to do with the value 0. To me null indicates an invalid or non-existent value and the name null is merely a (maybe unfortunate) historical artifact.

As a consequence in my opinion, the following should hold: power(null, 0) == null as well as power(1, null) == null.
I read this as: (either) one of two operands of a binary operator is invalid or missing, hence the result is invalid or missing as well.

With this convention, if a user wants a different behaviour they can always use fill_null(0) to ensure that power(fill_null(null, 0), 0) == 1 and power(1, fill_null(null, 0)) == 1. The converse is not true.

Also, I believe explicit is better than implicit...

asfimport · 2021-03-08T16:57:34Z

Joris Van den Bossche / @jorisvandenbossche:
To clarify, my comment above was not about null being regarded as 0. But rather the interpretation that null is seen as "some unknown value". And then you can argue that the result is is not unknown for power(null, 0), because power(<any value>, 0) is always 1, whatever value is passed as the first argument.

asfimport · 2021-03-08T17:27:12Z

Neal Richardson / @nealrichardson:
This is a duplicate of ARROW-11070 right?

asfimport · 2021-03-08T17:32:11Z

Joris Van den Bossche / @jorisvandenbossche:
Indeed!

asfimport · 2021-03-08T19:30:20Z

ARF / @ARF1:
@jorisvandenbossche Thanks for your explanations. I accept that in end this depends on the semantics of null. Two different programmers can legitimately understand null to mean different things. How a programmer understands null dictates the correct behaviour of power().

A programmer that understands null in an array to mean "this is a value of the defined datatype, I just don't know what it is" will expect power(<any value including any unknown value>, 0) == 1.

A programmer that understands null in an array to mean "this value does not exist, it is fundamentally invalid" will expect power(null, 0) == null.

It would seem to me the solution is to leave the choice to the user and allow her/him to specify the desired behaviour as an option to power. Then the debate becomes only "what should the default behaviour be?" ;-) In this case, I would reverse my opinion and would argue at least for pyarrow to default to the python behaviour of float('NaN')**0.0 == 1.

If arrow has to specify a unique semantic interpretation of null and cannot allow user choice, I believe however power(null, 0) == null is the better choice due to greater versatility: As I tried to explain in my previous comment, this interpretation allows to user to obtain the alternative behaviour by using power(fill_null(null, 0), 0) == 1.

Conversely if arrow standardized on power(null, 0) == 1 there is nothing the user can do to get the alternative behaviour. Once a value becomes non-null, there is no way to recover its original null-ness.

Please feel free to close this issue as a duplicate. I searched for issues relating to power and did not find ARROW-11070.

asfimport · 2021-03-10T16:25:01Z

Joris Van den Bossche / @jorisvandenbossche:
Yes, I already closed it.

It would seem to me the solution is to leave the choice to the user and allow her/him to specify the desired behaviour as an option to power.

Indeed, if there are different downstream applications that might need either behaviour, an option might be best. But so that's the main reason I brought up the issue.

asfimport closed this as completed Mar 8, 2021

This was referenced Jan 11, 2023

[C++] Implement power / exponentiation compute kernel #26983

Closed

[C++] Arithmetic kernels for numeric arrays #28490

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[C++] Add element-wise power() compute function #27714

[C++] Add element-wise power() compute function #27714

asfimport commented Mar 5, 2021 •

edited

Loading

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 10, 2021

[C++] Add element-wise power() compute function #27714

[C++] Add element-wise power() compute function #27714

Comments

asfimport commented Mar 5, 2021 • edited Loading

Related issues:

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 8, 2021

asfimport commented Mar 10, 2021

asfimport commented Mar 5, 2021 •

edited

Loading