Faster, more correct complex^complex #24570
Conversation
Any chance of getting nice error plots to make sure that error stays below x ulps?
function _cpow(z::Union{T,Complex{T}}, p::Union{T,Complex{T}}) where {T<:AbstractFloat}
    if isreal(p)
        pᵣ = real(p)
        if isinteger(pᵣ) && abs(pᵣ) < typemax(Int32)
Maybe write it as exp2(31) instead? It is jarring to see integer types in here.
But typemax(Int32) is literally what we want: it prevents overflow on 32-bit machines and gives consistent behavior on 64-bit. The whole point of this if statement is to convert to an integer type.
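For concreteness, here is a minimal sketch (an assumed simplification with a hypothetical name, not the actual code in this PR) of what that branch does: a real, integer-valued exponent whose magnitude is below the Int32 bound is converted to a machine Int and dispatched to integer powering, and the bound keeps that conversion safe even on 32-bit platforms.

```julia
# Sketch only: cpow_int_sketch is a hypothetical helper, not a function in Base.
function cpow_int_sketch(z::Complex{Float64}, pᵣ::Float64)
    if isinteger(pᵣ) && abs(pᵣ) < typemax(Int32)
        n = Int(pᵣ)                       # safe on 32- and 64-bit: |pᵣ| < 2^31
        return n >= 0 ? z^n : inv(z^(-n)) # integer-power fast path
    else
        return exp(pᵣ * log(z))           # general real-exponent fallback
    end
end
```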
@oscardssmith, some plots of the old and new relative errors: [error-plot images]
The main trick is to get the edge cases (especially the sign of zero imaginary parts) correct, especially when you want to take advantage of real z and/or p.
base/complex.jl (outdated)
            end
        end
    elseif isreal(z)
        iszero(z) && return real(p) > 0 ? complex(z) : Complex(T(NaN),T(NaN)) # 0 or NaN+NaN*im
@StefanKarpinski, is the policy these days to throw DomainError rather than constructing NaN explicitly, if it is practical to do so? inv(0.0+0.0im) (and hence (0.0+0.0im)^-1) doesn't throw an error, but I guess that is because it would slow it down too much to check for this?
I'm a little confused because it doesn't seem like #5234 was ever resolved.
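For reference, the zero-base special case in the quoted line amounts to the following semantics (a sketch with a hypothetical helper name, not a paste from base/complex.jl; the real code returns complex(z) so the signed zero of the input is preserved):

```julia
# Sketch: 0^p is 0 when real(p) > 0, and NaN + NaN*im otherwise,
# rather than throwing a DomainError.
zero_base_pow(p::Complex{Float64}) =
    real(p) > 0 ? complex(0.0, 0.0) : Complex(NaN, NaN)

zero_base_pow( 2.0 + 1.0im)  # 0.0 + 0.0im
zero_base_pow(-1.0 + 0.0im)  # NaN + NaN*im
```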
What are the white points in the plots?
@giordano, the white points are where the error is "zero" (i.e. the result, by luck, is exactly rounded), so the log of the error is –∞. Note that the error is unlikely to be actually zero for the white points; it is just < 1 ulp, so probably they should be bluish or greenish. Here are the plots re-done with the true error (i.e. I compute the error and then round from BigFloat to Float64, rather than the other way around), making sure the color scales match: [revised error-plot images]
Uhm, to me the current implementation seems to have more white points; on the other hand, the proposed one looks more bluish, which is good anyway.
If I change this to throw exceptions rather than creating NaNs from non-NaN inputs, that will technically be a breaking change. Again, I'm not sure what the desired behavior is these days?
By the way, one argument against throwing … Also, … Are we okay with throwing …?
After sleeping on it, I've decided to revert the breaking change of throwing DomainError.
We don't have a general rule that getting NaNs from non-NaN inputs should be a DomainError. That pattern was introduced just for a couple functions where that check happens to correctly identify DomainError cases. |
CI looks good except AppVeyor, where the error …
Just to be sure: does this mean that you are converting the Float64 result to BigFloat, subtracting a reference BigFloat result, dividing by (?), and converting back to Float64?
Essentially yes. To compute the relative error for the …
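In case it helps, here is one way to do that kind of comparison (my own sketch of the methodology being described, with a hypothetical helper name, not the author's actual script): compute the result under test in Float64, compute a reference in BigFloat, form the relative error in BigFloat, and only then round the error itself to Float64.

```julia
# Sketch: relerr is a hypothetical helper, not part of this PR.
setprecision(BigFloat, 256)

function relerr(z::Complex{Float64}, p::Complex{Float64})
    approx = z^p                                        # Float64 result under test
    exact  = Complex{BigFloat}(z)^Complex{BigFloat}(p)  # high-precision reference
    err    = abs(Complex{BigFloat}(approx) - exact) / abs(exact)
    return Float64(err)                                 # round the error, not the reference
end

relerr(1.0 + 2.0im, 0.5 - 0.25im)
```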
Should be good to merge?
@nanosoldier
Is nanosoldier working? It's been "running on node 3" for a day now.
The only thing we can conclude is that this PR brings some serious performance regressions ;)
Bump. Seems like there is no point in running nanosoldier right now...
@nanosoldier
Looks like nanosoldier is not working; seems like an unrelated …
Okay, not sure yet what the deal is. In the meantime you can benchmark manually with BenchmarkTools, since the macros work on 0.7 again now.
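For example, a quick manual check along those lines (the inputs here are illustrative choices of mine, not taken from the PR):

```julia
using BenchmarkTools  # assumes the package is installed

z = 1.0 + 2.0im
p = 0.5 - 0.25im

@btime $z ^ $p    # complex^complex
@btime 1.5 ^ $p   # real^complex
```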
Let's see if nanosoldier works now: @nanosoldier (Though it has been so noisy recently, even when it works, that the results have been hard to make use of.)
@ararslan, what's up with nanosoldier?
It's been acting up a bit lately. I think in this case I restarted the server after you had triggered a run, which means that the status for that run didn't get updated. I'll try retriggering. @nanosoldier
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan
The nanosoldier regressions look like noise. (It doesn't look like there are any benchmarks in BaseBenchmarks that exercise complex powers.)
Seems ready to merge?
Yay, CI is green.
This PR fixes the incorrect behaviors for complex^complex identified in #24515, and it also makes the code significantly faster without (as far as I can tell) sacrificing accuracy. On my machine, it is around 60% faster for complex^complex and 120% faster for real^complex in double precision.

The old code had two completely separate implementations, one for floating-point types and one for other types, despite the fact that both produced floating-point results. The floating-point version was based on z^p = exp(p * log(z)), whereas the other version first converted z to polar form and then exponentiated. The latter approach seems to be significantly faster and no less accurate, so I now use that in all cases (with various special-case optimizations for real z and/or p). By unifying the implementations, the code is also significantly shorter.

Originally marked as breaking only because this threw exceptions in some cases where the old code would have silently returned NaNs; that change was reverted, and it returns NaN as before.

See also the discussions in #2891, #3246, 06530b6.
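As a rough illustration of the polar-form approach described above (a sketch only, with a hypothetical name; the actual unified implementation in this PR also handles many special cases for real z and/or p and for the signs of zero imaginary parts):

```julia
# Write z = r*cis(θ) and p = a + b*im; then
#   z^p = exp(p*log(z)) = r^a * exp(-b*θ) * cis(a*θ + b*log(r))
function cpow_polar_sketch(z::Complex{Float64}, p::Complex{Float64})
    r, θ = abs(z), angle(z)
    a, b = real(p), imag(p)
    ρ = r^a * exp(-b*θ)    # magnitude of the result
    ϕ = a*θ + b*log(r)     # argument of the result
    return ρ * cis(ϕ)
end
```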
To do:
- DomainError for (0.0+0.0im)^-1.0 and other cases, a la "NaN vs wild (or, what's a DomainError, really?)" #5234