Get rid of special-casing of ranges in == and isequal() for AbstractArrays #16364

nalimilan · 2016-05-14T08:56:22Z

The AbstractArray default == operator special-cases ranges:

function (==)(A::AbstractArray, B::AbstractArray)
    if size(A) != size(B)
        return false
    end
    if isa(A,Range) != isa(B,Range)
        return false
    end
    for (a, b) in zip(A, B)
        if !(a == b)
            return false
        end
    end
    return true
end

isequal is defined in the same way.

This is obviously not great. A new array type is considered equal to a standard Array if all their elements are equal, but it is considered different from a range containing the same elements. This inconsistency is visible even inside Base:

julia> sparse([1,2]) == [1,2]
true

julia> [1,2] == 1:2
false

So it's not clear whether two AbstractArrays need to be of the same type or not to be ==. I can see two consistent solutions:

1. Remove the special case for ranges (i.e. return true when elements are equal).
2. Change the check isa(A,Range) != isa(B,Range) to typeof(A) !== typeof(B).

The second choice would clearly be much more disruptive than the first, and it would make == almost useless for arrays.
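
The first option can be sketched as follows. This is a minimal illustration, not the definition in Base: the name elementwise_equal is hypothetical, and the sketch assumes element comparisons return a plain Bool.

```julia
# Hypothetical helper (not Base's ==): compare shapes, then elements,
# without checking whether either argument is a range.
function elementwise_equal(A::AbstractArray, B::AbstractArray)
    size(A) == size(B) || return false
    for (a, b) in zip(A, B)
        a == b || return false
    end
    return true
end

elementwise_equal([1, 2], 1:2)  # true: same shape, same elements
elementwise_equal([1, 2], 1:3)  # false: shapes differ
```

With this definition, the result depends only on shape and contents, never on the container type.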

timholy · 2016-05-14T09:00:23Z

#13565

nalimilan · 2016-05-14T09:09:47Z

Ah, funny how I keep bumping into this issue without remembering... :-)

Anyway, should we reopen one of the older issues about hashing ranges? This looks like a significant inconsistency in the language to me (and equality is already complex to master without it...).

timholy · 2016-05-14T09:38:09Z

The concern is that it's an unsolvable problem, in which case reopening the issue won't help.

nalimilan · 2016-05-14T09:53:46Z

Well, a few ideas were advanced here: #12226 (comment)

It would be really sad if == didn't give the expected result just to ensure fast hashing of ranges. It seems to me that == is a much more common operation on ranges than hash (let alone hashing float ranges...).

timholy · 2016-05-14T09:55:40Z

👍 Skip the discussion, go straight for the PR, then 😄. EDIT: meaning, fix the hashing algorithm so you get the best of all worlds.

nalimilan · 2016-05-17T09:53:47Z

I didn't expect I would be able to do something about it, but it might be easier than I/we thought. Please comment on #16401.

StefanKarpinski · 2017-01-26T15:50:34Z

I rather suspect this is not going to happen in 0.6.

StefanKarpinski · 2017-02-02T18:55:32Z

Note that this is also not breaking – it's technically an optimization.

nalimilan · 2017-02-02T19:43:42Z

> Note that this is also not breaking – it's technically an optimization.

You mean that we should make ranges and arrays hash equal by iterating over all of their elements, and try to optimize this later?

StefanKarpinski · 2017-02-02T21:18:35Z

Ah yes, I thought that was what we were already doing, actually!

StefanKarpinski · 2017-02-02T21:18:54Z

I kind of suspect that hashing ranges is not all that common, tbh.

nalimilan · 2017-02-03T09:11:40Z

Yes, that's also my opinion. == is much more useful.

mbauman · 2017-02-15T20:15:34Z

So should we squeeze the breaking portion of this into 0.6? It's a very easy change to make — just delete some code and fix a few tests. But changing the algorithmic complexity of hash from O(1) to O(n) could be terribly breaking… if anyone happens to be hashing large ranges their program suddenly becomes (effectively) non-terminating. hash(1:typemax(Int64)) goes from taking under a microsecond to over a hundred millennia.

Or we could alternatively add a depwarn for hash of relatively large ranges.
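
To make the complexity concern concrete, here is a minimal sketch of hashing an array element by element. The name elementwise_hash is hypothetical and this is not the algorithm used in Base; it only relies on equal elements hashing equally, so an array and a range with the same contents get the same value, at the cost of visiting every element.

```julia
# Hypothetical helper (not Base's hash): fold the shape and then every
# element into the running hash value. Cost is O(n) in the number of elements.
function elementwise_hash(A::AbstractArray, h::UInt = zero(UInt))
    h = hash(size(A), h)
    for x in A
        h = hash(x, h)
    end
    return h
end

# An array and a range with the same elements hash to the same value.
elementwise_hash([1, 2, 3]) == elementwise_hash(1:3)
```

Because the loop touches each element, calling such a function on a very large range would take time proportional to its length, which is the slowdown described above.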

StefanKarpinski · 2017-02-15T21:10:33Z

I'd vote for leaving this alone until we have a real plan for this.

nalimilan mentioned this issue May 17, 2016

Make arrays and ranges hash and compare equal #16401

Merged

StefanKarpinski added this to the 0.6.0 milestone Sep 13, 2016

StefanKarpinski removed this from the 0.6.0 milestone Feb 2, 2017

mbauman added the breaking This change will break code label Feb 15, 2017

JeffBezanson added this to the 1.0 milestone May 2, 2017

StefanKarpinski assigned nalimilan Jul 6, 2017

StefanKarpinski closed this as completed in #16401 Dec 21, 2017
