Arraypocalypse Now and Then #255

mbauman · 2015-09-15T23:41:19Z

This issue supersedes the 0.4 work towards array nirvana (#7941) and tracks the issues we aim to complete during 0.5 and beyond; it is now updated through work on 0.7. This is an umbrella issue and will track specific tasks in other issues. Please feel free to add things that I've missed.

Required underlying technologies

Julia native bounds checking and removal (#7799). Several attempts have been made at this, but I believe the current plan of action is to make @inbounds elide code blocks wrapped in an @boundscheck macro, propagating down only one level of inlining (see the comment on julia#7799, extensible bounds checking removal). This is a strong requirement for the subsequent steps. (Implemented in julia#14474, "elide code marked with @boundscheck(...)".)
ReshapedArrays (#10507). Requires better performance: https://groups.google.com/d/msg/julia-dev/7M5qzmXIChM/kOTlGSIvAwAJ
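
As an illustration of the bounds-checking plan above, here is a minimal sketch in current Julia. The `Wrapped` type and `sum_unchecked` function are hypothetical names made up for this example; only `@boundscheck`, `@inbounds`, and `checkbounds` are real Base API.

```julia
# Hypothetical wrapper type, used only for illustration.
struct Wrapped{T} <: AbstractVector{T}
    data::Vector{T}
end

Base.size(w::Wrapped) = size(w.data)

# The check sits inside @boundscheck, so a caller's @inbounds may elide it
# once this method has been inlined into the caller.
@inline function Base.getindex(w::Wrapped, i::Int)
    @boundscheck checkbounds(w, i)
    return @inbounds w.data[i]
end

function sum_unchecked(w::Wrapped)
    s = zero(eltype(w))
    for i in eachindex(w)
        s += @inbounds w[i]   # the check in getindex may be skipped here
    end
    return s
end

w = Wrapped([1, 2, 3])
sum_unchecked(w)   # 6
```

Without `@inbounds` at the call site, `w[4]` still throws a `BoundsError`, so the check is only removed where the caller asks for it.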

Major 0.5 breaking behavior changes

Drop dimensions indexed by a scalar (Taking vector transposes seriously #42; more generally, APL-style slicing where the rank of a slice is the sum of the ranks of the indexes, see below). PR: julia#13612 (RFC: Drop dimensions indexed by scalars).
Flip the switch on the concatenation deprecation (#8599)
Remove default no-op behavior for (c)transpose (#13171)
Change sub behaviour to slice (#16846)
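
The scalar-dropping rule can be seen in a short example. This is a sketch of the behavior as it exists in current Julia; the variable names are arbitrary.

```julia
A = reshape(1:6, 2, 3)   # 2×3 matrix

row  = A[1, :]      # a scalar index drops the first dimension
kept = A[1:1, :]    # a length-1 range keeps it

size(row)    # (3,)
size(kept)   # (1, 3)
```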

Major 0.6 breaking behavior changes

Vector transpose returns a covector (Taking vector transposes seriously #42). Implemented in julia#19670 (Introduce RowVector as the transpose of a vector).
Vector conjugation returns lazy wrapper (#20047)
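
A small sketch of what the covector behavior looks like. Note the assumption: this uses current Julia, where the wrapper type is `Adjoint` from the `LinearAlgebra` standard library, rather than the `RowVector` type named above for 0.6.

```julia
using LinearAlgebra

v = [1, 2, 3]
vt = v'            # lazy wrapper around v, not a copied 1×3 Matrix

vt isa Adjoint     # true in current Julia
size(vt)           # (1, 3)
vt * v             # 14, a scalar rather than a 1-element array
size(v * vt)       # (3, 3), the outer product

v[1] = 10
vt[1, 1]           # 10, because the wrapper shares memory with v
```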

Possible future breaking changes

Matrix transposition and conjugation return lazy wrappers (#25364)
Return slices as views. A first attempt at this was julia#9150 (RFC: Return views from UnitRange indexing of Arrays). It is still unclear whether the possible performance changes are consistent and large enough to be worth the breakage. See julia#3701 (range indexing should produce a subarray, not a copy).
Should reductions drop dimensions? See julia#16606 (array reductions (sum, mean, etc.) and dropping dimensions).
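
To make the copy-versus-view distinction concrete, here is a minimal sketch in current Julia, where slicing still copies by default and views are requested explicitly with `@view`.

```julia
A = collect(reshape(1:6, 2, 3))

c = A[:, 1]          # copy (the current default)
v = @view A[:, 1]    # SubArray sharing memory with A

v[1] = 100           # writes through to A
A[1, 1]              # 100
c[1]                 # 1, the copy is unaffected
```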

New functionality

Allow expression of varargs of defined length (#11242). This allows us to take full advantage of julia#10525 (RFC: Give AbstractArrays smart and performant indexing behaviors for free).
Ditch special lowering of Ac_mul_Bt, use dispatch on the lazy transpose wrappers instead. (#5332; julia#25217, sunset linalg jazz)
Dimensions indexed by multidimensional arrays add dimensions (full APL-style: the dimensionality of the result is the sum of the dimensionalities of the indices). (#15431)
~~Allow any index type in non-scalar indexing (#12567).~~ ~~Tighten scalar indexing to indices <: Integer and widen non scalar indexing to <: Union{Number, AbstractArray, Colon} (RFC: Allow any index type in nonscalar indexing julia#12567 (comment)).~~ More systematic conversion of indices such that any index type can be converted into an Int or AbstractArray: RFC: Speedier, simpler and more systematic index conversions julia#19730
Easier creation of immutable arrays with tuples and julia#12113 (WIP: add support for working with immutables, #11902).
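
The full APL-style rule (result rank equals the sum of the index ranks) can be checked with a short sketch in current Julia; `idx` is just an illustrative name.

```julia
A = reshape(1:24, 2, 3, 4)
idx = [1 2; 2 1]          # a 2×2 matrix of indices into the first dimension

B = A[idx, 1:3, 1]        # index ranks: 2 + 1 + 0

ndims(B)                  # 3
size(B)                   # (2, 2, 3)
B[1, 2, 3] == A[idx[1, 2], 3, 1]   # true
```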

Other speculative possibilities


tbreloff · 2015-09-16T00:14:53Z

This looks like a great list Matt, thanks. I'm a little scared of the
fallout but it'll be a huge leap forward for the language.

On Tuesday, September 15, 2015, Matt Bauman wrote:

This issue supersedes the 0.4 work towards array nirvana (JuliaLang/julia#7941), and will track the issues we aim to complete during 0.5. This is an umbrella issue and will track specific tasks in other issues. Please feel free to add things that I've missed (at least during the first half of 0.5).

Required underlying technologies

Julia native bounds checking and removal (julia#7799, extensible bounds checking removal). Several tries have been made at this, but I believe the current plan of action is to make @inbounds elide code blocks hidden within an @boundscheck macro, propagating down only one level of inlining (julia#7799 (comment)). This is a strong requirement for the subsequent steps.
ReshapedArrays (julia#10507, WIP: ReshapedArrays). Requires better performance: https://groups.google.com/d/msg/julia-dev/7M5qzmXIChM/kOTlGSIvAwAJ

Breaking behavior changes

Drop dimensions indexed by a scalar (#4774 (comment); Taking vector transposes seriously #42)
Return slices as views. A first attempt at this was at julia#9150 (RFC: Return views from UnitRange indexing of Arrays). Special attention may be needed for BitArrays.
Flip the switch on the concatenation deprecation (julia#8599, WIP: Make [a, b] non-concatenating)

New functionality

Allow expression of varargs of defined length (julia#11242, NTuples made me sad (so I nixed them)). This allows us to take full advantage of julia#10525 (RFC: Give AbstractArrays smart and performant indexing behaviors for free).
Transpose returns a covector or transpose type (#4774 (comment); Taking vector transposes seriously #42)
Ditch special lowering of Ac_mul_Bt, use dispatch instead.
Dimensions indexed by multidimensional arrays add dimensions (full APL-style: the dimensionality of the result is the sum of the dimensionalities of the indices)
Allow any index type in non-scalar indexing (julia#12567, RFC: Allow any index type in nonscalar indexing)
Easier creation of immutable arrays with tuples and julia#12113 (WIP: add support for working with immutables, #11902).

Other speculative possibilities

A mutable fixed-size buffer type, which would allow for a Julia-native Array definition (julia#12447, Introduce Buffer type and make Array an abstraction on top of it)
Base IndexSet on BitArray or perhaps any AbstractArray{Bool}.
Rework nonscalar indexing to prevent calling find on logical arrays and simply wrap it with an IndexSet instead?
Negated indexing with complement IndexSet? (Perhaps in a package)

JeffBezanson · 2015-09-16T00:22:10Z

Yes, great list! Let's roll up our sleeves!

tbreloff · 2015-09-16T00:26:18Z

Are there any pieces of this that are unclaimed? I'd like to help, but I
don't want to step on anyone's toes.


StefanKarpinski · 2015-09-16T02:21:17Z

It's a phenomenal list. There's actually only a few changes that are very breaking, which is nice, but those are significant enough.

mbauman · 2015-09-16T02:43:35Z

There is definitely more than enough work to go around! I don't think any of these tasks are claimed.

Things like dropping scalar dimensions are rather simple changes, but will take a lot of work finding and fixing bugs... and that part is easy to collaborate on. Same goes for views (if you ignore the perf issues with ReshapedArrays and inbounds). Anyone is welcome to dig in!

jakebolewski · 2015-09-16T02:48:21Z

Views is hard, you have to make it through bootstrap without 🔪 yourself in the 👀.

StefanKarpinski · 2015-09-16T03:00:22Z

Having just done a bunch of work to get string changes through bootstrap, I'm inclined to believe this.

milktrader · 2015-09-16T11:33:45Z

Thanks for doing this @mbauman, so much to digest

jiahao · 2015-09-16T13:55:10Z

I've added "Remove default no-op behavior for (c)transpose" as an item. I expect much complaining, but as we've discussed before, it's simply wrong to assume that <:Any is a scalar and the logic error rears its head every time one tries to wrap and/or implement custom array/matrix types. cc @jakebolewski @andreasnoack

StefanKarpinski · 2015-09-16T14:10:58Z

I think we need to think through the options for that carefully. It's pretty idiomatic to write A' to transpose a non-complex matrix.

mbauman · 2015-09-16T14:17:57Z

Isn't it possible that the (c)transpose wrapper will solve this issue? There's a lot of design work that will need to go into it, but:

transpose(A::AbstractVectorOrMatrix) = TransposeType(A) # Will call `transpose` upon indexing, too
transpose(::AbstractArray) = error("cannot take transpose of 3+ dim array") # yeah, error methods aren't the best…
transpose(x) = x
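
The three-method idea above can be turned into a runnable sketch. Everything here is an assumption for illustration: `LazyTranspose` and `lazytranspose` are hypothetical names (chosen so as not to clash with `Base.transpose`), and the sketch handles only matrices, not vectors.

```julia
# Hypothetical lazy wrapper; not Base API.
struct LazyTranspose{T,P<:AbstractMatrix{T}} <: AbstractMatrix{T}
    parent::P
end

Base.size(t::LazyTranspose) = reverse(size(t.parent))

# Swap the indices and also transpose the element on access.
Base.getindex(t::LazyTranspose, i::Int, j::Int) = lazytranspose(t.parent[j, i])

lazytranspose(A::AbstractMatrix{T}) where {T} = LazyTranspose{T,typeof(A)}(A)
lazytranspose(::AbstractArray) = throw(ArgumentError("this sketch only handles matrices"))
lazytranspose(x) = x   # scalar-like fallback: the no-op under discussion

A = [1 2 3; 4 5 6]
At = lazytranspose(A)
size(At)     # (3, 2)
At[3, 1]     # 3
```

Because the wrapper holds a reference to its parent, a later `A[1, 3] = 30` is visible as `At[3, 1] == 30`. The last method is exactly the fallback whose removal is debated in this thread.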

carnaval · 2015-09-16T14:22:18Z

Related: I think some of the typing problems in linalg (and transpose behavior) come from the fact that we represent a blocked linear operator as an array of arrays. We may want to switch to a new type that knows the various sizes inside it. I remember discussing that with @andreasnoack. 0.5 might be a time to at least think about it.

jiahao · 2015-09-16T14:30:06Z

Isn't it possible that the (c)transpose wrapper will solve this issue?

Maybe; we'd have to think about it.

The blocking issue last time is that transpose has to be recursive to handle cases like Matrix{Matrix} (Example: A = [rand(1:5, 2, 2) for i=1:2, j=1:2]; A') correctly, but people want to write A' on 2D arrays of non-numeric types (e.g. images, as Matrix{<:Colorant}) and expect the transpose to not apply to the scalar elements. The no-op transpose(x::Any) method exists to handle these cases. However, this definition conflicts with matrix-like objects, which have the algebraic semantics of matrices but are not stored internally in any array-like form, and hence by JuliaLang/julia#987 should not be an AbstractArray (QRCompactWYQ is the poster child, but we have many such examples). If you introduce a new matrix-like type, you have to explicitly define (c)transpose; otherwise you get the no-op fallback, which is a source of many bugs.

To be clear, the behavior we would break explicitly is the claim (which you can find in the help for permutedims) that

Transpose is equivalent to permutedims(A, [2,1]).

This equivalence makes no sense for types that are not AbstractArrays, or that are AbstractArrays of non-scalars, and we actually do have matrix-like types that need this more abstract sense of transpose.
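
The difference between a recursive transpose and permutedims can be sketched on a matrix of matrices. This shows the behavior of current Julia, where `transpose` is recursive and `permutedims` is not; the block contents are arbitrary.

```julia
# 2×2 matrix whose elements are 2×3 blocks
A = [fill(10i + j, 2, 3) for i in 1:2, j in 1:2]

T = transpose(A)      # recursive: the blocks are transposed too
P = permutedims(A)    # non-recursive: only the outer layout changes

size(T[1, 2])         # (3, 2)
size(P[1, 2])         # (2, 3)
T[1, 2] == transpose(A[2, 1])   # true
P[1, 2] == A[2, 1]              # true
```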

tbreloff · 2015-09-16T14:39:03Z

I think assuming that a Matrix{Matrix} will automatically recursively
transpose the elements is bad and dangerous. I'd rather see a special type
BlockMatrix{Matrix} that does what you're looking for.


jiahao · 2015-09-16T14:40:45Z

@tbreloff that's exactly my point. Most people think it's odd that transpose should be recursive, but it must be to be a mathematically correct transpose, and that disquiet exposes corner cases where transposition is not simply permutedims(A, [2,1]). (Although it's true that Matrix{Matrix} is not really a blocked matrix type, because there are absolutely no guarantees that the inner matrices have dimensions that are consistent with any partitioning of a larger matrix.)

mbauman · 2015-09-16T15:10:07Z

Ah, yes, I forgot about all the Tensor-like objects that aren't AbstractArrays. No matter what happens, either the authors of Tensor-like objects will need to communicate to Julia somehow that they're not scalars (sometimes being an AbstractArray works, but not always), or the authors of Scalar-like objects will need to do the reverse (sometimes being a Number works, but not always), or both. This same sort of scalar-or-not question rears its head all over the place… e.g., indexing: JuliaLang/julia#12567.

Right now we require a mishmash of method specializations for the tensors with some scalar-like fallbacks. This means that some get forgotten and we end up with scalar methods getting called, returning the wrong result.

Since we can't communicate this through supertypes (JuliaLang/julia#987 (comment)), I think it's gotta either be through better documentation of the required methods or having those methods explicitly encoded into a traits-based system. And if we remove all fallbacks (which ensures correct behavior at the cost of missing methods for everyone), I think we need to make this as simple as possible with traits.

johnmyleswhite · 2015-09-16T15:12:06Z

+1

I really think we should start enumerating traits for AbstractArrays. Linear indexing seems to have been an amazing example of the power of a few careful design decisions involving traits.
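
For readers unfamiliar with the linear-indexing trait mentioned here, a minimal sketch of trait-based dispatch follows. It assumes current Julia, where the trait is spelled `IndexStyle`; `indexing_kind` is a hypothetical function name used only for this example.

```julia
# Compute the trait once, then dispatch on its value.
indexing_kind(A::AbstractArray) = indexing_kind(IndexStyle(A), A)
indexing_kind(::IndexLinear, A) = :linear
indexing_kind(::IndexCartesian, A) = :cartesian

M = [1 2 3; 4 5 6]
indexing_kind(M)                     # :linear
indexing_kind(view(M, :, [1, 3]))    # :cartesian
```

The trait is a property of the array type rather than a supertype, which is why it can cut across the AbstractArray hierarchy.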

mbauman · 2015-09-16T16:29:57Z

One possible solution would be to keep AbstractArray <: Any, and introduce AbstractNonArrayTensor <: Any alongside it. Everything else would be considered "scalar" as far as indexing and linear algebra are concerned.

Note that this is distinct from and much more well-defined than an Atom vs. Collection distinction (JuliaLang/julia#7244 (comment)); A[:] = (1,2,3) and A[:] = "123" behave very differently from A[:] = 1:3 for A = Array(Any, 3), as they should.

ScottPJones · 2015-09-16T17:01:45Z

I really think we should start enumerating traits for AbstractArrays.

@johnmyleswhite By any chance did you mean language supported "traits"? That's one thing I've really wanted to see in the language since JuliaCon.

johnmyleswhite · 2015-09-16T18:39:23Z

Yes, I meant language supported traits.

ScottPJones · 2015-09-16T18:42:18Z

Do you have any ideas/suggestions about how traits could be added to Julia, syntactically? They would be very useful for arrays, strings, encodings, at the very least.

johnmyleswhite · 2015-09-16T18:44:26Z

I prefer leaving those decisions to Jeff rather than speculating.

ViralBShah · 2015-09-16T18:58:44Z

Given that this is an umbrella issue, it would be nice to discuss specific items in their own issues.

ScottPJones · 2015-09-16T19:05:29Z

I do think though that having traits in the language might substantially change the design for Arrays, which is why discussion of traits, at least in the area of how they could be used for better Array abstractions, would be useful.

mbauman · 2015-09-16T19:22:38Z

Please move traits discussion to JuliaLang/julia#5, and the (c)transpose issue to JuliaLang/julia#13171.

StefanKarpinski · 2015-09-16T19:23:31Z

I don't think the syntax for traits needs to be figured out before figuring out what traits are important for arrays. In fact, having more examples of traits that we actually need is excellent for helping design the language feature.

ScottPJones · 2015-09-16T20:42:50Z

Ok, good point, as long as traits are being thought of as part of the design.

ScottPJones · 2015-09-16T20:44:35Z

Upper and lower for triangular matrices? Those seem like good candidates for being done as traits to me.

davidanthoff · 2016-05-24T16:58:52Z

We've had that for ages; see sub and slice, which got fast in time for julia-0.4. We now have a few additional view types, too (at least ReshapedArray and the unexported PermutedDimsArray).

Has there been any consideration of renaming sub to view? sub really doesn't indicate very well that it will return a view...

JaredCrean2 · 2016-05-24T17:08:01Z

view is used by the ArrayViews package, which currently has significant performance advantages in certain cases (small contiguous views).

andreasnoack · 2016-05-24T17:14:23Z

Following up on @davidanthoff, I think we should deprecate either sub or slice and it should probably be sub now that getindex behavior matches slice.

mbauman · 2016-05-24T17:19:14Z

Is the problem with allowing floats to be used for indexing technical or philosophical?

It's a mix of both, but if we detangle scalar/nonscalar to_index like I suggest above then that removes (the last of?) the technical reasons. Linear indexing requires doing math on the indices, which requires integers. A lot has changed since we merged that deprecation (JuliaLang/julia#10458).

davidanthoff · 2016-05-24T17:19:20Z

Ah, I had thought the base capability is now a complete superset of the ArrayViews stuff and always preferred... It is a bit of a shame that the more intuitive name is used in the package and base is left with something less clear...

StefanKarpinski · 2016-05-24T17:29:19Z

It seems unfortunate and somewhat backwards for a name used in a package to block choosing a much clearer name for a function in Base. Perhaps the way forward is to eliminate the performance advantage of ArrayViews, deprecate that package, and change the function name to Base.view.

timholy · 2016-05-24T17:34:57Z

My assumption/hope is that the performance gap will go away when we are able to stack-allocate containers with references...and I am skeptical that there's anything we can do about it without that. The ArrayView wrapper is just smaller, so being able to inline & elide the wrapper creation should do the trick.

That's why the difference only shows up with creation of small arrays---on most other benchmarks, SubArray equals or outperforms ArrayViews.

Oh, and I agree about deprecating sub.

JeffBezanson · 2016-05-24T18:06:50Z

Base and ArrayViews could both use view. After all, ImageView also uses view :)

JaredCrean2 · 2016-05-24T18:49:58Z

I suppose that could work because ArrayViews defines methods only for Array but Base defines methods for AbstractArray (I think?). How would this work when different scopes exist:

module A
  function myfunc(a::AbstractMatrix)
    av = view(a, :, 1)
    # do something with av
  end
end

module B
  using ArrayViews
end

Does typeof(av) change depending on whether module B has been loaded (assuming a is an Array)?

timholy · 2016-05-24T18:56:01Z

ArrayViews would have to stop exporting view, and anytime you wanted to use ArrayViews' version you'd say ArrayViews.view(A, :, 1).
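
The scoping answer above can be sketched with toy modules. All names here are hypothetical: `ToyViews` stands in for a package that defines its own unexported `view`, and `Consumer` stands in for module A in the question. The sketch assumes current Julia, where `Base.view` exists.

```julia
module ToyViews   # hypothetical stand-in for a package with its own `view`
    struct ToyView{T}
        data::Vector{T}
    end
    # Not exported and not extending Base.view: this is a separate function.
    view(A::Matrix, ::Colon, j::Int) = ToyView(A[:, j])
end

module Consumer
    # Unqualified `view` resolves to Base.view here, whatever else is loaded.
    first_column(a::AbstractMatrix) = view(a, :, 1)
end

a = [1 2; 3 4]
Consumer.first_column(a) isa SubArray          # true
ToyViews.view(a, :, 1) isa ToyViews.ToyView    # true
```

So the type of the result inside `Consumer` does not depend on whether the other module has been loaded; the package version is only reached through the qualified name.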

ImageView would also have to stop exporting view, but I'm fine with that idea.

StefanKarpinski · 2016-06-16T16:51:46Z

Of the remaining issues, only #57 and JuliaLang/julia#16846 remain for 0.5; moving this issue to 0.6.

timholy · 2016-06-16T18:41:06Z

I'm also planning to do JuliaLang/julia#16260 (comment):

transitionally (julia-0.5 only) make size and length throw an error for arrays with unconventional indexing
introduce @arraysafe to rewrite calls to size and length to something that doesn't throw an error
merge allocate_for into similar
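
For context, here is a rough sketch of how index-agnostic code is written in current Julia, where loops use `axes` and allocation goes through `similar` instead of assuming indices run from 1 to `size(A, d)`. This is not the transitional 0.5 mechanism described above; `colsums` is a hypothetical function name.

```julia
# Index-agnostic column sums: never assumes that indices start at 1.
function colsums(A::AbstractMatrix)
    out = fill!(similar(A, eltype(A), axes(A, 2)), zero(eltype(A)))
    for j in axes(A, 2), i in axes(A, 1)
        out[j] += A[i, j]
    end
    return out
end

colsums([1 2 3; 4 5 6])   # [5, 7, 9]
```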

Probably won't be done until after JuliaCon, unfortunately. In that same discussion, @eschnett has made a case for introducing a separate type for linear indexing, and pointed out that the introduction of linearindices is an opportune moment to do so; I don't disagree, but I don't think I'll have time to tackle that myself, so that one is up-for-grabs.

StefanKarpinski · 2016-06-16T18:54:22Z

As long as you're on it, @timholy – needs to be done by next week so we can tag an RC. If you'd like to open an issue and put it in the 0.5.0 milestone so we can track it, you can.

ufechner7 · 2016-06-17T16:54:13Z

Shouldn't the title be adapted, if the milestone is moved?

JeffBezanson · 2017-01-06T04:04:33Z

I believe the 0.6 parts of this are reflected in more specific issues and PRs; moving to 1.0.

stevengj · 2017-01-21T03:59:01Z

See also JuliaLang/julia#20164 to more easily opt-in to views for a large block of code.
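
As a sketch of what block-level opt-in looks like, the example below uses the `@views` macro from current Julia, assuming that is what the linked PR became. The function names are made up for illustration.

```julia
# Every slicing expression inside the function body becomes a view.
@views function first_col_minus_last(A)
    return A[:, 1] .- A[:, end]
end

@views first_col(A) = A[:, 1]

M = [1 2 3; 4 5 6]
first_col(M) isa SubArray     # true
first_col_minus_last(M)       # [-2, -2]
```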

StefanKarpinski · 2017-07-20T17:48:37Z

Everything in this issue is either done, has its own issue, or isn't going to happen (I checked).

mbauman mentioned this issue Sep 15, 2015

Towards array nirvana JuliaLang/julia#7941

Closed

15 tasks

mbauman mentioned this issue Sep 16, 2015

Make (c)transpose less error-prone #257

Closed

mbauman mentioned this issue May 25, 2016

Create @view macro for creating SubArrays via indexing. JuliaLang/julia#16564

Merged

simonbyrne mentioned this issue Jun 9, 2016

Change sub behaviour to match getindex, possibly rename. JuliaLang/julia#16846

Closed

StefanKarpinski changed the title ~~Arraypocalypse Now (0.5 release)~~ Arraypocalypse Now (0.5 release and onward) Jun 17, 2016

StefanKarpinski changed the title ~~Arraypocalypse Now (0.5 release and onward)~~ Arraypocalypse Now Jun 17, 2016

StefanKarpinski changed the title ~~Arraypocalypse Now~~ Arraypocalypse Now and Then Jun 17, 2016

This was referenced Jul 21, 2016

treat .= as syntactic sugar for broadcast! JuliaLang/julia#17510

Merged

improve a[...] .= handling of arrays of arrays and dicts of arrays JuliaLang/julia#17568

Merged

jiahao mentioned this issue Sep 2, 2016

Taking vector transposes seriously #42

Closed

StefanKarpinski closed this as completed Jul 20, 2017

simonbyrne mentioned this issue Jul 20, 2017

RFC: Return views from UnitRange indexing of Arrays JuliaLang/julia#9150

Closed

tpapp mentioned this issue Oct 4, 2017

size padding: document or remove JuliaLang/julia#23985

Open

tlienart mentioned this issue Mar 27, 2020

Chasing dead links JuliaLang/www.julialang.org#690

Closed

KristofferC transferred this issue from JuliaLang/julia Nov 26, 2024
