Change fieldnames() and propertynames() to return a tuple rather than an array #25725

nalimilan · 2018-01-24T11:10:59Z

Using an immutable structure makes sense since the names cannot be modified, and it avoids an allocation.

Note: applying the change to propertynames is somewhat less natural given that the number of fields can vary depending on runtime values (as can be seen in the diff). Not sure whether that can be a problem in practice: if so, we could keep returning an array for that function.
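As a rough illustration of the behavior proposed here (a sketch only, assuming a Julia version that includes this change, i.e. 0.7 or later; Point is a hypothetical type made up for the example):

```julia
# Hypothetical example type; not part of the PR.
struct Point
    x::Float64
    y::Float64
end

# With this change, fieldnames returns a tuple of symbols
# instead of a Vector{Symbol}.
field_names = fieldnames(Point)

# For a plain struct, propertynames falls back to the field names of its type.
props = propertynames(Point(1.0, 2.0))

println(field_names)
println(props)
```

Both calls are expected to yield the tuple (:x, :y) for this type.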

smldis · 2018-01-24T11:24:48Z

Regarding the note: this means the return type of propertynames cannot be inferred in general.
Is there a way to recover type stability by collecting the tuple (whose length cannot be inferred) into an array after calling propertynames()?

For example, using Vector{Symbol}(collect(propertynames(x))).

Thanks for fixing that!

KristofferC · 2018-01-24T12:40:23Z

+1 for returning an array for propertynames.

nalimilan · 2018-01-24T12:57:09Z

> Do we have a way to stabilize the type instability by collecting the (length unstable) tuple to an array after calling propertynames()?
>
> For example using Vector{Symbol}(collect(propertynames(x)))

You can always collect the tuple to an array, but the type-instability related to the length of the tuple will still affect the code which works with the tuple.
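A minimal sketch of collecting the names into an array (assuming Julia 0.7 or later; Point2D is a hypothetical type used only for illustration):

```julia
# Hypothetical example type.
struct Point2D
    x::Float64
    y::Float64
end

p = Point2D(1.0, 2.0)

# Tuple of symbols; its length is part of its type.
names_tuple = propertynames(p)

# collect turns the tuple into a Vector{Symbol}; the vector's type no longer
# encodes the number of names.
names_vec = collect(names_tuple)

# A typed array literal keeps the Symbol element type even for an empty tuple.
names_typed = Symbol[names_tuple...]
```

The resulting vector has a fixed type, but any code that touches names_tuple before the conversion still sees a type that depends on the number of names.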

FWIW, the propertynames methods defined in LinearAlgebra don't seem to be a real problem, because the length of the returned tuple only depends on the private::Bool argument, and these functions are going to be inlined anyway given how simple they are (which should allow inference to get rid of the unused branch).
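The pattern described above, where the tuple length depends only on the private flag, might look roughly like this. This is a sketch with a hypothetical CachedSeries type, not the actual LinearAlgebra code, and whether the compiler removes the unused branch is not verified here:

```julia
# Hypothetical type with one "public" field and one internal field.
struct CachedSeries
    values::Vector{Float64}
    cache::Vector{Float64}
end

# The length of the returned tuple depends only on the private flag.
Base.propertynames(s::CachedSeries, private::Bool=false) =
    private ? (:values, :cache) : (:values,)

s = CachedSeries([1.0, 2.0], Float64[])
public_names = propertynames(s)        # private defaults to false
all_names = propertynames(s, true)     # also lists the internal field
```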

The type instability problem will be more serious for types like DataFrame, for which the property names are fully dynamic. But since type instabilities happen all the time with DataFrame, I'm not sure it's really a problem.

nalimilan · 2018-01-24T15:58:43Z

Actually, I had forgotten that propertynames falls back to fieldnames, so at least in some cases it has to return the same type as fieldnames. Some types could be allowed to return a vector, though.

KristofferC · 2018-01-24T16:00:30Z

> Actually, I had forgotten that propertynames falls back to fieldnames, so at least in some cases it has to return the same type as fieldnames

The propertynames(x) fallback could just be changed to collect(fieldnames(typeof(x)))?

nalimilan · 2018-01-24T16:06:31Z

Right, we could do that too.

vtjnash · 2018-01-24T16:50:40Z

> and it avoids an allocation.

It allocates a tuple, so this change doesn't make it allocate less or make it more inferrable (it's based on fieldname, which can't easily be inferred). If we do decide we want to do this, we need to change how these names are stored so that we return the existing object rather than necessarily copying it (from svec to vector or tuple).

smldis · 2018-01-24T17:20:34Z

We are discussing the interface, not the implementation. But it seems the choice is driven by how well collecting that tuple into an array can be optimized (the implementation can come later), without the usual performance impact of the type instability.

If we use an array, the allocation can still be avoided with a mutating propertynames!(myarray, mytypeinstance).

nalimilan · 2018-01-29T22:12:42Z

More opinions?

StefanKarpinski · 2018-02-08T22:24:20Z

Triage favors rebasing and merging this (with the exception of @vtjnash who dissents).

JeffBezanson · 2018-02-08T22:25:04Z

I'm +1 on this. But, I would also say propertynames is more flexible and can return any collection of symbols, not necessarily a tuple.

nalimilan · 2018-02-09T09:29:00Z

OK, I've rebased and changed the propertynames docstring to say "tuple or vector" so that custom types can use either type depending on whether the number of elements is statically known or not.
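For a type whose property names are only known at run time, returning a vector could look roughly like the following. This is a sketch, assuming Julia 0.7 or later; DynamicRecord is a hypothetical type and not something defined in this PR:

```julia
# Hypothetical type whose properties live in a dictionary.
struct DynamicRecord
    data::Dict{Symbol,Any}
end

# Forward property access to the dictionary; getfield avoids recursing
# into this getproperty method.
Base.getproperty(r::DynamicRecord, name::Symbol) = getfield(r, :data)[name]

# The number of names is not statically known, so return a Vector{Symbol}.
# Sorting just makes the order deterministic for the example.
Base.propertynames(r::DynamicRecord, private::Bool=false) =
    sort!(collect(keys(getfield(r, :data))))

r = DynamicRecord(Dict{Symbol,Any}(:b => 2, :a => 1))
record_names = propertynames(r)
first_value = r.a
```

Here the return type is always Vector{Symbol}, whatever the number of keys, which is the case the "tuple or vector" wording is meant to allow.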

StefanKarpinski · 2018-02-09T17:44:11Z

@JeffBezanson, I've assigned you to review – whenever you have a chance, just take a look and merge since its tests are as solid green as it's possible to get at the moment.

nalimilan mentioned this pull request Jan 24, 2018

Should fieldnames return a tuple rather than an array? #25327

Closed

nalimilan force-pushed the nl/fieldnames branch from 085290e to 40dbfd7 Compare January 24, 2018 12:58

nalimilan added the triage label Feb 2, 2018

StefanKarpinski removed the triage label Feb 8, 2018

StefanKarpinski added this to the 1.0 milestone Feb 8, 2018

nalimilan force-pushed the nl/fieldnames branch from 40dbfd7 to 1ebc2ce Compare February 9, 2018 09:23

StefanKarpinski requested a review from JeffBezanson February 9, 2018 17:43

JeffBezanson approved these changes Feb 9, 2018

JeffBezanson merged commit 16620b6 into master Feb 9, 2018

JeffBezanson deleted the nl/fieldnames branch February 9, 2018 19:05

vddvss mentioned this pull request Jul 29, 2018

fix broke sortperm() call for 0.7 compat julia-vscode/DocumentFormat.jl#20

Closed

galenlynch mentioned this pull request Aug 18, 2018

The fieldnames and propertynames functions now return a tuple rather than an array JuliaLang/Compat.jl#620

Closed
