Allow ! on LHS of := #4031

MichaelChirico · 2019-11-07T05:31:11Z

Related to #571, #1710, a few others about allowing NSE on LHS of :=.

Based on the example given here:

keep = c('Species', 'Sepal.Width')
dt[ , setdiff(names(dt), keep) := NULL]
setcolorder(dt, keep)

Would be quite natural/less clumsy/available for chaining to do:

keep = c('Species', 'Sepal.Width')
dt[ , !keep := NULL] #or maybe, !(keep)
setcolorder(dt, keep)

The text was updated successfully, but these errors were encountered:

moodymudskipper · 2019-11-12T10:57:44Z

I've already been confused by the use of ! in data.table and this one gave me pause too (it's not obvious that we mean "not this column" and not, "negate the values of this variable".

I think this ambiguity can be avoided by having select helpers in the style of https://www.rdocumentation.org/packages/dplyr/versions/0.7.2/topics/select_helpers

We would have :

dt[ , !one_of(keep) := NULL]

And then all other useful select helpers would work as well :

dt[ , starts_with("Petal") := NULL]

In tidy select those helpers return numeric indices, so to use several helpers we need to use set functions like intersect(), union() and setdiff(), I'm not sure why this design choice was made, I think logical output makes more sense and allows more compact syntax using | and &.

So for this to work the select helpers should detect the data table context and [.data.table should allow logical indices on the lhs of := in j.

Allowing functions on the rhs would go with this quite well, so we could do things like :

dt[, sapply(.SD, is.factor) := as.character] # or dt[, sapply(.SD, is.factor) := as.character(.)]

jangorecki · 2019-11-29T15:30:18Z

I am not sure about !keep. If names(.SD) will work in LHS then

keep = c('Species', 'Sepal.Width')
dt[ , setdiff(names(.SD), keep) := NULL]
setcolorder(dt, keep)

mik3y64 · 2019-12-16T09:56:48Z

Upvoting this feature. Column selection and deletion are bread and butter of data manipulation. It is not intuitive to setdiff names of columns and then delete them. It involves two extra steps for code author and not intuitive to be read by colleagues or collaborators. It is like typing --1 (double negative signs) to get 1. I am hoping to see a direct way of selecting column using reference semantics.

keep = c("Species", "Sepal.Width")
dat[ , keep := KEEP]

or a more general but less direct approach, as proposed by @MichaelChirico. This is also similar to typing --1 to get 1 but the codes are much simpler.

dat[ , !keep := NULL]
# or
dat[ , -keep := NULL]

ColeMiller1 · 2019-12-29T12:14:24Z

Another route would be a helper function on j to combine .SDcols with j. Use cases:

dt[, update.at(is.factor, as.character)]
dt[, update.at(!keep, NULL)]

dt[, delete.at(!keep)]

dt[, select.at(keep)]
dt[, select.at(keep, x+ 3)]

MichaelChirico · 2024-04-12T15:17:30Z

With names(.SD) available on LHS of :=, this now works:

dt[ , names(.SD) := NULL, .SDcols=!keep]

Closing here, please open other FRs if there's still something missing.

MichaelChirico mentioned this issue Nov 7, 2019

Scope to include an argument on setcolorder to remove non-referenced columns #4030

Closed

MichaelChirico mentioned this issue Dec 14, 2019

Feature request - support reference semantics for column selection #4111

Closed

ColeMiller1 mentioned this issue Jan 8, 2020

names(.SD) should work #4163

Merged

4 tasks

qmarcou mentioned this issue Jul 27, 2021

Assign or set by reference to a function of LHS of ':=' in j #5081

Closed

MichaelChirico closed this as completed Apr 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow ! on LHS of := #4031

Allow ! on LHS of := #4031

MichaelChirico commented Nov 7, 2019

moodymudskipper commented Nov 12, 2019

jangorecki commented Nov 29, 2019

mik3y64 commented Dec 16, 2019

ColeMiller1 commented Dec 29, 2019

MichaelChirico commented Apr 12, 2024

Allow ! on LHS of := #4031

Allow ! on LHS of := #4031

Comments

MichaelChirico commented Nov 7, 2019

moodymudskipper commented Nov 12, 2019

jangorecki commented Nov 29, 2019

mik3y64 commented Dec 16, 2019

ColeMiller1 commented Dec 29, 2019

MichaelChirico commented Apr 12, 2024