names(.SD) := ... should work #795

brodieG · 2014-09-02T23:49:39Z

This would allow the following type of code:
DT[, names(.SD) := lapply(.SD, rev), .SDcols = -c(1,3,8)]
To reverse every column except 1, 3, and 8 in DT by reference. See discussion in #787. Maybe potentially even:
DT[, := lapply(.SD, rev), .SDcols = -c(1,3,8)]
and have the names of the columns to be updated inferred from the names of the return value to lapply.

The text was updated successfully, but these errors were encountered:

matthieugomez · 2014-11-04T23:13:18Z

That would be very nice.
Another implementation would be that, when .SDcols is not specified, and .SD is present in j, the LHS of := is understood as .SDcols.

brodieG · 2014-11-07T20:47:29Z

Alternate is to create a .SDcols object that is usable in LHS of :=.

Also, using .SD in LHS of := should probably throw an error instead of kind of work. More discussion on SO.

jangorecki · 2014-11-08T09:10:16Z

the straight "workaround" for this would be to use with=FALSE
example in: http://jangorecki.github.io/blog/2014-11-07/Data-Anonymization-in-R.html#minimal-script

still the .SDcols object usable in j (both LHS and RHS) seems to be the best idea.

brodieG · 2014-11-08T16:15:45Z

Jan, not sure with=F is necessary, since you can do:

dt <- data.table(a=1:10, b=1:10, c=rep(c(T,F), 5))
cols <- 1:2
dt[, cols:=lapply(.SD, `*`, 2), .SDcols=cols, with=F]

or

dt[, (cols):=lapply(.SD, `*`, 2), .SDcols=cols]  # add parens
dt

equivalently, though this is a good reminder that with can help in situations when wanting to use data.table programatically, which is another issue I've been discussing with Arun.
Also, note both fail with:

cols <- -3

stefanfritsch · 2015-03-03T15:38:40Z

Another inconsistency with this currently is that

this works:

A<-data.table(x=1:10,y=10:1,z=rnorm(10))

A[,`:=`(colnames(A),.SD)]

and this doesn't:

A[,`:=`(colnames(.SD),.SD)]

The second fails with:

Error in `[.data.table`(A, , `:=`(colnames(.SD), .SD)) : 
  LHS of := isn't column names ('character') or positions ('integer' or 'numeric')

That precludes some elegant .SDcols syntax.

franknarf1 · 2017-11-28T16:10:20Z

SO q to update: https://stackoverflow.com/questions/47535845/perform-operations-on-data-table-columns-based-on-regex?noredirect=1

MichaelChirico · 2018-10-28T15:04:19Z

Unfortunate that this workaround is blocked by :=:

set.seed(23940)
DT = setDT(lapply(integer(10), function(...) sample(1e7, 100)))

DT[ , do.call(`:=`, lapply(.SD, .POSIXct, tz = 'UTC'))]

Error in (function (...) :
Check that is.data.table(DT) == TRUE. Otherwise, := and :=(...) are defined for use in j, once only and in particular ways. See help(":=").

It's unfortunate since the result of lapply is already named, so this is a shorthand for the \`:=\`(V1 = .POSIXct(V1, tz = 'UTC'), ...) approach of explicitly naming columns

DT[ , str(lapply(.SD, .POSIXct, tz = 'UTC'))]
List of 10
 $ V1 : POSIXct[1:100], format: "1970-02-28 08:44:45" "1970-01-26 19:24:38" ...
 $ V2 : POSIXct[1:100], format: "1970-03-14 20:40:49" "1970-01-04 23:53:16" ...
 $ V3 : POSIXct[1:100], format: "1970-01-09 03:32:08" "1970-02-12 06:18:31" ...
 $ V4 : POSIXct[1:100], format: "1970-04-13 04:15:52" "1970-03-17 19:10:23" ...
 $ V5 : POSIXct[1:100], format: "1970-03-22 07:57:03" "1970-02-10 19:42:45" ...
 $ V6 : POSIXct[1:100], format: "1970-01-28 05:56:39" "1970-04-20 11:43:32" ...
 $ V7 : POSIXct[1:100], format: "1970-01-02 03:41:31" "1970-04-01 23:58:52" ...
 $ V8 : POSIXct[1:100], format: "1970-03-05 05:58:29" "1970-03-05 23:27:10" ...
 $ V9 : POSIXct[1:100], format: "1970-04-13 20:29:31" "1970-01-24 12:18:58" ...
 $ V10: POSIXct[1:100], format: "1970-04-22 15:01:36" "1970-03-08 00:33:20" ...
NULL

MichaelChirico · 2019-09-24T09:32:06Z

I actually lean towards allowing .SD on the LHS of :=. More concise and I think the intent is clear. We're doing this with NSE so we can just capture .SD --> names(.SD) anyway.

DT[ , .SD := lapply(.SD, rev), .SDcols = -c(1,3,8)]

That & whatever comes out of #3795 would make adding/editing many columns much less clunky

jangorecki · 2019-09-24T09:39:22Z

or eventually which is more like names(.SD)

DT[ , .SDcols := lapply(.SD, rev), .SDcols = -c(1,3,8)]

grantmcdermott · 2021-11-10T18:15:29Z

I've been thinking about this FR again after having to do quite a bit of "manual" LHS creation in a current project. (FWIW my own preferred option is @MichaelChirico's DT[, .SD := ....], but would support any of the proposed solutions.)

Another possible syntax variant — which would involve even less typing if it is feasible to code up — would be to enable := directly in .SDcols. I'm not sure how others would feel about this, though.

DT[ , lapply(.SD, rev), .SDcols := c(1,3,8)]

MichaelChirico · 2021-11-11T02:42:51Z

It does ~basically read well here, but I would be against that... := semantics are (based on legion user reports/SO Q&A) confusing enough without opening up another API surface for it. Being able to consistently look only for j to know if a table is being updated by reference will keep code more readable than if := could show up in other [ arguments, possibly on other lines, possible separated from j by dozens of lines.

brodieG mentioned this issue Sep 2, 2014

Add a copy argument to [ #787

Closed

eantonya added the feature request label Sep 3, 2014

stefanfritsch mentioned this issue Mar 3, 2015

Allow logical specification in .SDcols #1060

Closed

MichaelChirico mentioned this issue Aug 2, 2016

Reference original table when specifying .SDcols #1786

Closed

franknarf1 mentioned this issue Feb 17, 2017

colA:colB syntax should also work on LHS of LHS := RHS #1710

Closed

MichaelChirico mentioned this issue Oct 15, 2019

Master list of most-requested issues #3189

Open

75 tasks

ColeMiller1 mentioned this issue Jan 8, 2020

names(.SD) should work #4163

Merged

4 tasks

MichaelChirico added the High label May 30, 2020

jangorecki added this to the 1.12.11 milestone Jun 27, 2020

jangorecki added top request One of our most-requested issues and removed High labels Oct 15, 2020

mattdowle modified the milestones: 1.13.1, 1.13.3 Oct 17, 2020

jangorecki mentioned this issue Dec 17, 2020

make it possible to drop columns using .SD #4853

Closed

MichaelChirico mentioned this issue Feb 15, 2021

Unable to assign columns using (names(.SD)) := #4905

Closed

qmarcou mentioned this issue Jul 27, 2021

Assign or set by reference to a function of LHS of ':=' in j #5081

Closed

hendrikvanb mentioned this issue Jul 28, 2021

Programmatically pass expressions to .SDcols #5083

Closed

mattdowle removed this from the 1.14.1 milestone Aug 28, 2021

MichaelChirico mentioned this issue Sep 7, 2021

argument to which is not logical #5135

Closed

Henrik-P mentioned this issue Aug 17, 2022

In-place replacement of .SDcols #5437

Closed

jangorecki mentioned this issue Nov 15, 2023

Error: object '.SDcols' not found #5742

Closed

MichaelChirico closed this as completed in #4163 Mar 20, 2024

phgrosjean mentioned this issue Jul 29, 2024

vignette datatable-sd-usage.Rmd: erreur LHS of := isn't column names phgrosjean/rfrench#45

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

names(.SD) := ... should work #795

names(.SD) := ... should work #795

brodieG commented Sep 2, 2014

matthieugomez commented Nov 4, 2014

brodieG commented Nov 7, 2014

jangorecki commented Nov 8, 2014

brodieG commented Nov 8, 2014

stefanfritsch commented Mar 3, 2015

franknarf1 commented Nov 28, 2017

MichaelChirico commented Oct 28, 2018

MichaelChirico commented Sep 24, 2019

jangorecki commented Sep 24, 2019 •

edited

Loading

grantmcdermott commented Nov 10, 2021 •

edited

Loading

MichaelChirico commented Nov 11, 2021 •

edited

Loading

names(.SD) := ... should work #795

names(.SD) := ... should work #795

Comments

brodieG commented Sep 2, 2014

matthieugomez commented Nov 4, 2014

brodieG commented Nov 7, 2014

jangorecki commented Nov 8, 2014

brodieG commented Nov 8, 2014

stefanfritsch commented Mar 3, 2015

franknarf1 commented Nov 28, 2017

MichaelChirico commented Oct 28, 2018

MichaelChirico commented Sep 24, 2019

jangorecki commented Sep 24, 2019 • edited Loading

grantmcdermott commented Nov 10, 2021 • edited Loading

MichaelChirico commented Nov 11, 2021 • edited Loading

jangorecki commented Sep 24, 2019 •

edited

Loading

grantmcdermott commented Nov 10, 2021 •

edited

Loading

MichaelChirico commented Nov 11, 2021 •

edited

Loading