You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I want to use plyr on a large data.frame but make sure that I don't accidentally create copies of the dataset which could cause RAM to blow up.
I understand that when using plyr serially in non-parallel mode, it is careful to not make copies of the data.frame. From the original paper:
It is more memory-efficient than the naive split-apply-combine approach because plyr is careful not to make an extra copy of the data in the split step.
However, if using plyr in parallel mode, is the entire data.frame copied at each core? or is only 1 copy used for the entire analysis?
The text was updated successfully, but these errors were encountered:
I want to use
plyr
on a largedata.frame
but make sure that I don't accidentally create copies of the dataset which could cause RAM to blow up.I understand that when using
plyr
serially in non-parallel mode, it is careful to not make copies of thedata.frame
. From the original paper:However, if using
plyr
in parallel mode, is the entiredata.frame
copied at each core? or is only 1 copy used for the entire analysis?The text was updated successfully, but these errors were encountered: