-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
agent$validation_set$tbl_checked - why keep a copy of the dataframe for every rule? #542
Comments
Thanks - I think an option to drop I'll just note two things:
Lines 324 to 330 in 362b706
I can take some time to look into whether not having |
I can confirm that this is just dormant metadata! I think the simplest implementation is to introduce an argument to Two tangentially related thoughts:
@rich-iannone Let me know if you have thoughts on these! |
Sorry for the extra late reply here :\ I think the only function that actually uses pointblank/R/get_sundered_data.R Line 344 in 4605b30
and then the function iterates through the data frames / tibbles. Now, this is just for that one function AFAICT so the idea of an argument to opt out of storing these dfs is still a good one. We could have a warning in |
Ah you're right! I missed this by only checking I'll move this into a PR to test it more formally! |
Hello,
I have the following question. I have a dataframe with ca. 136'000 rows and 11 columns . On this dataframe, I define 36 rules. I define the agent, add validation rules to it and then interrogate it. My final object is the interrogated agent. The final object consumes approx. 310 MB of memory. On further analysis (running
object.size
on list components at different levels), I realise that this is due toagent$validation_set$tbl_checked
object (ca. 300 MB). There seems to be a copy of the dataframe stored for (almost) every rule we define, particularly the ones performed on single cells. This motivates two questions from my side:Providing a simple example with a simple dataframe:
`
`
Thanks a lot,
Alissia
The text was updated successfully, but these errors were encountered: