use all threads by default #5008

Closed
waynelapierre opened this issue May 17, 2021 · 3 comments
Comments

@waynelapierre

Why not make the data.table package use all threads by default?

@jangorecki
Member

@waynelapierre if you look through the issues you will probably find the reason.

@mattdowle
Member

But, @jangorecki, it might be hard to search for! I tried a few Google searches and none of them turned up the following.

See new feature 3 in v1.12.2 (Apr 2019):

[image]

So if you prefer to use all logical CPUs by default, on servers for example, just set the environment variable R_DATATABLE_NUM_PROCS_PERCENT to 100 (meaning 100%). Depending on the task, you'll need to experiment to find what works best for your data and your machine.
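As a minimal sketch of the setting described above: this assumes a POSIX-style shell and that the variable takes a plain integer percentage (100, with no `%` sign). The variable has to be in the environment before R starts, because data.table picks it up when the package is loaded.

```shell
# Assumption: the value is a plain integer percentage (100, without a "%" sign).
# Export it before launching R so the R process inherits it.
export R_DATATABLE_NUM_PROCS_PERCENT=100

# Show what the R process will see.
echo "R_DATATABLE_NUM_PROCS_PERCENT=${R_DATATABLE_NUM_PROCS_PERCENT}"
```

From inside R, `data.table::getDTthreads(verbose = TRUE)` should report the resulting thread count, and `data.table::setDTthreads(percent = 100)` is the in-session alternative if changing the environment is not convenient.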

@jangorecki
Member

@waynelapierre Please subscribe to h2oai/db-benchmark#202 for more precise information about threads vs timing performance.
