-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
std thread hardware_concurrency() should not be used #130
Comments
In fact, the solution from
|
Hey, thanks for the report! Switching to the folly implementation shouldn't be too much trouble. I'll consider that for the next release. |
Fixed in e4971b4. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello,
std::thread::hardware_concurrency()
returns, when possible, the underlying hardware capability to run threads, which might not corresponds to the actual number of cores available to the process (through the use of taskset, batch system like slurm, etc...). The consequence is thatmkdwarfs
might run in a non optimal way. For example, if I runtaskset -c 1 mkdwarfs
on my 20 cores machines, it will run 20 workers on only one core.The immediate workaround is to use the -N option to set the number of workers, but I think a more sane behavior would be to use sched_getaffinity as in opencv/opencv#16268. Gromacs did something similar (https://github.com/gromacs/gromacs/blob/1e6873fadf16d5f5be861e6f9ef5f9923a12e540/src/gromacs/hardware/hardwaretopology.cpp#L1221).
What do you think ?
Thank you.
The text was updated successfully, but these errors were encountered: