[FeatureRequest] Add domain for HWthreads closest to GPUs #534
Comments
I am also interested in being able to support CPU pinning in combination with GPU usage. What is the current best practice with respect to likwid-pin? @JanLJL you mentioned

A very recent issue we had was people running torchrun with Python code that starts one dataloader+train process and additional dataloader processes. The dataloader+train process is what nvidia-smi reports as using the GPU; the remaining dataloaders are child processes of it. torchrun is really crappy at pinning correctly, so we are looking for a way to "help" it. likwid-pin would be a good candidate for this, but it's unclear how one would invoke it.
Hello,

Thanks for increasing priority on this feature request.

The current workflow would be to run

One big question for this feature request is whether

I never tried

Hierarchies are currently not supported but also not needed.
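The manual lookup discussed in this thread (find the NUMA domain nearest a GPU, then pin there) can be sketched in Python for a Linux host. This is only an illustration under stated assumptions, not the maintainers' workflow and not the proposed affinity domain: it assumes the GPU's PCI address is already known in sysfs form (the value `0000:3b:00.0` below is hypothetical), it reads the standard sysfs files `numa_node` and `cpulist`, and the helper names `parse_cpulist`, `cpus_near_pci_device`, and `likwid_pin_command` are made up for this example. The resulting command uses likwid-pin's `-c` option with an explicit list of hardware thread IDs.

```python
from pathlib import Path


def parse_cpulist(text):
    """Expand a kernel cpulist string such as '0-3,8,10-11' into a list of ints."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-")
            cpus.extend(range(int(lo), int(hi) + 1))
        else:
            cpus.append(int(part))
    return cpus


def cpus_near_pci_device(pci_addr, sysfs=Path("/sys")):
    """Return the hardware threads of the NUMA node the PCI device is attached to.

    pci_addr is assumed to be in sysfs form, e.g. '0000:3b:00.0' (hypothetical).
    """
    node_file = sysfs / "bus" / "pci" / "devices" / pci_addr / "numa_node"
    node = int(node_file.read_text().strip())
    if node < 0:
        # The kernel reports -1 when it knows no NUMA affinity for the device.
        return []
    cpulist_file = sysfs / "devices" / "system" / "node" / f"node{node}" / "cpulist"
    return parse_cpulist(cpulist_file.read_text())


def likwid_pin_command(cpus, count, app):
    """Build a likwid-pin invocation for the first `count` hardware threads."""
    chosen = cpus[:count]
    return ["likwid-pin", "-c", ",".join(str(c) for c in chosen)] + list(app)


if __name__ == "__main__":
    # Hypothetical PCI address and application; adjust both for a real system.
    near = cpus_near_pci_device("0000:3b:00.0")
    print(" ".join(likwid_pin_command(near, 10, ["./a.out"])))
```

The sketch takes the first hardware threads in the order the kernel lists them, which may include SMT siblings; a real affinity domain inside LIKWID could choose differently.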
Is your feature request related to a problem? Please describe.
Often, GPUs are not closest to the NUMA domain a human might expect (e.g., GPU 3 is closest to NUMA domain 0, etc.). Not every user remembers to run likwid-topology first to get the corresponding NUMA domains for their GPU(s).

Describe the solution you'd like
Add an affinity domain for likwid-pin and likwid-perfctr, e.g., G, for placing HW threads close to the GPU.

For example, pinning 10 HWthreads closest to GPU 1: