On some platforms (e.g. OLCF Summit), MPI ranks' visibility of GPUs is typically restricted with CUDA_VISIBLE_DEVICES.
We currently require that all ranks be able to see all GPUs, so we can detect GPU distance, for example:
stencil/include/stencil/partition.hpp
Lines 710 to 713 in 6770d3c
If every rank sees its GPU as ID 0, our GPU topology code will treat all of those GPUs as the same device, because CUDA device IDs are only meaningful within a single rank: each rank reports its own GPU as GPU 0.
It may be possible to have each rank report a UUID for each GPU instead of its CUDA device ID, and use that UUID throughout to distinguish GPUs.
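
A rough sketch of the UUID idea, not the library's actual code: assume each rank has already read the 16-byte UUID of every device it can see (the CUDA runtime exposes this as `cudaDeviceProp::uuid`) and that the per-rank lists have been gathered onto every rank, e.g. with `MPI_Allgather`. The names `GpuUuid` and `assign_global_ids` are hypothetical, and a plain byte array stands in for the CUDA UUID type so the example compiles without CUDA.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical stand-in for a 16-byte device UUID
// (in CUDA this would be copied from cudaDeviceProp::uuid.bytes).
using GpuUuid = std::array<unsigned char, 16>;

// uuidsPerRank[r][i] is the UUID of the device that rank r sees as local
// CUDA id i. Returns ids[r][i], a node-wide id that is equal for two
// entries exactly when their UUIDs are equal, regardless of local id.
std::vector<std::vector<int>> assign_global_ids(
    const std::vector<std::vector<GpuUuid>> &uuidsPerRank) {
  std::map<GpuUuid, int> seen;  // UUID -> node-wide id, in first-seen order
  std::vector<std::vector<int>> ids;
  ids.reserve(uuidsPerRank.size());
  for (const auto &local : uuidsPerRank) {
    std::vector<int> rankIds;
    rankIds.reserve(local.size());
    for (const auto &uuid : local) {
      auto it = seen.find(uuid);
      if (it == seen.end()) {
        // First time this physical device appears: give it the next id.
        it = seen.emplace(uuid, static_cast<int>(seen.size())).first;
      }
      rankIds.push_back(it->second);
    }
    assert(rankIds.size() == local.size());
    ids.push_back(rankIds);
  }
  return ids;
}
```

With this mapping, two ranks that each see a different physical GPU as local ID 0 get different node-wide IDs, while ranks that see the same GPU under different local IDs get the same one. The topology code could then key distance queries on the node-wide ID rather than the local CUDA ID.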
Once we can support this, we can allow users to tie CPU execution to CPUs with affinity for a particular GPU, which could improve performance.