-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA support for default Open-MPI implementation #1162
Comments
Thanks for pointing this out, we install openmpi from the efa installers, which in turn build the rpm's. We can add cuda support in a future release, until then you'll need to compile from source: |
Perfect - thanks for the quick response! |
I just wanted to check if CUDA support is enabled in the recent 2.4.1 release? |
Hi @tbrandvik Unfortunately the CUDA support has not been added yet. |
Hi @enrico-usai, I wanted to check if there are any plans to add this in upcoming releases? Thanks, |
Hi, is there any update? |
Hi, |
The compute instance and the head node have shared disk (such as |
Resolving this since as part of 3.9.0 release we included EFA installer 1.30.0 that contains both openmpi40-aws-4.1.6-2 and openmpi50-aws-5.0.0-11. Open MPI 4 from EFA installer does not support CUDA-awareness. Open MPI 5 does. |
Environment:
aws-parallelcluster-2.4.0
centos7
slurm
c5n.large
p3.2xlarge
Bug description and how to reproduce:
The default Open-MPI (/opt/amazon/efa) is not compiled with CUDA support.
Steps to reproduce:
ompi_info --parsable --all | grep mpi_built_with_cuda_support:value
on a compute node with a GPUThe output is as follows, indicating that CUDA support is disabled
mca:mpi:base:param:mpi_built_with_cuda_support:value:false
Is there a particular reason why CUDA support is not enabled or would it be possible to compile the default Open-MPI version to support CUDA in future releases?
The text was updated successfully, but these errors were encountered: