Skip to content

Commit

Permalink
Merge pull request olcf#194 from maxpkatz/nsight_compute_bug_doc
Browse files Browse the repository at this point in the history
Note Spectrum MPI + Nsight Compute incompatibility
  • Loading branch information
grahamlopez authored Feb 3, 2020
2 parents f39d1a2 + c9c0b61 commit bb59a53
Showing 1 changed file with 36 additions and 1 deletion.
37 changes: 36 additions & 1 deletion systems/summit_user_guide.rst
Original file line number Diff line number Diff line change
Expand Up @@ -3089,11 +3089,46 @@ please see the `Vampir Software Page <https://www.olcf.ornl.gov/software_package
Known Issues
============

Last Updated: 04 December 2019
Last Updated: 01 February 2020

Open Issues
-----------

Nsight Compute cannot be used with MPI programs
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

When profiling an MPI application using NVIDIA Nsight Compute, like the following,
you may see an error message in Spectrum MPI that aborts the program:

::

jsrun -n 1 -a 1 -g 1 nv-nsight-cu-cli ./a.out
Error: common_pami.c:1049 - ompi_common_pami_init() Unable to create PAMI client (rc=1)
--------------------------------------------------------------------------
No components were able to be opened in the pml framework.

This typically means that either no components of this type were
installed, or none of the installed components can be loaded.
Sometimes this means that shared libraries required by these
components are unable to be found/loaded.

Host: <host>
Framework: pml
--------------------------------------------------------------------------
PML pami cannot be selected

This is due to an incompatibility in the 2019.x versions of Nsight Compute with
Spectrum MPI. As a workaround, you can disable CUDA hooks in Spectrum MPI using

::
jsrun -n 1 -a 1 -g 1 --smpiargs="-disable_gpu_hooks" nv-nsight-cu-cli ./a.out

Unfortunately, this is incompatible with using CUDA-aware MPI in your application.

This will be resolved in a future release of CUDA.

CUDA hook error when program uses CUDA without first calling MPI_Init()
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Expand Down

0 comments on commit bb59a53

Please sign in to comment.