-
Notifications
You must be signed in to change notification settings - Fork 865
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MPAI broken with OpenMPI 3.0.1, pmix-1.2.5 and intel compiler #5260
Comments
are you saying pmix 1.2.5 does not work but pmix 2.1.1 does work ? |
note Open MPI 3.0.2 has been released, so I encourage you to upgrade to the latest version in order to get the latest bug fixes. |
Yes, it does work with pmix 2.1.1 but does not work with pmix 1.2.5. Sorry
if that wasn't clear.
I will verify if the issue exists with OpenMPI v3.0.2.
…On Tue, 12 Jun 2018, 00:41 Gilles Gouaillardet, ***@***.***> wrote:
note Open MPI 3.0.2 has been released, so I encourage you to upgrade to
the latest version in order to get the latest bug fixes.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#5260 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/ATEcL_EtEgDyYylQqHJ7fz-YiUQxPdGUks5t7wAcgaJpZM4UiraT>
.
|
I've reproduced the above issue with OpenMPI 3.0.2 and 3.1.0 using PMIx 1.2.5. I'd also like to add I'm using Intel Compiler 18.2. |
Related to #5501 |
Hey @rhc54 -- is this fixed in Open MPI 3.1.x HEAD (which has PMIx 2.1.3)? |
I suspect it will work as the user indicated that OMPI 3.x works fine with PMIx v2.1.x - it is the older PMIx v1.2.x that isn't compatible. I doubt that anyone is going to fix that problem. |
A few questions:
Additionally: how do we document this kind of stuff to the user? (i.e., OMPI / external PMIX compatibility -- does that reduce to https://docs.google.com/spreadsheets/d/1SwkUEzbFb1TvKuwHzOnPgjkW6OhmGmt1yrH3YN2IUQw/edit#gid=497420864, and does that need to be communicated to the user community somehow? |
I can answer 1. MPAI is the MPIR Process Acquisition Interface. It allows tools to locate jobs by debugging the starter process. I think I've seen most people refer to it as MPIR but I was reading the following at the time, https://www.mpi-forum.org/docs/mpir-specification-10-11-2010.pdf, which referred to it as MPAI. |
For the second question: I suspect the MPIR connection is a red herring. PMIx has nothing to do with MPIR. If it is generally true that OMPI 3.x is broken with PMIx v1.2.5, then the only suggestion I can make is to simply change the configure logic to reject the older version - or for someone to fix the integration issue 😄 |
FWIW, DDT attaching to MPI processes seems to work fine for me at the OMPI git head, the v3.0.x HEAD, and the v4.0.x HEAD -- all using the embedded PMIx, that is. That's not quite the same as what @knjmooney is testing, but I figured I'd at least provide those data points. |
This issue is stale. AFAIK, this has been fixed on the head of the v3.0.x, v3.1.x, and v4.0.x branches. |
I've compiled OpenMPI v3.0.1 with the following configure line
I've reduced the problem to the following
With pmix-1.2.5, the process hangs and I see the following message
ORTE_ERROR_LOG: Not supported in file orted/pmix/pmix_server_gen.c at line 362
. With pmix-2.1.1 the program runs to completion.I've attached bug_info.tar.gz which contains the output of
ompi_info --all
andcat config.log
for both compilation of OpenMPI 3.0.1 with pmix-1.2.5 and pmix-2.1.1. It also containshello.c
.I don't know if the issue is specific to the intel compiler but I was unable to configure with an external pmix and the gnu compiler.
The text was updated successfully, but these errors were encountered: