Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update to most recent disco ompi #6

Merged
merged 622 commits into from
Feb 26, 2016
Merged

Conversation

eddy16112
Copy link
Owner

No description provided.

ggouaillardet and others added 30 commits December 24, 2015 14:41
Thanks Marco Atzeri for contributing the original patch
Thanks Marco Atzeri for contributing the original patch
1. finer grained locks;
2. separate srq creation from cq adjustments.
This nasty bug was nicely masked. It was causing `mca_btl_vader_component.vader_frags_user`
overflow and as the result rear hangs of ompi-test-suite.
Thanks Willem Vermin for bringing this to our attention
replace test ... -o ... with test ... || test ...
and test ... -a ... with test ... && test ...
…x API.

Update the configure logic for the new pmix120 component

ckpt

Get the pmix120 component to work - still not really registering or handling notifications, but infrastructure now operates

Cleanup some of the symbol scopes, and provide a more comprehensive rename.h file. Will pretty it up later - let's see how this works

Cleanup the rename files to use the pretty macros
…ersubsription

rmaps/round_robin: warn if oversubscribing when manually setting the …
Standardize the handling of shutdown in the OOB TCP component
hjelmn and others added 28 commits February 18, 2016 16:56
There is a bug in MPMD detection that disables totalview if a : is
found anywhere on the command line. This includes inside an argument
option or MCA variable value. This commit changes the check to look
for the string " : " instead of the character : which should eliminate
the issue in most cases.

Signed-off-by: Nathan Hjelm <[email protected]>
…s. Fix a few IOF issues reported by Mark Santcroos when submitting jobs from tools. Add the ability to pass directives to the --output-filename option that tell ORTE to (a) not include the jobid in the path to the output files, and (b) not to copy the output to the tool (i.e., just store it in the files).

ck

Remove stale debug

Fix a segfault if no subscribers are present
btl/openib: XRC fix bug that could cause an invalid SRQ# to be used
orterun: allow DDT if options contain :'s
Cleanup some valgrind complaints about jumps with uninitialized values.
This commit fixes a bug that can occur when communicating via XRC to
peers on the same node. UDCM was not saving the SRQ numbers on the
loopback endpoint (which shares its ib_addr info with all local peers)
so any messages to local peers use an invalid SRQ number.

Fixes open-mpi#1383

Signed-off-by: Nathan Hjelm <[email protected]>
btl/openib: XRC save SRQ#s on the loopback endpoint
…ers for number of sub groups and write chunk size. This will allow to perform a systematic parameter study.
 1. use communicator collectives if possible for performance reasons
 2. combined multiple allgathers into a single one
…e remote nodes output locally to the files instead of sending it all back to the HNP.

Fix Solaris issues by renaming struct field
Cleanup the output-filename options so they work as expected.
… grow forever as we run lots and lots of jobs in the persistent DVM.
Convert the orte_job_data pointer array to a hash table so it doesn't…
Tools don't create the orte_job_data table, so don't remove jobs from it
and accept a proc from a different job
also remove code duplication by using ompi_proc_complete_init_single()

Thanks Siegmar Gross for reporting this issue, and Ralph for the guidance.
eddy16112 added a commit that referenced this pull request Feb 26, 2016
update to most recent disco ompi
@eddy16112 eddy16112 merged commit ad0d5f1 into eddy16112:master Feb 26, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.