-
Notifications
You must be signed in to change notification settings - Fork 864
WeeklyTelcon_20191008
- Dialup Info: (Do not post to public mailing list or public wiki)
- Geoffrey Paulsen (IBM)
- Jeff Squyres (Cisco)
- Austen Lauria (IBM)
- Brendan Cunningham (Intel)
- Brian Barrett (AWS)
- Edgar Gabriel (UH)
- Erik Zeiske
- Harumi Kuno (HPE)
- Howard Pritchard (LANL)
- Matthew Dosanjh (Sandia)
- Michael Heinz (Intel)
- Ralph Castain (Intel)
- Thomas Naughton (ORNL)
- Todd Kordenbrock (Sandia)
- William Zhang (AWS)
- Akshay Venkatesh (NVIDIA)
- Joshua Ladd (Mellanox)
- Noah Evans (Sandia)
- Artem Polyakov (Mellanox)
- Brandon Yates (Intel)
- Charles Shereda (LLNL)
- David Bernhold (ORNL)
- George Bosilca (UTK)
- Josh Hursey (IBM)
- Mark Allen (IBM)
- Matias Cabral (Intel)
- Nathan Hjelm (Google)
- Tom Naughton
- Xin Zhao (Mellanox)
- mohan (AWS)
-
Jeff changed a setting, and they seem to be working now.
-
Introduced Austen Lauria (IBM) who will be working more directly with Open MPI
-
Intel is going to move a different direction than prte, due to a fully PMIx compliant resource manager.
-
Ralph will be retiring, though answering email.
-
OMPI has been waiting for some git submodule work in Jenkins on AWS.
- Need someone to have someone to figure out why Jenkins doesn't like Jeff's PR.
- Anyone with github account for ompi team should have access.
- PR 6821
- Apparently Jenkin's isn't behaving as it should.
- Three pieces: Jenkins, CI, bot.
- AWS has a libfabirc setup like this for testing.
- Issue is that they're reworking the design, and will rollout for both libfabric and open-mpi.
- William Zhang talked to Brian
- Not something AWS team will work on, but Brian will work on it.
- Jeff will talk to Brian as well.
- Need someone to have someone to figure out why Jenkins doesn't like Jeff's PR.
-
Howard and Jeff have access to Jenkins on AWS. Part of the problem is that we don't have much expertise on Jenkins/AWS.
- William will probably be admining the Jenkins/AWS or communicating with those who will.
-
Merged
--recurse-submodules
update intoompi-scripts
Jenkins script as first step. Let's see if that works. -
Modular thread re-write (noah)
- UGNI and Vader BTLs were getting better performance, not sure why.
- For modular threading library, might be interesting to decide at compile time or runtime.
- Previously similar things seemed to be related to ICACHE.
- Howard will lok at.
Blockers All Open Blockers
Review v3.0.x Milestones v3.0.4
Review v3.1.x Milestones v3.1.4
- Release goal of Oct 31st.
- Need to put an RC out soon (will discuss date with Brian)
- Start drawing up a list of fixes that won't be backported to v3.0.x
- Datatype bug won't be backported, because it snowballed too big.
- Will put out a list at new 3.0.x and 3.1.x releases of issues fixed in v4.0.x that's NOT being backported... please upgrade, in either NEWS or README.
Review v4.0.x Milestones v4.0.2
-
Put out v4.0.2rc3 Monday
-
Release v4.0.2 this week.
-
XPMEM is failing, Howard will create issue.
-
Geoffroy Vallee has a system setup to run cross-compatibility, and can report out which versions are failing. Ralph will forward info to devel-core.
Review Master Master Pull Requests
- Compile failure on Master - OFI / Libfabric
- Fixed on v4.0.x, needs to be cherry-picked to master and other branches.
- OMPI_UNLIKELY versus OPAL_UNLIKELY
- IBM's PGI test has NEVER worked. Is it a real issue or local to IBM.
- Absoft 32bit fortran failures.
- Schedule: April 2020?
- Wiki page
- Some items:
- MPI1 removed stuff.
- Need a Face to face.
- Jeff will send out face-to-face doodle for weeks in Jan/Feb
- No discussion this week.
- See older weekday notes for prior items.
- No discussion this week.
- See older weekday notes for prior items.
- Please fill out doodle: https://doodle.com/poll/3mfn8ea74yx89dzh