-
Notifications
You must be signed in to change notification settings - Fork 865
WeeklyTelcon_20160510
Jeff Squyres edited this page Nov 18, 2016
·
1 revision
- Dialup Info: (Do not post to public mailing list or public wiki)
- Jeff Squyres
- Brad Benton
- george
- Howard
- Josh Hursey
- Joshua Ladd
- Ralph Castain
- Geoff Paulsen
- Ryan Grant
- Todd Kordenbrock
- Sylvain Jeaugey
- Milestones: https://github.com/open-mpi/ompi-release/milestones/v1.10.3
- PMI Barrier - 2 PRs waiting for verification.
- When launched by SLURM, use PMIx ModeX and Blocked Opal Progress.
- Need Howard or Nathan to verify these two.
- A bunch of Hangs in 1.10 series, but noone can replicate by hand.
- Possibly MTT induced? Some looks like App is not hung, but MTT timeout.
- George identified a Blocker on patcher stuff C++ hang issue.
- PMI Barrier - 2 PRs waiting for verification.
- Schedule? Maybe end of next week another RC. *
- Wiki: https://github.com/open-mpi/ompi/wiki/Releasev20
- Blocker Issues: https://github.com/open-mpi/ompi/issues?utf8=%E2%9C%93&q=is%3Aopen+milestone%3Av2.0.0+label%3Ablocker
- 1663 hwloc fix go in after the call
- Ralph will fix configure logic around external pmix
- if user asked for external pmix, but can't find it, it doesn't fail, but could break at runtime.
- Milestones: https://github.com/open-mpi/ompi-release/milestones/v2.0.0 *
- File-get-byte-offset - Edger (not here), jeff will ask about progress.
- coll tuned, two proc errors
- User Migration Guide vs Developer Migration Guide on wiki:
- Call for developers to update new features listed on Wiki
- https://github.com/open-mpi/ompi/wiki/User-Migration-Guide:-1.8.x-and-v1.10.x-to-v2.0.0
- https://github.com/open-mpi/ompi/wiki/Developer-Migration-Guide:-v1.8.x-and-v1.10.x-to-v2.x
- Jenkins is having problems, one is induced by Ralph,
- Ralph needs help by Josh Hursey or Josh Ladd.
- Env variable forwarding.
Review Master MTT testing (https://mtt.open-mpi.org/)
-
min-dist mapper test failing. Jeff opened Issue 1623.
- Ralph has some fix on master.
-
IBM would like an explicit declaration of license the website / documentation is available under
- no objections.
- IBM will file a pull request, and email devel for more discussion.
- Some Discussion on MTT Timeouts
- Issue is that if MTT Timeout happens during timeout, it looks like a timeout, rather than a success.
- Josh considering adding some additional functionality to grab stack traced on hang.
- Geoff mentioned a possible feature in Platform-MPI could be added,
- Mellanox, Sandia, Intel
- LANL, Houston, IBM
- Cisco, ORNL, UTK, NVIDIA