
WeeklyTelcon_20160510

Jeff Squyres edited this page Nov 18, 2016 · 1 revision

Open MPI Weekly Telcon


  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees

  • Jeff Squyres
  • Brad Benton
  • George
  • Howard
  • Josh Hursey
  • Joshua Ladd
  • Ralph Castain
  • Geoff Paulsen
  • Ryan Grant
  • Todd Kordenbrock
  • Sylvain Jeaugey

Agenda

Review 1.10

  • Milestones: https://github.com/open-mpi/ompi-release/milestones/v1.10.3
    • PMI Barrier - 2 PRs waiting for verification.
      • When launched by SLURM, use PMIx ModeX and Blocked Opal Progress.
      • Need Howard or Nathan to verify these two.
    • A number of hangs in the 1.10 series, but no one can replicate them by hand.
    • Possibly MTT-induced? In some cases it looks like the app is not hung, but MTT timed out.
    • George identified a blocker in the patcher code (C++ hang issue).
  • Schedule? Maybe another RC at the end of next week.

Review 2.0.x

Master PRs

  • File-get-byte-offset - Edgar (not here); Jeff will ask about progress.
  • coll tuned, two proc errors

v2.0 Migration Guides

Jenkins on Master

  • Jenkins is having problems; one is induced by Ralph.
    • Ralph needs help from Josh Hursey or Josh Ladd.
    • Env variable forwarding.

Review Master MTT testing (https://mtt.open-mpi.org/)

  • min-dist mapper test failing. Jeff opened Issue 1623.

    • Ralph has some fix on master.
  • IBM would like an explicit declaration of the license under which the website / documentation is available.

    • no objections.
    • IBM will file a pull request, and email devel for more discussion.

MTT Dev status:

  • Some Discussion on MTT Timeouts
    1. Issue is that if MTT Timeout happens during timeout, it looks like a timeout, rather than a success.
    2. Josh is considering adding functionality to grab stack traces on hang.
    3. Geoff mentioned that a feature from Platform-MPI could possibly be added.

Status Updates:


Status Update Rotation

  1. Mellanox, Sandia, Intel
  2. LANL, Houston, IBM
  3. Cisco, ORNL, UTK, NVIDIA

Back to 2016 WeeklyTelcon-2016
