Skip to content

WeeklyTelcon_20190226

Geoffrey Paulsen edited this page Mar 12, 2019 · 2 revisions

Open MPI Weekly Telecon


  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees (on Web-ex)

  • Geoff Paulsen
  • Jeff Squyres
  • Akshay Venkatesh
  • Jake Hemstad
  • Josh Hursey
  • Matthew Dosanjh
  • Howard Pritchard
  • Ralph Castain
  • Xin Zhao
  • Brian Barrett
  • Nathan Hjelm
  • Thomas Naughton
  • Geoffroy Vallee
  • Dan Topa

not there today (I keep this for easy cut-n-paste for future notes)

  • Todd Kordenbrock
  • Josh Hursey
  • Joshua Ladd
  • Matias Cabral
  • David Bernholdt
  • George
  • Edgar Gabriel
  • Aravind Gopalakrishnan (Intel)
  • Dan Topa (LANL)
  • Akshay Venkatesh (nVidia)
  • Arm (UTK)
  • Peter Gottesman (Cisco)
  • mohan

Agenda/New Business

  • Nathan Hjelm's day job will no longer involve Open MPI, so if you want him to review something, please check with him first.
  • Next face to face is San Jose - April 23-April25 @ Cisco -San Jose.
  • Jake Hemstad from nVidia has some new use-cases for Open MPI he'd like to discuss
  • Should we meet next week during MPI Forum?
    • No meeting next week. Geoff will send out email.

Minutes

Review v3.0.x Milestones v3.0.3

  • Schedule RC maybe later this week.
  • Jeff will help Brian out
  • Waiting on a vew PR reviews (Brian)
  • New update needed.
  • Need to merge PR3248.
  • Merged in a bunch of changes, and MTT still looks good.

Review v3.1.x Milestones v3.1.0

  • Schedule RC maybe later this week.
  • Merging PRs this morning
  • Merged in a bunch of changes, and MTT still looks good.
  • Consider disabling pmix-new-shmem mca param. (see PMIx Issue 1114)

Review v4.0.x Milestones v4.0.1

v5.0.0

  • Schedule: Delaying post Summer ***
  • Discussion of schedule depends on scope discussion
    • if we want to separate Orte out for that? Would be a bit past summer.
    • Giles has a prototype of PRTE replacing ORTE
  • Want to open up release-manager elections.
    • Now that we're delaying, will decide at face2face.
  • Is anyone pushing for a Summer of 2019 schedule?
    • It seems too aggressive to everyone on the call
    • One driver was to remove things to break ABI.
    • Not a bad idea to DO v5.0, but summer timing is bad.
    • Delaying would allow for switching to PRTE.
    • PMIx Tools support
  • Now the possibility of v4.1 from master is a possibility
    • If we instead do a v4.1, some things we'd need fixed on master.
  • will discuss more at face to face.

Master

  • Good Job Ralph fixed the 100% Cisco MTT fail.
  • Cisco now has 70,000+ good runs. Still some static build issues.

PMIx

  • Take a look at Gile's PRTE work. He may have done SOME of that. He should have done that all in PRTE layer, maybe just some MPI layer work remains.

MTT

  • IBM still has 10% failure rate and build issue. Please fix.

New topics

  • Jake from nVidia discussed their intereste in a Use-case using
    • https://github.com/dask/dask-mpi/issues/25
    • Is there a way to setup
    • Two main approaches:
      • convert existing Dask processes into MPI processes
      • dynamicly create MPI processes with MPI Dynamic tasking.
    • Sounds like targeting PMIx directly might not be best path.
      • MPI_Sessions is designed to solve this.
    • Other things in PMIx that might want to access.
    • PMIx will be getting Python bindings about 80% done, summer?
  • Next week - March 4th is next MPI Forum (then June)
    • No Open MPI weekly web-ex next week.
  • We have a new open-mpi SLACK channel for Open MPI developers.
    • Not for users, just developers...
    • email Jeff If you're interested in being added.

face to face -

  • how do we get more participation, and make MTT more meaningful?

Review Master Master Pull Requests

  • didn't discuss today.

Oldest PR

Oldest Issue


Status Updates:

Status Update Rotation

  1. Mellanox, Sandia, Intel
  2. LANL, Houston, IBM, Fujitsu
  3. Amazon,
  4. Cisco, ORNL, UTK, NVIDIA

Back to 2018 WeeklyTelcon-2018

Clone this wiki locally