Distributed Mesh Class #111

Open
kpwelsh opened this issue Oct 17, 2023 · 0 comments
kpwelsh commented Oct 17, 2023

For any kind of proper scalability, we rely on MPI. As it stands, we have separate code sitting on top of our mesh classes that operates with MPI: we use Zoltan2 for node balancing and Tpetra for MPI communication.
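For reference, the current pattern looks roughly like the sketch below: Zoltan2 partitions the nodes, and Tpetra import/export objects migrate the associated data. This is a hedged illustration of typical Zoltan2/Tpetra usage, not a copy of our code; the adapter choice, template parameters, and function names are assumptions.

```cpp
// Rough sketch of the current split: partitioning and data migration live in
// separate code on top of the mesh.  Names and types here are illustrative.
#include <Teuchos_RCP.hpp>
#include <Teuchos_ParameterList.hpp>
#include <Tpetra_MultiVector.hpp>
#include <Zoltan2_XpetraMultiVectorAdapter.hpp>
#include <Zoltan2_PartitioningProblem.hpp>

using MV      = Tpetra::MultiVector<double>;
using Adapter = Zoltan2::XpetraMultiVectorAdapter<MV>;

void rebalance_nodes(Teuchos::RCP<const MV> node_coords,
                     Teuchos::ParameterList &zoltan2_params)
{
    // Node balancing: geometric partitioning of the node coordinates.
    Adapter adapter(node_coords);
    Zoltan2::PartitioningProblem<Adapter> problem(&adapter, &zoltan2_params);
    problem.solve();

    // The resulting part assignment (problem.getSolution()) is then used to
    // build new Tpetra maps, and node/element data is migrated with
    // Tpetra::Import / doImport -- i.e., the MPI communication layer.
}
```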

We should consolidate this into a single layer on top of the mesh class. Out-of-the-box distributed mesh scaling would be significantly valuable to ELEMENTS users.

The distributed mesh would handle the following (a rough interface sketch follows the list):

  1. Node balancing
  2. Process mapping during read/write
  3. MPI Communications
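A sketch of what that consolidated layer could look like is below. The class name, method names, and members are assumptions made for illustration only; the real interface would be designed around the existing ELEMENTS mesh classes.

```cpp
// Hypothetical interface-only sketch for a distributed mesh layer.
// Names and signatures are illustrative, not a design commitment.
#include <mpi.h>
#include <string>
#include <vector>

class DistributedMesh {
public:
    explicit DistributedMesh(MPI_Comm comm) : comm_(comm) {}

    // 2. Process mapping during read/write: read a mesh file and assign an
    //    initial ownership of nodes/elements to each rank (and the reverse
    //    mapping when writing results back out).
    void read(const std::string &path);
    void write(const std::string &path) const;

    // 1. Node balancing: repartition ownership (e.g. via Zoltan2) and
    //    migrate node/element data accordingly.
    void balance();

    // 3. MPI communications: exchange ghost/halo node data for a field
    //    stored in our own data layout.
    void halo_exchange(double *node_field, int num_components);

private:
    MPI_Comm comm_;
    std::vector<long long> owned_node_gids_;   // nodes this rank owns
    std::vector<long long> ghost_node_gids_;   // nodes owned by neighbors
};
```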

During this process, we should consider moving away from Tpetra MultiVectors for MPI communication and implementing our own, for two reasons (see the sketch after this list):

  1. The MultiVector layout conflicts with our own data layout for several arrays, which forces us to transpose them and results in substantial over-communication. This is only a mild slowdown today, but we anticipate that future methods will take a much bigger hit.
  2. We would like to take advantage of GPU-Direct MPI communication in the future, and Tpetra is unlikely to support this.
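To make both points concrete, below is a minimal sketch of the kind of halo exchange a custom communication layer would allow: per-neighbor buffers packed in our own layout and handed straight to MPI, with no transpose into a MultiVector layout. With a CUDA-aware MPI build, the same pointers could be device pointers, which is the GPU-Direct path we want to keep open. The `Neighbor` structure and buffer management are hypothetical.

```cpp
// Minimal halo-exchange sketch assuming per-neighbor send/recv buffers
// already packed in our own array layout.  With a CUDA-aware MPI build,
// send_buf/recv_buf may be device pointers, enabling GPU-Direct transfers
// without staging through the host.
#include <mpi.h>
#include <vector>

struct Neighbor {
    int     rank;        // neighboring MPI rank
    double *send_buf;    // packed owned values the neighbor ghosts
    double *recv_buf;    // packed ghost values the neighbor owns
    int     send_count;  // number of doubles to send
    int     recv_count;  // number of doubles to receive
};

void halo_exchange(MPI_Comm comm, std::vector<Neighbor> &neighbors)
{
    std::vector<MPI_Request> requests;
    requests.reserve(2 * neighbors.size());

    // Post all receives and sends, then wait for completion.
    for (auto &n : neighbors) {
        MPI_Request req;
        MPI_Irecv(n.recv_buf, n.recv_count, MPI_DOUBLE, n.rank, 0, comm, &req);
        requests.push_back(req);
        MPI_Isend(n.send_buf, n.send_count, MPI_DOUBLE, n.rank, 0, comm, &req);
        requests.push_back(req);
    }
    MPI_Waitall(static_cast<int>(requests.size()),
                requests.data(), MPI_STATUSES_IGNORE);
}
```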
kpwelsh self-assigned this Nov 7, 2023