Understanding Running MPI with SLURM

What this code does

This code uses the trapezoidal rule to integrate a function y(x) over the interval [x_{min}, x_{max}], splitting the integration across multiple processors with MPI. It repeats the integration 100 times so that the job takes minutes to complete, which leaves time to inspect the processes running on the different nodes.
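
The repository's mpi_integrate.c is not reproduced here, but the sketch below shows the general idea under a few placeholder assumptions (the integrand x^2, the interval [0, 1], the trapezoid count, and the omission of the 100-fold repetition are all illustrative choices): each MPI rank integrates its own slice of the interval with the trapezoidal rule, and rank 0 combines the partial results with MPI_Reduce.

/* Minimal sketch, not the repository's mpi_integrate.c: each rank integrates
 * its own slice of [x_min, x_max] with the trapezoidal rule, and rank 0 sums
 * the partial results. Integrand, interval, and trapezoid count are placeholders. */
#include <stdio.h>
#include <mpi.h>

static double y(double x) { return x * x; }      /* placeholder integrand */

static double trapezoid(double a, double b, long n) {
    double h = (b - a) / n;
    double sum = 0.5 * (y(a) + y(b));
    for (long i = 1; i < n; i++)
        sum += y(a + i * h);
    return sum * h;
}

int main(int argc, char **argv) {
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    const double x_min = 0.0, x_max = 1.0;       /* placeholder interval */
    const long n_per_rank = 100000000L;          /* placeholder trapezoid count */

    /* Each rank handles an equal slice of [x_min, x_max]. */
    double width = (x_max - x_min) / size;
    double a = x_min + rank * width;
    double local = trapezoid(a, a + width, n_per_rank);

    /* Combine the partial integrals on rank 0. */
    double total = 0.0;
    MPI_Reduce(&local, &total, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("integral = %f\n", total);

    MPI_Finalize();
    return 0;
}

Compiling and running such a sketch follows the same mpicc/mpirun steps shown in the examples below.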

I use this code to better understand how to run MPI jobs in SLURM.

Dependencies

  1. MPI, tested with openmpi-1.8 and mvapich2-2.1
  2. C compiler, tested with GCC/4.9.2
  3. SLURM, tested with 17.11.7.

NOTE: We use environment modules to manage software on our system. Any mechanism that puts the compilers and MPI commands in your shell's PATH will work.

Examples

In order for these examples to work, you need to ensure that the number of tasks you request from SLURM (see salloc --ntasks) is greater than or equal to the number of processes you request with mpirun (see the -np argument). Also, when you hold an interactive allocation, it is VERY important to have a stable connection: if your connection drops, your interactive session is released.

Example 1 - Interactive:

Working on the cluster's login node, we make an allocation. salloc spawns a new shell (still on the login node), and from this new shell we can run our MPI job. Behind the scenes, SLURM dispatches the work from the login node to the interactive allocation and routes stdout back to the login node. After you execute mpirun, log into the allocated nodes and run top to inspect the processes.

$ module load openmpi-1.8/gcc-4.9.2
$ mpicc mpi_integrate.c 
$ salloc --ntasks=2 --nodes=2     # Gets two nodes with 1 core each, spawns new shell
     (within new shell...)       
$ mpirun -np 2 ./a.out            # Works as expected: one core on each node does the work.
                                  # Use a separate shell to log into each node in the allocation
                                  # and run `top`

Once a.out finishes, you still hold the allocation you requested from SLURM. This is particularly useful when you need a relatively large interactive allocation on a SLURM cluster.
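
For example, you can confirm that the allocation is still alive, reuse it, and then release it by leaving the salloc shell (squeue and exit are standard SLURM/shell commands; the sketch below assumes you are still in the shell spawned by salloc):

$ squeue -u $USER        # The interactive job is still listed as running
$ mpirun -np 2 ./a.out   # Reuse the same allocation for another run
$ exit                   # Leaving the salloc shell releases the allocation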

Example 2 - Interactive, what NOT to do:

Working on the cluster's login node, we make an allocation. Then, from any shell, we try to access the allocation with srun.

$ module load openmpi-1.8/gcc-4.9.2 # Get MPI into our PATH
$ mpicc mpi_integrate.c 
$ salloc --ntasks=2 --nodes=2       # Gets two nodes with 1 core each
     (from any shell...)
$ srun --mpi=openmpi --jobid=XXXX mpirun -np 2 ./a.out # WHOOPS! This effectively runs two instances
                                                       # of `mpirun -np 2 ./a.out`, one on each node,
                                                       # for a total of 4 MPI tasks, not the 2 expected

Example 3 - Batch

This is a batch job. The script does the same work as Example 1, non-interactively.

$ sbatch submit.sh
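
The repository's submit.sh is not shown here, but a script that does the same work as Example 1 would typically look something like the sketch below (the job name is a hypothetical choice; adjust the module name for your site):

#!/bin/bash
#SBATCH --job-name=mpi_integrate   # Hypothetical job name
#SBATCH --ntasks=2                 # Same request as salloc --ntasks=2
#SBATCH --nodes=2                  # Same request as salloc --nodes=2

module load openmpi-1.8/gcc-4.9.2  # Get MPI into the PATH, as in Example 1
mpirun -np 2 ./a.out               # SLURM already defines the allocation, so no extra flags are needed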

Key Takeaways

  1. The number of tasks/cores requested by mpirun (see mpirun -np) must be less than or equal to the number of tasks requested from SLURM (see salloc --ntasks).
  2. DO NOT USE srun in combination with salloc when running an interactive MPI job.
  3. For interactive jobs, you MUST have a stable connection to the cluster.
  4. Presumably, if you want to use MPI and OpenMP together, you would need to specify salloc --cpus-per-task so that each MPI task can use multiple cores. This still needs to be tested; see the sketch after this list.
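
For the untested hybrid case in item 4, the allocation would presumably look something like the hypothetical sketch below. This assumes the code were extended to use OpenMP, which mpi_integrate.c currently is not, and it has not been verified:

$ salloc --ntasks=2 --nodes=2 --cpus-per-task=4   # Hypothetical: 2 MPI tasks, 4 cores each
$ export OMP_NUM_THREADS=4                        # Hypothetical: let each task spawn 4 OpenMP threads
$ mpirun -np 2 ./a.out                            # Still 2 MPI processes; threads would fill the extra cores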
