Profiling MPI and benchmarking strong + weak scaling #3002

ali-ramadhan · 2021-03-10T17:09:21Z

ali-ramadhan
Mar 10, 2021
Maintainer

In PR #590 I added a small, quick strong scaling test, and @francispoulin calculated the scaling efficiency, which wasn't great:

np       efficiency
==       ==========
2         0.96
4         0.71
8         0.62
16        0.56
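
To put those numbers in terms of speedup: strong-scaling efficiency is conventionally defined as E(p) = T(1) / (p * T(p)), so the implied speedup is S(p) = p * E(p). The sketch below is in Python and assumes the table uses that conventional definition; the thread does not state the baseline, and the function name is only illustrative.

```python
# Efficiencies copied from the table above, keyed by number of MPI ranks.
efficiency = {2: 0.96, 4: 0.71, 8: 0.62, 16: 0.56}

def implied_speedup(ranks, eff):
    """Speedup S(p) = p * E(p), assuming E(p) = T(1) / (p * T(p))."""
    return ranks * eff

speedups = {p: implied_speedup(p, e) for p, e in efficiency.items()}
for p, s in speedups.items():
    print(f"np = {p:2d}: implied speedup = {s:.2f}x (ideal {p}x)")
```

Under that assumption, 16 ranks give roughly a 9x speedup rather than the ideal 16x.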

To improve performance we should probably do some MPI profiling to find the bottlenecks. We could also benchmark the distributed pressure solve and the halo filling separately to see how each one scales.
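
A rough sketch of what timing the two phases separately could look like, written in Python for illustration only. `fill_halos` and `solve_pressure` are hypothetical stand-ins (they just sleep), not functions from the actual code base, and the `timed` helper is likewise an assumption.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

totals = defaultdict(float)  # accumulated wall-clock seconds per phase

@contextmanager
def timed(phase):
    """Add the wall-clock time spent inside the block to totals[phase]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        totals[phase] += time.perf_counter() - start

# Hypothetical stand-ins for the real kernels; replace with actual calls.
def fill_halos():
    time.sleep(0.001)

def solve_pressure():
    time.sleep(0.002)

for _ in range(5):
    with timed("halo_fill"):
        fill_halos()
    with timed("pressure_solve"):
        solve_pressure()

for phase, seconds in sorted(totals.items()):
    print(f"{phase}: {seconds:.4f} s")
```

In an MPI run each rank would keep its own totals. Synchronizing ranks before each timed block is one common way to keep time spent waiting on other ranks from being charged to the wrong phase.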

Might also make sense to benchmark scaling with ShallowWaterModel to see if it's an IncompressibleModel issue. Might need a pretty large domain to see good scaling with a 2D shallow water model?

@tomchor pointed out that the benchmark could be flawed. We should make sure everything is compiled before timing starts, so that compilation time isn't included in the measurements. Could also try different problem sizes and a weak scaling benchmark in case the 1D/slab decomposition isn't helping.
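
One way to see why a slab decomposition could hurt strong scaling is to count halo cells per rank. The Python sketch below is a simple cell-count model, not a measurement: it assumes a periodic grid split evenly along x and y, ignores edge and corner halos, and uses a hypothetical 256^3 grid and a halo width of 1. It says nothing about latency or about the communication inside the pressure solver.

```python
def halo_to_interior_ratio(nx, ny, nz, px, py, halo=1):
    """Ratio of exchanged halo cells to interior cells per rank.

    Assumes a periodic nx*ny*nz grid split evenly into px*py subdomains
    along x and y, ignoring edge/corner halos.
    """
    lx, ly = nx // px, ny // py           # local extents in x and y
    interior = lx * ly * nz
    halo_cells = 0
    if px > 1:
        halo_cells += 2 * halo * ly * nz  # two faces normal to x
    if py > 1:
        halo_cells += 2 * halo * lx * nz  # two faces normal to y
    return halo_cells / interior

n = 256  # hypothetical grid size, chosen only for illustration
for p, (px, py) in {4: (2, 2), 16: (4, 4), 64: (8, 8)}.items():
    slab = halo_to_interior_ratio(n, n, n, 1, p)      # 1D/slab: split y only
    pencil = halo_to_interior_ratio(n, n, n, px, py)  # 2D/pencil: split x and y
    print(f"ranks={p:3d}  slab={slab:.4f}  pencil={pencil:.4f}")
```

In this model the slab's halo volume per rank stays fixed while its interior shrinks with the number of ranks, so the ratio grows linearly with rank count. A 2D split grows more slowly. In a weak scaling test, where the local problem size stays fixed, the ratio would stay roughly constant for both.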

Maybe try a different machine too. Not sure if there's a "proper" setup for doing these scaling benchmarks.

Bad scaling efficiency might also be a sign of missing barriers/waits?

@vchuravy We might ask for your help!

vchuravy · 2021-03-10T18:23:45Z

vchuravy
Mar 10, 2021
Collaborator

> We might ask for your help!

Happy to help.

francispoulin · 2021-03-10T18:27:24Z

francispoulin
Mar 10, 2021
Collaborator

Thanks @ali-ramadhan for doing this. I wonder if we could modify this script and run it on ShallowWaterModel to start doing some strong scaling tests for that model?

ali-ramadhan · 2021-03-16T18:47:05Z

ali-ramadhan
Mar 16, 2021
Maintainer Author

Got some helpful replies from Julia Discourse: https://discourse.julialang.org/t/how-to-profile-julia-mpi-code/57136/4

The leading suggestion, from @simonbyrne, is to try NVIDIA Nsight, which might let us do both GPU profiling and MPI profiling!

tomchor · 2021-03-16T20:17:11Z

tomchor
Mar 16, 2021
Collaborator

This registration is still open: https://portal.xsede.org/course-calendar/-/training-user/class/2310/session/3970

It's free and it's happening on Thursday. I'm considering attending myself.

ali-ramadhan · 2021-03-17T00:49:13Z

ali-ramadhan
Mar 17, 2021
Maintainer Author

Thanks for the heads up, just signed up!

francispoulin · 2021-03-17T01:20:54Z

francispoulin
Mar 17, 2021
Collaborator

Thanks, and me too!

glwagner · 2023-03-22T16:13:13Z

glwagner
Mar 22, 2023
Maintainer

@simone-silvestri has done a bit of this. @simone-silvestri feel free to post your results here. I'm converting this to a discussion.
