Performance tools #185
matthiasdiener asked this question in General
Did you try https://vmprof.readthedocs.io/en/latest/?
It would be very useful to be able to pick apart the operator performance. Currently it all shows up as one blob, "stepping", from which it is hard to extract useful information. We would like to see the performance characteristics of the following (some of which you already mentioned):
We are actually facing a "performance situation" right now where having the boundary timings would be great, and would have helped us catch something earlier.
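As a rough illustration of what splitting the "stepping" blob into separate timings could look like, here is a minimal sketch using only the Python standard library. `StepTimer`, `section`, and the "interior"/"boundary" labels are hypothetical names made up for this example; they are not part of mirgecom or its logging infrastructure. It measures host-side wall-clock time only, so with asynchronous OpenCL execution each section would also need a synchronization point (for example a queue `finish()`) before the numbers mean anything.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StepTimer:
    """Accumulate wall-clock time per named sub-step (hypothetical helper)."""

    def __init__(self):
        self.totals = defaultdict(float)  # seconds per section name
        self.counts = defaultdict(int)    # number of calls per section name

    @contextmanager
    def section(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the elapsed time even if the section raises.
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def report(self):
        # Most expensive section first.
        ordered = sorted(self.totals.items(), key=lambda kv: -kv[1])
        return [
            f"{name}: {total:.6f} s over {self.counts[name]} calls"
            for name, total in ordered
        ]


def step(timer, n=10_000):
    # Stand-ins for the real operator pieces; the work here is illustrative only.
    with timer.section("interior"):
        interior = sum(i * i for i in range(n))
    with timer.section("boundary"):
        boundary = sum(i for i in range(n // 10))
    return interior + boundary


timer = StepTimer()
for _ in range(5):
    step(timer)
report = timer.report()
print("\n".join(report))
```

Keeping per-section totals and call counts separate makes it possible to tell a section that is slow per call from one that is simply called often.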
Overview of the current status of performance measurement tools used with mirgecom.
What we have
(Internal) 1. OpenCL kernel profiling via the builtin profiling
(Internal) 2. Performance logging via PR #96
(External) 3. Python profiling: install with
$ pip install pyinstrument
then replace python with pyinstrument on the command line:
$ pyinstrument -m mpi4py examples/wave-eager-mpi.py
(External) 4. Low-level profiling (VTune, perf, nvprof, ...)
What we don't have
The issues below make it difficult to get a comprehensive profile at the moment.
Issue 1: Matching OpenCL kernels to the Python code
Issue 2: Missing short(?) Python functions from profiling
cProfile
cProfile
for now.
Issue 3: Low-level (non-kernel) performance
Issue 4: Mismatch between OpenCL kernel profile and nvprof
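For Issue 2, one way to check whether short functions are being dropped is to compare against cProfile, which is deterministic and records every call, whereas pyinstrument is a statistical sampler. The sketch below is only a standalone illustration: `short_helper` and `driver` are made-up stand-ins, not mirgecom functions, and cProfile's per-call overhead can distort the relative cost of very short functions.

```python
import cProfile
import io
import pstats


def short_helper(x):
    # A deliberately tiny function that a sampling profiler may miss.
    return x + 1


def driver(n=1000):
    total = 0
    for i in range(n):
        total += short_helper(i)
    return total


profiler = cProfile.Profile()
profiler.enable()
result = driver()
profiler.disable()

# Collect the report into a string instead of printing straight to stdout.
stream = io.StringIO()
stats = pstats.Stats(profiler, stream=stream)
stats.sort_stats("cumulative").print_stats(10)
profile_text = stream.getvalue()
print(profile_text)
```

The report lists `short_helper` together with its call count, which makes it easy to see how often such a function runs even when each call is too brief to be sampled.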
Thoughts?