You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently AdvancedProfiler only outputs textual summary where the rows are sorted by cumulative time. Unfortunately this does not give a full picture about the structure and duration of call stacks involved.
Pitch
Raw cProfiler dumps provide more information to help identify bottlenecks than just module names and line numbers. E.g. using https://jiffyclub.github.io/snakeviz one can see the hierarchy of the call stacks and sort/drill-down using an interactive interface.
Luckily there's not much needed to make this available. AdvancedProfiler already collects profiling data we can easily dump using dump_stats():
Description & Motivation
Currently
AdvancedProfiler
only outputs textual summary where the rows are sorted by cumulative time. Unfortunately this does not give a full picture about the structure and duration of call stacks involved.Pitch
Raw cProfiler dumps provide more information to help identify bottlenecks than just module names and line numbers. E.g. using https://jiffyclub.github.io/snakeviz one can see the hierarchy of the call stacks and sort/drill-down using an interactive interface.
Luckily there's not much needed to make this available.
AdvancedProfiler
already collects profiling data we can easily dump using dump_stats():I'm willing to contribute this but would like to clarify the best configuration and injection options. Thanks!
Alternatives
Write a custom callback and do the profiling from there - thus not using the one provided by PyTorch Lightning.
Additional context
Related to #7424
cc @Borda @carmocca
The text was updated successfully, but these errors were encountered: