
Release 3.8.0 corresponding to NGC container 21.02

@dzier dzier released this 26 Feb 21:16
· 1 commit to r21.03 since this release

PyProf - PyTorch Profiler for GPUs

PyProf is a tool that profiles and analyzes the GPU performance of PyTorch models. It aggregates kernel performance data from Nsight Systems or NvProf. For each kernel, the tool can identify the layer that launched it, identify the tensor dimensions and precision, perform forward-backward correlation, determine Tensor Core usage, and point to the line in the user's code that launched it.

What's New In 3.8.0

  • torch.ones and torch.zeros input can now be a list, a tuple, or a sequence of integers
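
The three input forms named above correspond to the PyTorch call styles torch.ones([2, 3]), torch.ones((2, 3)), and torch.ones(2, 3). As a rough illustration of what accepting all three involves, the sketch below collapses them into a single tuple of integers. The helper name normalize_size is hypothetical and written only for this example; it is not taken from PyProf's source and does not import PyTorch.

```python
from typing import Tuple


def normalize_size(*size) -> Tuple[int, ...]:
    """Hypothetical helper: collapse the accepted call forms into one tuple of ints."""
    # A single list or tuple argument, e.g. f([2, 3]) or f((2, 3)), is unpacked.
    if len(size) == 1 and isinstance(size[0], (list, tuple)):
        size = tuple(size[0])
    # Otherwise the arguments are treated as a bare sequence of integers, e.g. f(2, 3).
    if not all(isinstance(dim, int) and not isinstance(dim, bool) for dim in size):
        raise TypeError(f"expected integer dimensions, got {size!r}")
    return tuple(size)


# All three call forms yield the same normalized shape.
print(normalize_size([2, 3]))
print(normalize_size((2, 3)))
print(normalize_size(2, 3))
```

Normalizing at one entry point means any later analysis only has to handle one representation of the shape, whichever form the caller used.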