Skip to content

Use model-cast-to-bfloat16 rather than AMP-to-bfloat16 for inference.#9198

Merged
titu1994 merged 2 commits intoNVIDIA:mainfrom galv:dgalvez/fix-autocast-slowness-2Jun 6, 2024

Commits

Commits on Jun 5, 2024