-
Notifications
You must be signed in to change notification settings - Fork 3.5k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[MetaSchedule] Use
shared.dyn
for Tensor Core Schedule Rules (#13891)
This PR adds Tensor Core intrinsics with `shared.dyn` scope and changes the default rules to use `shared.dyn`. Here are the performance improvement of GEMM 1024x1024x1024 on my device (RTX-3080) | | Use `shared` | Use `shared.dyn`. | Speedup | | ----------- | --------------------- | -------------------- | ---------- | fp 16-16-16 | 66399.8766 GFLOPs | 71778.3808 GFLOPs | 8.1% | fp 16-16-32 | 44292.5893 GFLOPs | 49070.2514 GFLOPS | 10.8% | cc @vinx13 @junrushao @masahi
- Loading branch information
Showing
5 changed files
with
198 additions
and
119 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.