[CUTLASS] Profile only the largest-possible alignment by default #10036

Merged 15 commits on Jan 26, 2022

@masahi masahi commented Jan 24, 2022

When the largest alignment is usable (for example, 8-element alignment for fp16), there is no need to profile the smaller-alignment variants, since align8 kernels are always faster than align4, align2, etc. This should result in a 3-4x speedup in tuning time.
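To illustrate the idea, here is a minimal sketch of the selection logic, not the actual code in `python/tvm/contrib/cutlass/build.py`. The helper names, the candidate list `[8, 4, 2, 1]`, and the divisibility check are assumptions made for illustration; only the `profile_all_alignments` option name comes from the commit messages in this PR.

```python
from typing import List, Sequence

# Assumed candidate alignments (in elements) for fp16, largest first.
FP16_ALIGNMENTS = (8, 4, 2, 1)


def valid_alignments(dims: Sequence[int],
                     candidates: Sequence[int] = FP16_ALIGNMENTS) -> List[int]:
    """Return the candidates that evenly divide every dimension in `dims`."""
    return [a for a in candidates if all(d % a == 0 for d in dims)]


def alignments_to_profile(dims: Sequence[int],
                          profile_all_alignments: bool = False,
                          candidates: Sequence[int] = FP16_ALIGNMENTS) -> List[int]:
    """Pick which alignment variants to profile for a workload.

    By default only the largest valid alignment is kept, on the premise
    that kernels with a larger alignment are never slower than those
    with a smaller one.
    """
    valid = valid_alignments(dims, candidates)
    if not valid:
        raise ValueError(f"no candidate alignment divides all of {list(dims)}")
    if profile_all_alignments:
        return valid
    return [max(valid)]


if __name__ == "__main__":
    # Every dimension is divisible by 8, so only align8 is profiled.
    print(alignments_to_profile([1024, 768, 3072]))
    # 1022 is divisible by 2 but not by 4 or 8, so align2 is the largest option.
    print(alignments_to_profile([1022, 768]))
    # Opting back in to the exhaustive behavior.
    print(alignments_to_profile([1024, 768, 3072], profile_all_alignments=True))
```

In this sketch, a workload whose dimensions are all divisible by 8 has one alignment variant to profile instead of four, which is roughly in line with the 3-4x tuning-time estimate above.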

Other changes

@comaniac @Laurawly

@comaniac comaniac left a comment

Otherwise LGTM

python/tvm/contrib/cutlass/build.py (outdated, resolved)
@masahi masahi force-pushed the cutlass-largest-alignment branch 2 times, most recently from 19a8f8c to 340bc5d Compare January 24, 2022 19:42
@masahi masahi force-pushed the cutlass-largest-alignment branch from e5b8825 to 771c8b7 Compare January 25, 2022 01:10
@masahi masahi force-pushed the cutlass-largest-alignment branch from 771c8b7 to 6de3d64 Compare January 26, 2022 00:54
@masahi masahi merged commit 1b9b05e into apache:main Jan 26, 2022
sunggg pushed a commit to sunggg/tvm that referenced this pull request Jan 29, 2022
…che#10036)

* introduce profile_all_alignments option

* add profile_all_alignment option to API

* wip

* fixed dynamic case

* black

* update gen_gemm too

* minor improvement

* fix

* all tests work

* add doc

* fixed for sm = 75 case

* fix typo

* remove unused import

* profile_all -> find_first_valid

* fix
ylc pushed a commit to ylc/tvm that referenced this pull request Feb 16, 2022