[Software pipeline] Fix hardcoded index in `access_ptr` rewriting, add a GPU test with depth 4 #11495

masahi · 2022-05-27T19:36:56Z

Fix a hardcoded index in access_ptr rewriting, which assumes that the number of stages is 2.

Refactored MMA code in test_tir_schedule_tensorize_ldmatrix_mma.py, so that it can be used by other tests. The new test in test_tir_transform_inject_software_pipeline.py applies software pipelining annotations to the MMA-tensorized schedule with software_pipeline_stage = [0, 0, 3], which makes global to shared load pipelined with depth 4. Without async copy, this is not useful for performance. But it does demonstrate that a multi-stage pipeline with depth > 2 works on a semi-realistic GPU schedule.

The test uses large dynamic shared memory, which serves as a test case for #11478.

@vinx13 @junrushao1994 @csullivan

masahi changed the title ~~[Software pipeline] Fix hardcoded index in access_ptr rewriting, add a GPU test with depth 3~~ [Software pipeline] Fix hardcoded index in access_ptr rewriting, add a GPU test with depth 4 May 27, 2022

vinx13 approved these changes May 27, 2022

View reviewed changes

masahi added 8 commits May 28, 2022 05:36

fixed hard-coded index in software pipeling

d4769f7

fixed three-stage pipeline test

cf4dc36

add three stage pipelined gemm test

869fb97

refactor mma test

966d0c0

use mma_4k schedule utility in test

24af027

apply pipeling annotation

3d8b3cc

black

c403252

require ampere in test

853a128

masahi force-pushed the software-pipe-index-fix branch from b0b3a40 to 853a128 Compare May 27, 2022 20:36

masahi merged commit 2389f1f into apache:main May 28, 2022

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Software pipeline] Fix hardcoded index in `access_ptr` rewriting, add a GPU test with depth 4 #11495

[Software pipeline] Fix hardcoded index in `access_ptr` rewriting, add a GPU test with depth 4 #11495

masahi commented May 27, 2022 •

edited

Loading

[Software pipeline] Fix hardcoded index in access_ptr rewriting, add a GPU test with depth 4 #11495

[Software pipeline] Fix hardcoded index in access_ptr rewriting, add a GPU test with depth 4 #11495

Conversation

masahi commented May 27, 2022 • edited Loading

[Software pipeline] Fix hardcoded index in `access_ptr` rewriting, add a GPU test with depth 4 #11495

[Software pipeline] Fix hardcoded index in `access_ptr` rewriting, add a GPU test with depth 4 #11495

masahi commented May 27, 2022 •

edited

Loading