[rocm6.1] Update Triton dependency logic #47

Merged
2 commits merged into rocm6.1 from rocm6.1_with_triton_hash on Mar 22, 2024

Conversation

@jithunnair-amd commented Mar 21, 2024

@jithunnair-amd (Author) commented:

Test results for the PyTorch 2.0 and Triton wheels generated from http://rocm-ci.amd.com/job/rocm-pytorch-manylinux-wheel-builder/154, installed in rocm/pytorch-private:manylinux-hipclang-13484:

[root@ed0329edef16 apex]# pip3 install -f https://compute-artifactory.amd.com/artifactory/compute-pytorch-rocm/compute-rocm-dkms-no-npi-hipclang/13484/ torch==2.0.1 torchvision==0.15.2
Looking in links: https://compute-artifactory.amd.com/artifactory/compute-pytorch-rocm/compute-rocm-dkms-no-npi-hipclang/13484/
Collecting torch==2.0.1
  Downloading https://compute-artifactory.amd.com/artifactory/compute-pytorch-rocm/compute-rocm-dkms-no-npi-hipclang/13484/torch-2.0.1%2Brocm6.1-cp39-cp39-linux_x86_64.whl (2095.9 MB)
...
Collecting pytorch-triton-rocm==2.0.2+f84c1f1e62 (from torch==2.0.1)
  Downloading https://compute-artifactory.amd.com/artifactory/compute-pytorch-rocm/compute-rocm-dkms-no-npi-hipclang/13484/pytorch_triton_rocm-2.0.2%2Bf84c1f1e62-cp39-cp39-linux_x86_64.whl (206.3 MB)
...

[root@ed0329edef16 pytorch-micro-benchmarking]# python3 micro_benchmarking_pytorch.py --network resnet50 --compile
INFO: running forward and backward for warmup.
/opt/python/cp39-cp39/lib/python3.9/site-packages/torch/_inductor/compile_fx.py:90: UserWarning: TensorFloat32 tensor cores for float32 matrix multiplication available but not enabled. Consider setting `torch.set_float32_matmul_precision('high')` for better performance.
  warnings.warn(
INFO: running the benchmark..
OK: finished running benchmark..
--------------------SUMMARY--------------------------
Microbenchmark for network : resnet50
Num devices: 1
Dtype: FP32
Mini batch size [img] : 64
Time per mini-batch : 0.11212306022644043
Throughput [img/sec] : 570.8014022338267
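
The UserWarning in the log above can be acted on by opting in to TF32 matmul before running the benchmark. torch.set_float32_matmul_precision is the standard PyTorch API the warning itself names; whether it changes the numbers above on this hardware was not measured here:

import torch

# Opt in to TensorFloat32 tensor cores for float32 matrix multiplication,
# as suggested by the torch._inductor UserWarning in the log above.
torch.set_float32_matmul_precision("high")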

@jithunnair-amd force-pushed the rocm6.1_with_triton_hash branch from be72ec9 to 608d4f4 on March 22, 2024 21:24
@jithunnair-amd marked this pull request as ready for review March 22, 2024 21:24
@jithunnair-amd changed the title from "Update Triton dependency logic" to "[rocm6.1] Update Triton dependency logic" Mar 22, 2024
@jithunnair-amd merged commit 4f4f6bd into rocm6.1 Mar 22, 2024
2 checks passed
@jithunnair-amd deleted the rocm6.1_with_triton_hash branch March 22, 2024 21:30
jithunnair-amd added a commit that referenced this pull request Apr 11, 2024
* Use hashed triton version

* skip triton dependency for PyTorch < 2.0

(cherry picked from commit 4f4f6bd)
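
The commit message above summarizes the dependency logic, but the diff itself is not shown in this conversation. Below is a minimal sketch of what "Use hashed triton version" and "skip triton dependency for PyTorch < 2.0" could look like; the function name and version strings are illustrative assumptions, not the actual builder code:

# Hypothetical sketch of the Triton dependency logic described by the
# commit message above; names and version strings are assumptions.
def triton_requirement(pytorch_version: str, triton_commit_hash: str):
    """Return a pip requirement for pytorch-triton-rocm, or None when
    the PyTorch version predates Triton support."""
    major, minor = (int(x) for x in pytorch_version.split(".")[:2])
    # "skip triton dependency for PyTorch < 2.0"
    if (major, minor) < (2, 0):
        return None
    # "Use hashed triton version": pin to the exact Triton commit the
    # wheel was built from, matching the 2.0.2+f84c1f1e62 wheel above.
    return f"pytorch-triton-rocm==2.0.2+{triton_commit_hash[:10]}"

print(triton_requirement("2.0.1", "f84c1f1e62"))   # pytorch-triton-rocm==2.0.2+f84c1f1e62
print(triton_requirement("1.13.1", "f84c1f1e62"))  # None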
jithunnair-amd added a commit that referenced this pull request Jul 1, 2024
* Use hashed triton version

* skip triton dependency for PyTorch < 2.0

(cherry picked from commit 4f4f6bd)

Add ROCM_VERSION to triton dependency (#48)

(cherry picked from commit c74ab20)

Use ROCm version with patch for triton dependency (#49)
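
PRs #48 and #49, referenced in the commit message above, extend this by embedding the ROCm version (including its patch level) in the requirement's local version tag. A hedged sketch follows; the rocm<version> tag format is an assumption modeled on wheel names earlier in this thread (e.g. torch-2.0.1+rocm6.1):

# Hypothetical extension for #48/#49: include the full ROCm version
# (major.minor.patch) in the local version tag. The tag format is an
# assumption, not the actual implementation.
def triton_requirement_with_rocm(triton_commit_hash: str, rocm_version: str):
    return f"pytorch-triton-rocm==2.0.2+rocm{rocm_version}.{triton_commit_hash[:10]}"

print(triton_requirement_with_rocm("f84c1f1e62", "6.1.0"))
# pytorch-triton-rocm==2.0.2+rocm6.1.0.f84c1f1e62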
jithunnair-amd added a commit that referenced this pull request Jul 4, 2024 (same cherry-picked changes as the Jul 1 commit above)
jithunnair-amd added a commit that referenced this pull request Jul 12, 2024 (same cherry-picked changes as the Jul 1 commit above)