[ROCm][Hardware][AMD] Adding Navi21 to fallback to naive attention if Triton is not used #4658

alexeykondrat · 2024-05-07T16:30:32Z

Navi3X has major HW version 11, Navi21/Navi10 have major HW version 10, and the MI series has major HW version 9.
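As a rough illustration of the fallback described in the PR title, the selection could be sketched as below. This is a minimal sketch, not the code in `vllm/attention/backends/rocm_flash_attn.py`: the function name, the constant, and the choice to pass the major HW version in as a plain integer are all assumptions made for the example. It only encodes what the description states: Navi parts (major version 10 or 11) fall back to naive attention when Triton is not used.

```python
# Hypothetical helper for illustration only; names are not taken from vLLM.
# Major HW versions from the PR description: 11 = Navi3X, 10 = Navi21/Navi10.
NAVI_MAJOR_VERSIONS = frozenset({10, 11})


def should_use_naive_attention(hw_major: int, use_triton: bool) -> bool:
    """Return True when the naive attention path should be selected.

    Assumption: the fallback applies only when Triton is not used and the
    device reports a Navi major HW version (10 or 11).
    """
    if use_triton:
        # Triton flash attention is in use, so no fallback is needed.
        return False
    return hw_major in NAVI_MAJOR_VERSIONS
```

Under these assumptions, a Navi21 device (major version 10) with Triton disabled selects the naive path, while an MI-series device (major version 9) does not.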

https://github.com/ROCm/FasterTransformer-Internal/issues/247

WoosukKwon

@alexeykondrat Thanks for submitting the PR! Could you please fix the comment a bit? Thanks!

vllm/attention/backends/rocm_flash_attn.py


Adding Navi21 to fallback to naive attention if Triton is not used

a663b73

WoosukKwon added the rocm label May 7, 2024

WoosukKwon approved these changes May 16, 2024


vllm/attention/backends/rocm_flash_attn.py (outdated)

Update vllm/attention/backends/rocm_flash_attn.py

73f22c4

WoosukKwon enabled auto-merge (squash) May 18, 2024 04:03

WoosukKwon merged commit c0724fc into vllm-project:main May 18, 2024
59 checks passed

robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request May 19, 2024

[ROCm][Hardware][AMD] Adding Navi21 to fallback to naive attention if…

3bbe65e

… Triton is not used (vllm-project#4658)

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024

[ROCm][Hardware][AMD] Adding Navi21 to fallback to naive attention if…

94a7c8b

… Triton is not used (vllm-project#4658)

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

[ROCm][Hardware][AMD] Adding Navi21 to fallback to naive attention if…

9b620b7

… Triton is not used (vllm-project#4658)

alexeykondrat deleted the navi-fix branch September 11, 2024 01:41
