
Disable custom all reduce by default #2808

Merged · 1 commit merged into main · Feb 8, 2024
Conversation

WoosukKwon (Collaborator):
This PR temporarily disables the custom all-reduce kernels. We will enable them once the stability issues are resolved.
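In vLLM this behavior is controlled by the `disable_custom_all_reduce` parallel-config option, which this PR flips on by default. The snippet below is an illustrative sketch of that gating idea (a standalone mock, not vLLM's actual implementation): when the custom kernels are disabled, tensor-parallel workers fall back to the standard NCCL all-reduce.

```python
from dataclasses import dataclass


@dataclass
class ParallelConfig:
    # After this PR, the custom all-reduce kernels are off unless
    # explicitly re-enabled by the user.
    disable_custom_all_reduce: bool = True
    world_size: int = 2


def select_all_reduce_backend(cfg: ParallelConfig) -> str:
    """Pick an all-reduce implementation for the given config.

    Mirrors the gating idea of the PR: use NCCL whenever the custom
    kernels are disabled, and skip all-reduce entirely on one GPU.
    """
    if cfg.world_size == 1:
        return "none"
    if cfg.disable_custom_all_reduce:
        return "nccl"
    return "custom"


# Default after this PR: custom kernels disabled, NCCL used.
print(select_all_reduce_backend(ParallelConfig()))  # nccl
# Explicitly opting back in to the custom kernels:
print(select_all_reduce_backend(ParallelConfig(disable_custom_all_reduce=False)))  # custom
```

Users who want the custom kernels back can pass the equivalent of `disable_custom_all_reduce=False` when constructing the engine, at their own risk until the stability issues are resolved.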

@simon-mo simon-mo merged commit 3711811 into main Feb 8, 2024
17 checks passed
hanzhi713 added a commit to hanzhi713/vllm that referenced this pull request Feb 10, 2024
yhu422 added a commit to yhu422/vllm that referenced this pull request Feb 13, 2024
- [ROCm] Fix build problem resulted from previous commit related to FP8 kv-cache support (vllm-project#2790)
- Add documentation on how to do incremental builds (vllm-project#2796)
- [Ray] Integration compiled DAG off by default (vllm-project#2471)
- Disable custom all reduce by default (vllm-project#2808)
- add usage context
- removed usage_context from Engine_args
- Move IO to another process
- added http request
- [ROCm] support Radeon™ 7900 series (gfx1100) without using flash-attention (vllm-project#2768)
- Add documentation section about LoRA (vllm-project#2834)
- Refactor 2 awq gemm kernels into m16nXk32 (vllm-project#2723)
  Co-authored-by: Chunan Zeng <[email protected]>
- Added additional arg for from_engine_args
- comments
alexm-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Feb 13, 2024
jvmncs pushed a commit to jvmncs/vllm that referenced this pull request Feb 14, 2024
@WoosukKwon WoosukKwon deleted the disable-custom-ar branch February 15, 2024 06:47
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024
Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024
Labels: none · Projects: none · 3 participants