Change scheduler & input tensor shape #1381

Merged: 2 commits from fix-scheduler into main on Oct 17, 2023
Conversation

WoosukKwon (Collaborator)

This PR updates the scheduler and model code to use 2D tensors instead of 1D tensors. The change will enable using a wider range of libraries and hardware, and facilitate future optimizations such as CUDA graphs.
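For illustration, here is a minimal sketch of the difference between a flattened 1D input layout and a padded 2D layout. This is not the actual vLLM code; the token IDs and padding value are made up.

```python
import torch

# Three sequences of different lengths (hypothetical token IDs).
seqs = [[11, 12, 13], [21, 22], [31, 32, 33, 34]]
pad_id = 0  # assumed padding token ID, purely for illustration

# 1D layout: all tokens flattened into a single vector of shape [total_num_tokens].
flat = torch.tensor([tok for seq in seqs for tok in seq])
print(flat.shape)  # torch.Size([9])

# 2D layout: shape [batch_size, max_seq_len], with shorter rows padded
# up to the longest sequence in the batch.
max_len = max(len(seq) for seq in seqs)
padded = torch.tensor([seq + [pad_id] * (max_len - len(seq)) for seq in seqs])
print(padded.shape)  # torch.Size([3, 4])
```

One reason a fixed [batch_size, max_seq_len] shape helps with optimizations such as CUDA graph capture is that a captured graph is replayed with fixed tensor shapes.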

zhuohan123 (Member) left a comment

LGTM! A small change may be needed in the comment.

vllm/model_executor/input_metadata.py (review comment, outdated and resolved)
WoosukKwon merged commit c1376e0 into main on Oct 17, 2023
2 checks passed
WoosukKwon deleted the fix-scheduler branch on Oct 17, 2023 at 00:48
yunfeng-scale (Contributor) commented Oct 19, 2023

I'm worried about performance degradation from this padding; could you try benchmarking? Sorry, it seems the number of elements doesn't change, so it shouldn't affect performance.

frankxyy commented

Hi, does the change from 1D to 2D tensors have a negative effect on prefill throughput? Sequences of different lengths will be padded to the maximum length before computation, so the total FLOPs required increases.
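To make the padding overhead concrete, here is a rough back-of-the-envelope sketch; the sequence lengths are invented for illustration and are not taken from any vLLM benchmark.

```python
# Hypothetical prefill batch: sequence lengths chosen only for illustration.
seq_lens = [512, 384, 128, 1024]

actual_tokens = sum(seq_lens)                  # 2048 tokens of real work
padded_tokens = len(seq_lens) * max(seq_lens)  # 4 * 1024 = 4096 tokens after padding

# Token-count (and roughly FLOP) inflation from padding every sequence
# to the length of the longest one in the batch.
print(f"padding overhead: {padded_tokens / actual_tokens:.2f}x")  # 2.00x
```

How much of this shows up as lost throughput in practice depends on how uniform the sequence lengths within a batch are, which is why a benchmark, as asked above, would be the way to settle it.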

4 participants