Refactor AsyncLLMEngine #880

Yard1 · 2023-08-25T22:18:05Z

This PR aims to refactor and simplify the AsyncLLMEngine implementation by moving to a single background scheduling loop. There should also be a tiny performance boost.

Signed-off-by: Antoni Baum <[email protected]>
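For readers unfamiliar with the pattern, here is a minimal sketch of what "a single background scheduling loop" can look like with asyncio. It is not the code from this PR: `ToyEngine`, `ToyAsyncEngine`, and all of their methods are hypothetical names invented for illustration, and the "engine" only counts down tokens. The idea it shows is that one background task steps the engine for every in-flight request and pushes results into per-request queues, instead of each request driving the engine itself.

```python
import asyncio


class ToyEngine:
    """Hypothetical stand-in for a synchronous engine; not vLLM's API."""

    def __init__(self):
        self.pending = {}  # request_id -> number of tokens still to emit

    def add_request(self, request_id, num_tokens):
        self.pending[request_id] = num_tokens

    def step(self):
        """Advance every pending request by one token."""
        outputs = []
        for request_id in list(self.pending):
            self.pending[request_id] -= 1
            finished = self.pending[request_id] == 0
            outputs.append((request_id, finished))
            if finished:
                del self.pending[request_id]
        return outputs


class ToyAsyncEngine:
    """Async wrapper in which one background task serves all requests."""

    def __init__(self):
        self.engine = ToyEngine()
        self.streams = {}  # request_id -> asyncio.Queue of step results
        self.new_work = None
        self.background_task = None

    def start_background_loop(self):
        # Create the loop lazily, once, inside a running event loop.
        if self.background_task is None:
            self.new_work = asyncio.Event()
            loop = asyncio.get_running_loop()
            self.background_task = loop.create_task(self._run_loop())

    async def _run_loop(self):
        while True:
            if not self.engine.pending:
                # Nothing to do: sleep until a request arrives.
                self.new_work.clear()
                await self.new_work.wait()
            for request_id, finished in self.engine.step():
                self.streams[request_id].put_nowait(finished)
            await asyncio.sleep(0)  # yield so consumers can read their queues

    async def generate(self, request_id, num_tokens):
        self.start_background_loop()
        queue = asyncio.Queue()
        self.streams[request_id] = queue
        self.engine.add_request(request_id, num_tokens)
        self.new_work.set()  # wake the background loop
        received = 0
        while True:
            finished = await queue.get()
            received += 1
            if finished:
                del self.streams[request_id]
                return received


async def main():
    engine = ToyAsyncEngine()
    results = await asyncio.gather(
        engine.generate("a", 3),
        engine.generate("b", 5),
    )
    engine.background_task.cancel()
    return results
```

Calling `asyncio.run(main())` runs two concurrent requests through the same loop. A design like this keeps scheduling in one place, which is one plausible reason a refactor along these lines would simplify the code.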

zhuohan123

Thanks for the contribution! Please see the detailed comments below.

vllm/core/scheduler.py

vllm/engine/async_llm_engine.py

vllm/engine/llm_engine.py

vllm/engine/async_llm_engine.py

Signed-off-by: Antoni Baum <[email protected]>

zhuohan123

Thanks for the changes! Left some minor style comments. Let's merge the PR after these changes.

vllm/engine/async_llm_engine.py

vllm/engine/llm_engine.py

esmeetu · 2023-09-04T00:52:32Z

FYI, the API server might get stuck on requests when this PR is applied.

Yard1 · 2023-09-04T01:45:30Z

@esmeetu can you elaborate?

Signed-off-by: Antoni Baum <[email protected]>

zhuohan123

LGTM! Thank you for your contribution!

esmeetu · 2023-09-04T10:29:16Z

@Yard1
Start server: python -m vllm.entrypoints.openai.api_server --model llama-2-7b

POST /v1/completions
Content-Type: application/json;charset=UTF-8

{
"model": "llama-2-7b",
"prompt": [
"hi"
],
"max_tokens": 20,
"temperature": 0
}

Then there is no response...
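The reproduction above can also be scripted. The sketch below builds the same POST request with the Python standard library and sends it with a timeout, so a stuck server shows up as an exception rather than an indefinite wait. The base URL `http://localhost:8000` is an assumption (the report does not state the host or port), and the helper names `build_completion_request` and `send` are made up for this example.

```python
import json
import urllib.request

# Assumption: the server listens locally on port 8000; adjust as needed.
BASE_URL = "http://localhost:8000"


def build_completion_request(base_url=BASE_URL):
    """Build the POST /v1/completions request from the report above."""
    payload = {
        "model": "llama-2-7b",
        "prompt": ["hi"],
        "max_tokens": 20,
        "temperature": 0,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json;charset=UTF-8"},
        method="POST",
    )


def send(request, timeout=30.0):
    """Send the request; raises if no response arrives within `timeout` seconds."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `send(build_completion_request())` should either return the parsed JSON response or raise once the timeout expires.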

Yard1 added 8 commits August 25, 2023 15:01

Refactor AsyncLLMEngine

a2d5cc5

Signed-off-by: Antoni Baum <[email protected]>

Lint

9107bc9

Signed-off-by: Antoni Baum <[email protected]>

Rename

90ffbcc

Signed-off-by: Antoni Baum <[email protected]>

Add ray_remote_kwargs

302d6fa

Signed-off-by: Antoni Baum <[email protected]>

Nit

de30724

Signed-off-by: Antoni Baum <[email protected]>

Merge branch 'vllm-project:main' into async_engine_refactor

b5de93e

Remove debug

e44fca4

Signed-off-by: Antoni Baum <[email protected]>

Nit

9b1b34a

Signed-off-by: Antoni Baum <[email protected]>

zhuohan123 requested changes Aug 30, 2023


zhuohan123 mentioned this pull request Aug 30, 2023

Fix event error del to is_engine_running always true (#286) #885

Closed

zhuohan123 force-pushed the main branch from 3affdce to 0080d83 August 30, 2023 09:26

Apply feedback from code review

29a0ad4

Signed-off-by: Antoni Baum <[email protected]>

Yard1 requested a review from zhuohan123 September 1, 2023 00:52

Handle cancellations better

217d2b4

Signed-off-by: Antoni Baum <[email protected]>

zhuohan123 requested changes Sep 4, 2023


Apply feedback from code review

d8fb811

Signed-off-by: Antoni Baum <[email protected]>

Yard1 requested a review from zhuohan123 September 4, 2023 01:51

zhuohan123 approved these changes Sep 4, 2023


zhuohan123 merged commit ce741ba into vllm-project:main Sep 4, 2023
2 checks passed

Yard1 deleted the async_engine_refactor branch September 4, 2023 04:45

Yard1 mentioned this pull request Sep 4, 2023

Initialize AsyncLLMEngine bg loop correctly #943

Merged

liuyanyi pushed a commit to liuyanyi/vllm that referenced this pull request Sep 12, 2023

Refactor AsyncLLMEngine (vllm-project#880)

9af27cb

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Refactor AsyncLLMEngine (vllm-project#880)

bb98fb4

sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request May 7, 2024

Refactor AsyncLLMEngine (vllm-project#880)

b14c1b0
