

multi-lora documentation fix #3064

Merged: 5 commits merged into vllm-project:main on Feb 28, 2024
Conversation

@ElefHead (Contributor) commented Feb 27, 2024

Minor fixes to the documentation for the OpenAI-compatible server.

@ElefHead changed the title from "fix: multi-lora with sample api-server" to "multi-lora with sample api-server" on Feb 27, 2024
@simon-mo (Collaborator)

Hi @ElefHead, thank you for your PR. However, at the beginning of api_server.py we note:

"""
NOTE: This API server is used only for demonstrating usage of AsyncEngine and simple performance benchmarks.
It is not intended for production use. For production use, we recommend using our OpenAI compatible server.
We are also not going to accept PRs modifying this file, please change `vllm/entrypoints/openai/api_server.py` instead.
"""

Please revert the change in api_server; we are happy to accept the documentation change!

@ElefHead (Contributor, Author)

@simon-mo Thanks for letting me know; I missed it. I reverted the changes and made a small addition to the docs. Cheers!

@ElefHead changed the title from "multi-lora with sample api-server" to "multi-lora documentation fix" on Feb 27, 2024
@findalexli

Thanks! I was just confused about when to use vllm.entrypoints.openai.api_server versus vllm.entrypoints.api_server.
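For readers with the same confusion, the two entrypoints are launched differently. A minimal sketch (the model name and adapter path below are placeholders; `--enable-lora` and `--lora-modules` are the OpenAI-compatible server's LoRA flags):

```shell
# OpenAI-compatible server (recommended): serves /v1/completions etc.
# and can register LoRA adapters at launch.
python -m vllm.entrypoints.openai.api_server \
    --model meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules my-adapter=/path/to/adapter

# Demo server: for demonstration and simple benchmarks only,
# per the note at the top of vllm/entrypoints/api_server.py.
python -m vllm.entrypoints.api_server \
    --model meta-llama/Llama-2-7b-hf
```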

@simon-mo simon-mo merged commit a868310 into vllm-project:main Feb 28, 2024
22 checks passed
@ElefHead ElefHead deleted the gj/multi-lora-server branch February 28, 2024 05:27
@sleepwalker2017 commented Mar 4, 2024

Hello, why is S-LoRA only supported in the OpenAI API?

Any plans to support it in vllm.entrypoints.api_server? Thank you. @ElefHead

I mentioned it here:
#3174

It seems the implementation is incomplete; there is no LoRA information in the generate server API.

xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024
@simon-mo (Collaborator) commented Mar 4, 2024

The generate server API is not recommended for production usage; we kept it only so as not to break existing usage. The OpenAI server provides a superset of its functionality and uses the same engine under the hood.
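To make the superset point concrete: with the OpenAI-compatible server, a LoRA adapter is selected per request simply by passing its registered name (from `--lora-modules`) as the `model` field. A minimal sketch of the request body; the adapter name `my-adapter` is a hypothetical name assumed to have been registered at server launch:

```python
import json


def build_completion_request(adapter_name: str, prompt: str,
                             max_tokens: int = 64) -> str:
    """Build the JSON body for a POST to /v1/completions on the
    OpenAI-compatible server. The LoRA adapter is chosen by passing
    its registered name as the "model" field."""
    payload = {
        "model": adapter_name,   # e.g. "my-adapter", registered via --lora-modules
        "prompt": prompt,
        "max_tokens": max_tokens,
    }
    return json.dumps(payload)


body = build_completion_request("my-adapter", "Hello, world")
print(body)
```

The same body sent with `model` set to the base model name (e.g. the value of `--model`) would run without any adapter, which is why the OpenAI server subsumes the demo server's plain generate endpoint.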

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024