[Misc]: Can we remove vllm/entrypoints/api_server.py? #3852
Comments
It's probably a good idea. However, it would probably be better to move it into a subfolder like vllm_example. It's possible that in the future it could be useful to create server implementations of the Mistral AI or Anthropic APIs.
We can leave the openai server where it is so that, if other proprietary APIs are added in the future, they can live adjacent to the openai directory.
There are outstanding issues which suggest that there are some bugs in that file. Yet #2858 clearly states that the file is not supposed to be updated further. @simon-mo, what are the reasons behind this? If we want this file to serve as an example, IMO we should at least make sure that it is bug-free.
We can fix bugs. We just shouldn't add more features to bring it to parity with the OpenAI-compatible API.
Perhaps we should move it to …
Maybe this could be kept for supporting beta/experimental features, or things that aren't covered by the OpenAI API spec. For example, this was the only way I found to send tokens directly. Mistral v3 function calling, which requires custom tokenization that isn't covered by the chat template, is one such case, and I think it could also be useful for debugging other OSS models that come with their own prompt templates, which sometimes may not be optimal for whatever reason. Note: I might be missing the docs regarding sending tokens directly through the OpenAI API spec, but I didn't find anything about that.
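For concreteness, here is a minimal sketch (not taken from the thread) of what "sending tokens directly" can look like with vLLM's offline Python API, which the OpenAI-compatible HTTP server does not expose; the model name and token IDs are placeholders, and the exact generate() signature differs across vLLM versions:

```python
from vllm import LLM, SamplingParams

# Placeholder model; any model whose prompt format you tokenize yourself.
llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.3")
params = SamplingParams(temperature=0.0, max_tokens=64)

# Token IDs produced by a custom tokenization step (e.g. a tool-calling format
# that a plain chat template cannot express). Placeholder values.
prompt_token_ids = [[1, 733, 16289, 28793, 22557, 28808, 733, 28748, 16289, 28793]]

# vLLM releases of roughly this era accepted pre-tokenized prompts via
# prompt_token_ids; newer releases use a TokensPrompt input instead.
outputs = llm.generate(prompt_token_ids=prompt_token_ids, sampling_params=params)
print(outputs[0].outputs[0].text)
```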
This is a valid point.
We kept the file there for backward compatibility as some people are still using it.
Correct me if I'm wrong, but it looks like we can't use this file directly to test out those features. If we have to copy this file and modify it elsewhere anyway, I don't see the harm in moving it to the examples directory and thus adding it to the official documentation.
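As an illustration of what "copy this file and modify it" could look like in practice, here is a hypothetical, trimmed-down variant of the example server with an extra endpoint that accepts pre-tokenized input; the endpoint name and placeholder model are invented, and the engine.generate() call follows vLLM releases of roughly this era, so it may not match newer versions:

```python
# Hypothetical modification of vllm/entrypoints/api_server.py: accept
# "prompt_token_ids" in the request body instead of a text "prompt".
from fastapi import FastAPI, Request
from fastapi.responses import JSONResponse

from vllm.engine.arg_utils import AsyncEngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine
from vllm.sampling_params import SamplingParams
from vllm.utils import random_uuid

app = FastAPI()
engine = AsyncLLMEngine.from_engine_args(AsyncEngineArgs(model="<model>"))  # placeholder model


@app.post("/generate_tokens")
async def generate_tokens(request: Request) -> JSONResponse:
    request_dict = await request.json()
    token_ids = request_dict.pop("prompt_token_ids")
    # Remaining fields (max_tokens, temperature, ...) become sampling params.
    sampling_params = SamplingParams(**request_dict)
    request_id = random_uuid()

    # Older AsyncLLMEngine.generate() took an optional text prompt plus
    # prompt_token_ids; newer versions take a TokensPrompt instead.
    final_output = None
    async for output in engine.generate(
        None, sampling_params, request_id, prompt_token_ids=token_ids
    ):
        final_output = output
    return JSONResponse({"text": [o.text for o in final_output.outputs]})
```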
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Anything you want to discuss about vllm.
While gardening GitHub issues, I've seen many issues where the user is running -m vllm.entrypoints.api_server, which indicates that users are either unaware of, or are ignoring, the note at the top of the file (vllm/entrypoints/api_server.py, lines 1 to 7 at b778200).
Can we remove it to avoid future confusion?
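For readers who land here, a rough sketch (not part of the original issue) of the confusion in question: the two entrypoints are launched almost identically but expose different HTTP APIs. Host, port, and model name below are placeholders.

```python
# Both servers are started in a very similar way, which is part of the confusion:
#
#   python -m vllm.entrypoints.api_server --model <model>          # legacy demo/example server
#   python -m vllm.entrypoints.openai.api_server --model <model>   # recommended OpenAI-compatible server
#
# but the HTTP APIs they expose differ.
import requests

# Legacy demo server: a bare /generate endpoint taking a raw prompt plus
# SamplingParams fields, returning {"text": [...]}.
legacy = requests.post(
    "http://localhost:8000/generate",
    json={"prompt": "Hello, my name is", "max_tokens": 16},
)
print(legacy.json())

# OpenAI-compatible server: the standard /v1/completions endpoint.
openai_style = requests.post(
    "http://localhost:8000/v1/completions",
    json={"model": "<model>", "prompt": "Hello, my name is", "max_tokens": 16},
)
print(openai_style.json())
```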