[Bug]: Invalid inputs do not result in error #4339

dyastremsky · 2024-04-24T18:21:45Z

Your current environment

Start vLLM: docker run --gpus all -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 --ipc=host vllm/vllm-openai:latest --model gpt2 --dtype float16 --max-model-len 1024

🐛 Describe the bug

When running inference with vLLM, errors are not returned when an invalid input is supplied.

After starting vLLM as specified above, run a curl command that supplies invalid inputs: curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{ "model": "gpt2", "messages": [ { "role": "system", "content": "You are a helpful assistant.", "fake_input": "2" }, { "role": "user", "content": "Hello!", "fake_input": "3" } ] }'

You get a successful response. Fake_input is dropped instead of returning an error. This silent failure can be dangerous with typos or invalid inputs. Can this be fixed?

The text was updated successfully, but these errors were encountered:

simon-mo · 2024-04-24T18:33:54Z

Thanks for raising this issue. It does puzzle me why it happen since we have the pydantic model defined here

vllm/vllm/entrypoints/openai/protocol.py

Line 63 in aae0824

class ChatCompletionRequest(BaseModel):

and fastapi reference here

vllm/vllm/entrypoints/openai/api_server.py

Line 88 in aae0824

async def create_chat_completion(request: ChatCompletionRequest,

We also did not enable extra fields allowed.

simon-mo · 2024-04-24T18:34:03Z

Any help appreciated!

DarkLight1337 · 2024-04-25T05:50:24Z

The default setting in Pydantic (as documented here) silently drops extra fields.

In order to disallow extra fields, we have to explicitly disable it in the model class (extra="forbid"). Going to open a PR shortly.

DarkLight1337 · 2024-04-25T06:10:58Z

I have tested this and found that there is another reason why invalid keys in messages isn't being checked. Currently, messages is being annotated as Dict[str, str] so the schema is not fully specified. PR #3467 improves the type annotations but still does not limit the allowed keys. To fix this issue, I'm going to port over some type annotations from my WIP PR #4200.

Edit: Opened PR #4355.

dyastremsky · 2024-04-25T16:08:45Z

Wow, that was lightning fast, @DarkLight1337 and @simon-mo. Thanks for working on this so quickly!

dyastremsky added the bug Something isn't working label Apr 24, 2024

DarkLight1337 mentioned this issue Apr 25, 2024

[Frontend][Bugfix] Disallow extra fields in OpenAI API #4355

Merged

simon-mo closed this as completed in #4355 Apr 27, 2024

eduardozamudio mentioned this issue Jul 23, 2024

Support for tools / tool_choice="auto" in OpenAI-compatible API runpod-workers/worker-vllm#85

Open

2 tasks

DarkLight1337 mentioned this issue Sep 4, 2024

[Bug]: 400 Bad Request #4597

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Invalid inputs do not result in error #4339

[Bug]: Invalid inputs do not result in error #4339

dyastremsky commented Apr 24, 2024

simon-mo commented Apr 24, 2024

simon-mo commented Apr 24, 2024

DarkLight1337 commented Apr 25, 2024 •

edited

Loading

DarkLight1337 commented Apr 25, 2024 •

edited

Loading

dyastremsky commented Apr 25, 2024

[Bug]: Invalid inputs do not result in error #4339

[Bug]: Invalid inputs do not result in error #4339

Comments

dyastremsky commented Apr 24, 2024

Your current environment

🐛 Describe the bug

simon-mo commented Apr 24, 2024

simon-mo commented Apr 24, 2024

DarkLight1337 commented Apr 25, 2024 • edited Loading

DarkLight1337 commented Apr 25, 2024 • edited Loading

dyastremsky commented Apr 25, 2024

DarkLight1337 commented Apr 25, 2024 •

edited

Loading

DarkLight1337 commented Apr 25, 2024 •

edited

Loading