When running vLLM with the OpenAI chat APIs, the benchmarking script fails because the backend request function asserts that the API URL ends with `v1/completions`:

```python
assert api_url.endswith("v1/completions")
```

Command to reproduce:

```bash
python benchmark_serving.py --backend openai --model mistralai/Mistral-7B-v0.1 --dataset ShareGPT_V3_unfiltered_cleaned_split.json --save-result
```
The logs are as follows:
```
Namespace(backend='openai', version='N/A', base_url=None, host='localhost', port=8000, endpoint='/generate', dataset='ShareGPT_V3_unfiltered_cleaned_split.json', model='mistralai/Mistral-7B-v0.1', tokenizer=None, best_of=1, use_beam_search=False, num_prompts=1000, request_rate=inf, seed=0, trust_remote_code=False, disable_tqdm=False, save_result=True)
Traffic request rate: inf
  0%|          | 0/1000 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "/home/chenw/vllm/benchmarks/benchmark_serving.py", line 387, in <module>
    main(args)
  File "/home/chenw/vllm/benchmarks/benchmark_serving.py", line 259, in main
    benchmark_result = asyncio.run(
  File "/home/chenw/miniconda3/envs/myenv/lib/python3.9/asyncio/runners.py", line 44, in run
    return loop.run_until_complete(main)
  File "/home/chenw/miniconda3/envs/myenv/lib/python3.9/asyncio/base_events.py", line 647, in run_until_complete
    return future.result()
  File "/home/chenw/vllm/benchmarks/benchmark_serving.py", line 195, in benchmark
    outputs = await asyncio.gather(*tasks)
  File "/home/chenw/vllm/benchmarks/backend_request_func.py", line 223, in async_request_openai_completions
    assert api_url.endswith("v1/completions")
AssertionError
  0%|          | 0/1000 [00:00<?, ?it/s]
```
`backend_request_func.py` should not be limited to the completions endpoint; it should also accept the chat API, i.e. URLs matching:

```python
assert api_url.endswith("v1/chat/completions")
```
fix issue vllm-project#2940 (commit `3925602`): keep the `openai` backend URL as `/v1/completions` and add an `openai-chat` backend whose URL is `/v1/chat/completions` (plus yapf formatting and a trailing newline).
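With that fix, the benchmark can presumably be pointed at a chat-serving deployment along these lines; the `openai-chat` backend value comes from the commit message above, and `--endpoint` already exists in the script per the `Namespace` dump in the log:

```bash
python benchmark_serving.py --backend openai-chat \
    --endpoint /v1/chat/completions \
    --model mistralai/Mistral-7B-v0.1 \
    --dataset ShareGPT_V3_unfiltered_cleaned_split.json \
    --save-result
```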