Fix vLLM CPU api_server params #11384

xiangyuT · 2024-06-21T00:59:44Z

Description

Add load_in_low_bit to ipex_llm.vllm.cpu.entrypoints.openai.api_server

glorysdj

LGTM

fix

145a8a1

glorysdj approved these changes Jun 21, 2024

View reviewed changes

xiangyuT mentioned this pull request Jun 21, 2024

vLLM CPU example load-in-low-bit is not used #11360

Open

xiangyuT merged commit b30bf76 into intel-analytics:main Jun 21, 2024
18 checks passed

RyuKosei pushed a commit to RyuKosei/ipex-llm that referenced this pull request Jul 19, 2024

Fix vLLM CPU api_server params (intel-analytics#11384)

390efd2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix vLLM CPU api_server params #11384

Fix vLLM CPU api_server params #11384

xiangyuT commented Jun 21, 2024

glorysdj left a comment

Fix vLLM CPU api_server params #11384

Fix vLLM CPU api_server params #11384

Conversation

xiangyuT commented Jun 21, 2024

Description

glorysdj left a comment

Choose a reason for hiding this comment