[Roadmap] vLLM Roadmap Q1 2024 #2681
Comments
Is it possible to support mlx for running inference on Mac devices? That would simplify local development and running in the cloud.
As mentioned in #2643, it would be awesome to have vLLM …
Please pay attention to "Evaluation of Accelerated and Non-Accelerated Large Model Output" — it is very important to make sure the accelerated and non-accelerated outputs are always the same.
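A minimal sketch of that kind of parity check, comparing greedy outputs from vLLM against a plain Hugging Face transformers run of the same model; the model name and prompt are placeholders, not taken from this thread:

```python
# Hypothetical parity check: greedy outputs from vLLM vs. plain transformers.
# Model name and prompt are placeholders; small numerical differences can still
# flip tokens, so greedy decoding (temperature 0) is used on both sides.
from transformers import AutoModelForCausalLM, AutoTokenizer
from vllm import LLM, SamplingParams

MODEL = "facebook/opt-125m"   # placeholder model
PROMPT = "The capital of France is"
MAX_NEW_TOKENS = 32

# Reference: non-accelerated Hugging Face generation (greedy).
tok = AutoTokenizer.from_pretrained(MODEL)
hf_model = AutoModelForCausalLM.from_pretrained(MODEL)
inputs = tok(PROMPT, return_tensors="pt")
hf_ids = hf_model.generate(**inputs, do_sample=False, max_new_tokens=MAX_NEW_TOKENS)
hf_text = tok.decode(hf_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

# Accelerated: vLLM generation with temperature 0 (greedy).
llm = LLM(model=MODEL)
params = SamplingParams(temperature=0.0, max_tokens=MAX_NEW_TOKENS)
vllm_text = llm.generate([PROMPT], params)[0].outputs[0].text

print("hf:  ", hf_text.strip())
print("vllm:", vllm_text.strip())
print("match:", hf_text.strip() == vllm_text.strip())
```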
Agree 100%, the ability to use …
#2767 I suggest adding this to the roadmap as it's one of the more straightforward optimizations (someone already did the optimization work).
#2573
Please support the ARM aarch64 architecture.
Please consider supporting StreamingLLM.
Any update on PEFT? Please consider supporting Hugging Face PEFT, thank you. #1129
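For reference, vLLM's multi-LoRA support can serve PEFT-format LoRA adapters; the rough sketch below assumes that feature, with the base model, adapter name, and adapter path as placeholders, and the exact API may vary by vLLM version:

```python
# Sketch of serving a PEFT/LoRA adapter with vLLM's multi-LoRA support.
# Adapter name/path and base model are placeholders; details may vary by version.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(model="meta-llama/Llama-2-7b-hf", enable_lora=True)
params = SamplingParams(temperature=0.0, max_tokens=64)

# The adapter directory is a standard PEFT LoRA checkpoint on local disk.
lora = LoRARequest("my_adapter", 1, "/path/to/peft-lora-adapter")

outputs = llm.generate(["Summarize: ..."], params, lora_request=lora)
print(outputs[0].outputs[0].text)
```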
Would you consider adding support for earlier ROCm versions, e.g. 5.6.1? Thank you!
If possible, EXL2 support would be great, thank you <3
#97 should be added to the "automating the release process" section.
Also, the ability to use Guidance/Outlines via logit_bias! |
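For context, `logit_bias` in the OpenAI API is a map from token IDs to additive biases on the logits. A request against an OpenAI-compatible endpoint might look like the sketch below; the server URL, model name, and token IDs are illustrative only, and it assumes the server honors `logit_bias`, which is what this thread is asking vLLM to do:

```python
# Illustrative request using logit_bias against an OpenAI-compatible endpoint.
# Token IDs, model name, and server URL are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.completions.create(
    model="meta-llama/Llama-2-7b-hf",
    prompt="Answer yes or no:",
    max_tokens=1,
    # Bias the logits of specific token IDs (hypothetical IDs for " yes"
    # and " no") so only those tokens are realistically sampled.
    logit_bias={"3869": 100, "694": 100},
)
print(response.choices[0].text)
```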
Support W8A8 (8-bit weight, 8-bit activation) quantization.
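For clarity, W8A8 quantizes both the weights and the activations to 8-bit integers, unlike weight-only schemes such as W8A16. A toy sketch of per-tensor symmetric int8 quantization (not vLLM code; real kernels run an int8 GEMM with int32 accumulation, which is simulated in float here):

```python
# Toy W8A8 illustration: weights and activations are both quantized to int8
# with per-tensor symmetric scales; the matmul result is rescaled by the
# product of the two scales. Not vLLM code.
import torch

def quantize_sym_int8(x: torch.Tensor):
    scale = x.abs().max() / 127.0
    q = torch.clamp(torch.round(x / scale), -127, 127).to(torch.int8)
    return q, scale

w = torch.randn(256, 512)      # weights
a = torch.randn(8, 256)        # activations
w_q, w_s = quantize_sym_int8(w)
a_q, a_s = quantize_sym_int8(a)

# Simulate the int8 matmul in float (real kernels accumulate in int32).
y = (a_q.float() @ w_q.float()) * (a_s * w_s)

print("max abs error vs. fp32:", (y - a @ w).abs().max().item())
```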
Let's migrate our discussion to #3861.
This document lists the features on vLLM's roadmap for Q1 2024. Please feel free to discuss and contribute to specific features in the related RFCs/issues/PRs, and add anything else you'd like to talk about in this issue.
In the future, we will publish our roadmap quarterly and deprecate our old roadmap (#244).
The vLLM team is working with the following hardware vendors:
torch.compile support
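For reference, `torch.compile` in PyTorch 2.x wraps a module (or any callable) and JIT-compiles its forward pass on first call; a minimal example independent of vLLM:

```python
# Minimal torch.compile example (PyTorch 2.x): the compiled module traces
# and optimizes the forward pass the first time it is called.
import torch

model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.GELU(),
    torch.nn.Linear(256, 128),
)
compiled_model = torch.compile(model)

x = torch.randn(4, 128)
print(compiled_model(x).shape)  # torch.Size([4, 128])
```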