v0.4.3 Release Tracker #4895

simon-mo · 2024-05-18T01:05:16Z

ETA May 30 (due to some blockers and US holiday).

Blockers

Nice to have

sasha0552 · 2024-05-18T21:02:12Z

Hi, is it possible to include the following PRs?

simon-mo · 2024-05-19T05:32:25Z

Thanks for bring these up @sasha0552!

#4167 is unlikely to be finished in time.
#4409 might need a little bit more discussion given what features are supported for Pascal GPUs and whether building from source might be a better option.
#4638 can be included if it gets merged in time.

We do commit to biweekly release cadence so don't worry many of these will get into soon enough!

robertgshaw2-neuralmagic · 2024-05-19T16:24:53Z

Thanks for bring these up @sasha0552!

#4167 is unlikely to be finished in time. #4409 might need a little bit more discussion given what features are supported for Pascal GPUs and whether building from source might be a better option. #4638 can be included if it gets merged in time.

We do commit to biweekly release cadence so don't worry many of these will get into soon enough!

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

robertgshaw2-neuralmagic · 2024-05-19T16:34:15Z

I am going to try to get these in

probably will not make it but tracking to v0.4.4:

njhill · 2024-05-22T18:21:17Z

Sounds like we may want to include #4894 @rkooo567?

rkooo567 · 2024-05-22T21:09:06Z

Yeah +1 on that PR @njhill

robertgshaw2-neuralmagic · 2024-05-23T09:17:08Z

Fix for mistral-v0.3: [Bugfix] Fix Mistral v0.3 Weight Loading #5005

jasonacox · 2024-05-27T04:26:29Z

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

Hi @robertgshaw2-neuralmagic - was this without the patch? I couldn't get a source build to run on P100's without the patch of #4409. With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

sasha0552 · 2024-05-27T04:34:46Z

With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

Not only fp16, but AQLM works well too (#5058)

robertgshaw2-neuralmagic · 2024-05-27T21:49:40Z

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

Hi @robertgshaw2-neuralmagic - was this without the patch? I couldn't get a source build to run on P100's without the patch of #4409. With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

P40 requires building with the patch.

vrdn-23 · 2024-05-28T17:12:41Z

Is there any particular PR that we're waiting for before cutting the release?

robertgshaw2-neuralmagic · 2024-05-28T17:44:12Z

Is there any particular PR that we're waiting for before cutting the release?

The model support for Phi and Deepseek

AmazDeng · 2024-06-01T05:54:04Z

Is there any particular PR that we're waiting for before cutting the release?

The model support for Phi and Deepseek

Excuse me, when will VLLM support embedding input?

dongxiaolong · 2024-06-03T01:23:16Z

Could we include #4109 ? Structured output is also very important, and it seems almost complete. @simon-mo

Link to PR #4109

LSC527 · 2024-06-03T07:33:37Z

Is there any particular PR that we're waiting for before cutting the release?

The model support for Phi and Deepseek

Hi, is Deepseek v2 supported now?

simon-mo · 2024-06-03T17:04:07Z

Moving Deepseek to optional in #5224
Tracking #4109 as optional in #5224

simon-mo added the release Related to new version release label May 18, 2024

simon-mo mentioned this issue May 18, 2024

[Misc]: When is the planned date for the next release? #4892

Closed

simon-mo pinned this issue May 21, 2024

saattrupdan mentioned this issue May 25, 2024

[MODEL EVALUATION REQUEST] mistralai/Mistral-7B-v0.3 ScandEval/ScandEval#440

Closed

8 tasks

simon-mo closed this as completed Jun 3, 2024

simon-mo unpinned this issue Jun 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.4.3 Release Tracker #4895

v0.4.3 Release Tracker #4895

simon-mo commented May 18, 2024 •

edited by robertgshaw2-neuralmagic

Loading

sasha0552 commented May 18, 2024

simon-mo commented May 19, 2024

robertgshaw2-neuralmagic commented May 19, 2024

robertgshaw2-neuralmagic commented May 19, 2024 •

edited

Loading

njhill commented May 22, 2024

rkooo567 commented May 22, 2024

robertgshaw2-neuralmagic commented May 23, 2024 •

edited

Loading

jasonacox commented May 27, 2024

sasha0552 commented May 27, 2024

robertgshaw2-neuralmagic commented May 27, 2024

vrdn-23 commented May 28, 2024

robertgshaw2-neuralmagic commented May 28, 2024

AmazDeng commented Jun 1, 2024

dongxiaolong commented Jun 3, 2024

LSC527 commented Jun 3, 2024

simon-mo commented Jun 3, 2024

v0.4.3 Release Tracker #4895

v0.4.3 Release Tracker #4895

Comments

simon-mo commented May 18, 2024 • edited by robertgshaw2-neuralmagic Loading

sasha0552 commented May 18, 2024

simon-mo commented May 19, 2024

robertgshaw2-neuralmagic commented May 19, 2024

robertgshaw2-neuralmagic commented May 19, 2024 • edited Loading

njhill commented May 22, 2024

rkooo567 commented May 22, 2024

robertgshaw2-neuralmagic commented May 23, 2024 • edited Loading

jasonacox commented May 27, 2024

sasha0552 commented May 27, 2024

robertgshaw2-neuralmagic commented May 27, 2024

vrdn-23 commented May 28, 2024

robertgshaw2-neuralmagic commented May 28, 2024

AmazDeng commented Jun 1, 2024

dongxiaolong commented Jun 3, 2024

LSC527 commented Jun 3, 2024

simon-mo commented Jun 3, 2024

simon-mo commented May 18, 2024 •

edited by robertgshaw2-neuralmagic

Loading

robertgshaw2-neuralmagic commented May 19, 2024 •

edited

Loading

robertgshaw2-neuralmagic commented May 23, 2024 •

edited

Loading