[Bugfix] Fix dtype mismatch in PaliGemma #6367
Conversation
```diff
@@ -111,7 +111,7 @@ def input_processor_for_paligemma(ctx: InputContext, llm_inputs: LLMInputs):
     orig_prompt = llm_inputs.get("prompt")
     orig_prompt_ids = llm_inputs.get("prompt_token_ids")

-    if image_token_str in orig_prompt:
+    if orig_prompt is not None and image_token_str in orig_prompt:
```
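The guard added above can be sketched in isolation. This is a minimal illustration, not the actual vLLM code: `has_image_token` is a hypothetical helper, and `llm_inputs` is assumed to be a dict-like object whose `"prompt"` entry may be `None` (e.g. when only token IDs are supplied).

```python
def has_image_token(llm_inputs: dict, image_token_str: str) -> bool:
    orig_prompt = llm_inputs.get("prompt")
    # Check for None before the substring test: `image_token_str in None`
    # would raise TypeError, which is what the added guard prevents.
    return orig_prompt is not None and image_token_str in orig_prompt

has_image_token({"prompt": None}, "<image>")            # → False (no TypeError)
has_image_token({"prompt": "<image> describe"}, "<image>")  # → True
```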
My IDE flagged this so I fixed it along the way.
Hmm, that's weird - I thought the vision encoder and the language model itself are always https://huggingface.co/google/paligemma-3b-mix-224/blob/main/config.json
Doesn't vLLM update the
I wonder why we never needed to do this for Llava models - let me test the original issue and this PR and see whether it fixes it. I do think we should test this model on
Signed-off-by: Alvant <[email protected]>
FIX #6366