[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 #9944

FurtherAI · 2024-11-02T01:37:04Z

[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1

This is related to #9303 and #9759 and adds support for Qwen2VL as an embedding model. This is only a couple of lines of changes and a jinja template for the API.

I have tested that this is working correctly with MrLight/dse-qwen2-2b-mrl-v1. The test file is a bit ugly though. I don't think the hf_runner supports the input processing that needs to be done for this model because it uses process_vision_info and applies a chat template manually. It also takes a minimum size image placeholder for text embedding.

Looking for a little help with how the test should be written to cleanly handle these differences.

Update documentation to include this.
Add a test for the API maybe.

I have tested the API, but didn't look into how to add it to test_vision_embedding.py yet.

cc @DarkLight1337

…en2-2b-MRL-V1.

github-actions · 2024-11-02T01:37:16Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

examples/template_dse_qwen2_vl.jinja

FurtherAI · 2024-11-07T20:00:20Z

@DarkLight1337 Do you want any changes to the vision embedding API test? It looks like generally those tests are more about the endpoint than if the model they're calling is returning the correct output.

…1 as an embedding model. Signed-off-by: FurtherAI <[email protected]>

Signed-off-by: FurtherAI <[email protected]>

docs/source/models/supported_models.rst

examples/openai_chat_embedding_client_for_multimodal.py

DarkLight1337 · 2024-11-08T03:32:33Z

Do you want any changes to the vision embedding API test? It looks like generally those tests are more about the endpoint than if the model they're calling is returning the correct output.

It looks quite complicated. Can you factor out the common code so it is more like the other embedding model tests?

Co-authored-by: Cyrus Leung <[email protected]>

FurtherAI · 2024-11-08T17:35:37Z

Do you want any changes to the vision embedding API test? It looks like generally those tests are more about the endpoint than if the model they're calling is returning the correct output.

It looks quite complicated. Can you factor out the common code so it is more like the other embedding model tests?

Do you mean the offline test looks complicated? I think the question still stands about if you want anything added to the API test.

DarkLight1337 · 2024-11-09T02:22:52Z

Do you want any changes to the vision embedding API test? It looks like generally those tests are more about the endpoint than if the model they're calling is returning the correct output.

It looks quite complicated. Can you factor out the common code so it is more like the other embedding model tests?

Do you mean the offline test looks complicated? I think the question still stands about if you want anything added to the API test.

I was referring to the offline API tests. I don't see any changes to the online API tests though (which are under entrypoints/openai) - what do you mean by API tests?

mergify · 2024-11-09T03:38:17Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @FurtherAI.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

FurtherAI · 2024-11-09T04:27:42Z

Do you want any changes to the vision embedding API test? It looks like generally those tests are more about the endpoint than if the model they're calling is returning the correct output.

It looks quite complicated. Can you factor out the common code so it is more like the other embedding model tests?

Do you mean the offline test looks complicated? I think the question still stands about if you want anything added to the API test.

I was referring to the offline API tests. I don't see any changes to the online API tests though (which are under entrypoints/openai) - what do you mean by API tests?

Those are the API tests I was referring to.

I can improve the code for the test I think. I'll test it next week.

DarkLight1337 · 2024-11-09T04:29:18Z

There is no need to update online API tests for new models.

Signed-off-by: austin.veselka <[email protected]>

DarkLight1337 · 2024-11-13T02:42:54Z

PTAL at the failing model tests. Can we omit using qwen2_vl_utils?

FurtherAI · 2024-11-13T02:54:15Z

PTAL at the failing model tests. Can we omit using qwen2_vl_utils?

Yeah I can probably drop that. I think it is only resizing the image

Signed-off-by: austin.veselka <[email protected]>

DarkLight1337 · 2024-11-13T03:14:25Z

Please also merge from main to fix the broken CI.

DarkLight1337

The tests have passed, thanks for your patience!

…ht/dse-qwen2-2b-mrl-v1 (vllm-project#9944) Signed-off-by: FurtherAI <[email protected]> Co-authored-by: FurtherAI <[email protected]>

…ht/dse-qwen2-2b-mrl-v1 (vllm-project#9944) Signed-off-by: FurtherAI <[email protected]> Co-authored-by: FurtherAI <[email protected]> Signed-off-by: OmerD <[email protected]>

…ht/dse-qwen2-2b-mrl-v1 (vllm-project#9944) Signed-off-by: FurtherAI <[email protected]> Co-authored-by: FurtherAI <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

Added support for Qwen2VL embeddings. Specifically tested with DSE-Qw…

f00967c

…en2-2b-MRL-V1.

FurtherAI requested review from DarkLight1337 and ywang96 as code owners November 2, 2024 01:37

DarkLight1337 reviewed Nov 2, 2024

View reviewed changes

examples/template_dse_qwen2_vl.jinja Outdated Show resolved Hide resolved

DarkLight1337 self-assigned this Nov 2, 2024

Merge branch 'main' into dse_qwen2_2b_mrl_v1

f3bd4da

Adding examples to documentation for using MrLight/dse-qwen2-2b-mrl-v…

beec785

…1 as an embedding model. Signed-off-by: FurtherAI <[email protected]>

mergify bot added the documentation Improvements or additions to documentation label Nov 7, 2024

Fix documentation

4330667

Signed-off-by: FurtherAI <[email protected]>

DarkLight1337 reviewed Nov 8, 2024

View reviewed changes

docs/source/models/supported_models.rst Outdated Show resolved Hide resolved

examples/openai_chat_embedding_client_for_multimodal.py Outdated Show resolved Hide resolved

Update docs/source/models/supported_models.rst

fcb16fe

Co-authored-by: Cyrus Leung <[email protected]>

mergify bot added the needs-rebase label Nov 9, 2024

austin.veselka added 3 commits November 12, 2024 16:45

Merge branch 'main' into dse_qwen2_2b_mrl_v1

48b44a6

[Model][Tests] Improve the test for DSE Qwen2VL

12e28d7

Signed-off-by: austin.veselka <[email protected]>

[Merge] Merge main

dc238a3

Signed-off-by: austin.veselka <[email protected]>

mergify bot removed the needs-rebase label Nov 12, 2024

[Model][Tests] Remove qwen_vl_utils dependency

1186516

Signed-off-by: austin.veselka <[email protected]>

Merge branch 'main' into dse_qwen2_2b_mrl_v1

8139112

DarkLight1337 mentioned this pull request Nov 13, 2024

[RFC]: Multi-modality Support Refactoring #4194

Open

DarkLight1337 approved these changes Nov 13, 2024

View reviewed changes

DarkLight1337 enabled auto-merge (squash) November 13, 2024 07:03

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 13, 2024

DarkLight1337 merged commit 1b886aa into vllm-project:main Nov 13, 2024
65 checks passed

FurtherAI deleted the dse_qwen2_2b_mrl_v1 branch November 13, 2024 17:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 #9944

[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 #9944

FurtherAI commented Nov 2, 2024 •

edited

Loading

github-actions bot commented Nov 2, 2024

FurtherAI commented Nov 7, 2024

DarkLight1337 commented Nov 8, 2024

FurtherAI commented Nov 8, 2024

DarkLight1337 commented Nov 9, 2024 •

edited

Loading

mergify bot commented Nov 9, 2024

FurtherAI commented Nov 9, 2024

DarkLight1337 commented Nov 9, 2024

DarkLight1337 commented Nov 13, 2024

FurtherAI commented Nov 13, 2024

DarkLight1337 commented Nov 13, 2024

DarkLight1337 left a comment

[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 #9944

[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 #9944

Conversation

FurtherAI commented Nov 2, 2024 • edited Loading

github-actions bot commented Nov 2, 2024

FurtherAI commented Nov 7, 2024

DarkLight1337 commented Nov 8, 2024

FurtherAI commented Nov 8, 2024

DarkLight1337 commented Nov 9, 2024 • edited Loading

mergify bot commented Nov 9, 2024

FurtherAI commented Nov 9, 2024

DarkLight1337 commented Nov 9, 2024

DarkLight1337 commented Nov 13, 2024

FurtherAI commented Nov 13, 2024

DarkLight1337 commented Nov 13, 2024

DarkLight1337 left a comment

Choose a reason for hiding this comment

FurtherAI commented Nov 2, 2024 •

edited

Loading

DarkLight1337 commented Nov 9, 2024 •

edited

Loading