can model Qwen/Qwen-VL-Chat work well? #962
same question here
Same issue. I think this is because
same issue
Has this been resolved yet?
same issue
Please stop saying "same issue"; just react to the original message to show your support.
For text-only inputs, we can run the model with this patch: #5710.
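For reference, a minimal text-only invocation might look like the sketch below, assuming a vLLM build that includes that patch; the prompt and sampling settings are illustrative, not taken from the patch itself.

```python
from vllm import LLM, SamplingParams

# Minimal sketch: text-only inference with Qwen-VL-Chat, assuming a vLLM
# build that includes the patch from #5710 (illustrative settings).
llm = LLM(model="Qwen/Qwen-VL-Chat", trust_remote_code=True)
params = SamplingParams(temperature=0, max_tokens=128)

outputs = llm.generate(["Hello! Who are you?"], params)
print(outputs[0].outputs[0].text)
```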
I am looking into adding support for image inputs for Qwen-VL/Qwen-VL-Chat 😄
@alex-jw-brooks thanks for the upcoming contribution! When you have updates, please post them here, as I've closed the other issues as duplicates.
Great, thank you @hmellor - it's almost ready. I've been able to load and get reasonable-looking output from qwen-vl/qwen-vl-chat; I just need to work through some cleanup, small fixes, and tests. I will open the PR in the next couple of days 🤞
PR #8029 |
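Once image inputs land, usage should presumably follow vLLM's standard multi-modal API. A rough sketch, with the caveat that the Qwen-VL prompt format and the image path here are assumptions (check vLLM's vision-language examples for the authoritative form):

```python
from PIL import Image
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen-VL-Chat", trust_remote_code=True)

# Qwen-VL marks image slots with <img></img> tags in the prompt; this exact
# format is an assumption -- see vLLM's vision-language examples.
prompt = "Picture 1: <img></img>\nWhat is shown in this image?"
image = Image.open("example.jpg")  # hypothetical local image file

outputs = llm.generate(
    {"prompt": prompt, "multi_modal_data": {"image": image}},
    SamplingParams(temperature=0, max_tokens=128),
)
print(outputs[0].outputs[0].text)
```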
When I use Qwen/Qwen-VL-Chat, it throws an error and I do not know why:
```
Traceback (most recent call last):
  File "test.py", line 20, in <module>
    model = LLM(model=model_path, tokenizer=model_path, tokenizer_mode='slow', tensor_parallel_size=1, trust_remote_code=True)
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/entrypoints/llm.py", line 66, in __init__
    self.llm_engine = LLMEngine.from_engine_args(engine_args)
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/engine/llm_engine.py", line 220, in from_engine_args
    engine = cls(*engine_configs,
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/engine/llm_engine.py", line 101, in __init__
    self._init_workers(distributed_init_method)
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/engine/llm_engine.py", line 133, in _init_workers
    self._run_workers(
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/engine/llm_engine.py", line 470, in _run_workers
    output = executor(*args, **kwargs)
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/worker/worker.py", line 67, in init_model
    self.model = get_model(self.model_config)
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/model_executor/model_loader.py", line 57, in get_model
    model.load_weights(model_config.model, model_config.download_dir,
  File "/usr/local/miniconda3/lib/python3.8/site-packages/vllm/model_executor/models/qwen.py", line 308, in load_weights
    param = state_dict[name]
KeyError: 'transformer.visual.positional_embedding'
```
The code is:

```python
from vllm import LLM, SamplingParams
from transformers import AutoTokenizer
import time

model_path = "Qwen/Qwen-VL-Chat"

# Loading the model is where the KeyError above is raised.
model = LLM(
    model=model_path,
    tokenizer=model_path,
    tokenizer_mode='slow',
    tensor_parallel_size=1,
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_path, legacy=True, trust_remote_code=True)
sampling_params = SamplingParams(temperature=0, max_tokens=8096)

start = time.time()
prompts = ["你好!"]
outputs = model.generate(prompts, sampling_params)
end = time.time()

for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    length = len(generated_text)
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

print(end - start)
cost = end - start
print(f"{length/cost} tokens/s")
```