
[Bug]: Model architectures ['LlavaForCausalLM'] are not supported for now in vllm 0.4.0.post1 #4008

Closed
stikkireddy opened this issue Apr 11, 2024 · 13 comments
Labels: bug (Something isn't working)


@stikkireddy

Your current environment

The output of `python collect_env.py`

🐛 Describe the bug

I am using:

from vllm import LLM

model = LLM(
    model="llava-hf/llava-1.5-13b-hf",
    image_input_type="pixel_values",
    download_dir="/tmp/models",
    image_token_id=32000,
    image_input_shape="1,3,336,336",
    image_feature_size=576,
)

which throws error: ValueError: Model architectures ['LlavaForCausalLM'] are not supported for now. Supported architectures: ['AquilaModel', 'AquilaForCausalLM', 'BaiChuanForCausalLM', 'BaichuanForCausalLM', 'BloomForCausalLM', 'ChatGLMModel', 'ChatGLMForConditionalGeneration', 'CohereForCausalLM', 'DbrxForCausalLM', 'DeciLMForCausalLM', 'DeepseekForCausalLM', 'FalconForCausalLM', 'GemmaForCausalLM', 'GPT2LMHeadModel', 'GPTBigCodeForCausalLM', 'GPTJForCausalLM', 'GPTNeoXForCausalLM', 'InternLMForCausalLM', 'InternLM2ForCausalLM', 'JAISLMHeadModel', 'LlamaForCausalLM', 'LlavaForConditionalGeneration', 'LLaMAForCausalLM', 'MistralForCausalLM', 'MixtralForCausalLM', 'QuantMixtralForCausalLM', 'MptForCausalLM', 'MPTForCausalLM', 'OLMoForCausalLM', 'OPTForCausalLM', 'OrionForCausalLM', 'PhiForCausalLM', 'QWenLMHeadModel', 'Qwen2ForCausalLM', 'Qwen2MoeForCausalLM', 'RWForCausalLM', 'StableLMEpochForCausalLM', 'StableLmForCausalLM', 'Starcoder2ForCausalLM', 'XverseForCausalLM']

After checking HuggingFace, it looks like llava-1.5-7b-hf uses LlavaForConditionalGeneration while llava-1.5-13b-hf uses LlavaForCausalLM.

Is there an easy workaround or fix for this?
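
For reference, one quick way to see which architecture name each checkpoint declares in its config.json (an illustrative snippet added for clarity, not part of the original report):

from transformers import AutoConfig

for name in ("llava-hf/llava-1.5-7b-hf", "llava-hf/llava-1.5-13b-hf"):
    cfg = AutoConfig.from_pretrained(name)
    # At the time of this issue, the 13b config reported LlavaForCausalLM.
    print(name, cfg.architectures)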

stikkireddy added the bug label on Apr 11, 2024
@stikkireddy (Author)

Maybe this is a feature request instead; I don't know enough about the intent behind supporting LLaVA.

@DarkLight1337 (Member)

I don't know how I missed that when writing PR #3978. Interesting... was the 7b model modified specifically to facilitate the proof-of-concept for vLLM?

@stikkireddy (Author)

I'm not sure. I reviewed #3042 and it seems LLaVA 7b and 13b use different classes; only LlavaForConditionalGeneration was supported, not LlavaForCausalLM.

@stikkireddy (Author)

@xwjiang2010 do you happen to know how much effort it would take to support LLaVA 13b? Pinging you since you worked on the initial PR for vision models.

@DarkLight1337 (Member) commented Apr 11, 2024

I think it's just a typo in their HuggingFace config.json. I can successfully load the 13b model in vLLM by pretending that it has LlavaForConditionalGeneration architecture. I applied the following patch on top of DarkLight1337:openai-vision-api:

diff --git a/tests/conftest.py b/tests/conftest.py
index a7e8963..61ca4fe 100644
--- a/tests/conftest.py
+++ b/tests/conftest.py
@@ -131,6 +131,7 @@ _STR_DTYPE_TO_TORCH_DTYPE = {
 
 _VISION_LANGUAGE_MODELS = {
     "llava-hf/llava-1.5-7b-hf": LlavaForConditionalGeneration,
+    "llava-hf/llava-1.5-13b-hf": LlavaForConditionalGeneration,
 }
 
 
diff --git a/vllm/model_executor/models/__init__.py b/vllm/model_executor/models/__init__.py
index 17fc970..54a077d 100755
--- a/vllm/model_executor/models/__init__.py
+++ b/vllm/model_executor/models/__init__.py
@@ -33,6 +33,8 @@ _MODELS = {
     "LlamaForCausalLM": ("llama", "LlamaForCausalLM"),
     "LlavaForConditionalGeneration":
     ("llava", "LlavaForConditionalGeneration"),
+    "LlavaForCausalLM":
+    ("llava", "LlavaForConditionalGeneration"),
     # For decapoda-research/llama-*
     "LLaMAForCausalLM": ("llama", "LlamaForCausalLM"),
     "MistralForCausalLM": ("llama", "LlamaForCausalLM"),

Update: I have tested with some other models:

  • There is no need to apply this patch to llava-hf/bakLlava-v1-hf since it has LlavaForConditionalGeneration architecture as advertised.
  • Even with this patch, I cannot load models with LlavaLlamaForCausalLM architecture, notably liuhaotian/llava-v1.5-7b.

@stikkireddy (Author)

Hmm, okay. Then I can probably clone their model and just modify the config.json to use the LlavaForConditionalGeneration architecture.
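
In case it helps, a minimal sketch of that workaround (assuming it is enough to edit a local copy of config.json; the local_dir path is just an example):

import json
from huggingface_hub import snapshot_download

# Download the checkpoint to a directory we can edit.
local_dir = snapshot_download(
    "llava-hf/llava-1.5-13b-hf",
    local_dir="/tmp/models/llava-1.5-13b-hf",
)

# Rewrite the architecture field so vLLM routes the model to the
# supported LlavaForConditionalGeneration implementation.
cfg_path = f"{local_dir}/config.json"
with open(cfg_path) as f:
    cfg = json.load(f)
cfg["architectures"] = ["LlavaForConditionalGeneration"]
with open(cfg_path, "w") as f:
    json.dump(cfg, f, indent=2)

# Then pass model=local_dir to LLM(...) instead of the hub id.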

@DarkLight1337 (Member)

I have updated my PR accordingly.

@Jianzhao-Huang commented Apr 16, 2024

(In reply to DarkLight1337's patch comment above.)

Thank you for your effort in supporting LLaVA in vLLM! This is great work.

After checking the code in the original LLaVA repo (https://github.com/haotian-liu/LLaVA), I found that llava-1.5-13b does use the LlavaForCausalLM class, so I don't think it's a typo in HuggingFace's config.json.

I don't know whether there would be runtime bugs or performance losses after directly modifying llava-1.5-13b's config.json to use the LlavaForConditionalGeneration architecture. Could you please check this?
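
As an aside (not part of the original comment), one way to spot-check the renamed checkpoint before relying on the workaround is to run it once through transformers' own LlavaForConditionalGeneration; a weight/architecture mismatch typically shows up as missing or unexpected key warnings, or as degenerate output. This is only a sketch, and the image URL and prompt format are illustrative:

import requests
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-13b-hf"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

prompt = "USER: <image>\nWhat is shown in this image? ASSISTANT:"
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(processor.decode(output[0], skip_special_tokens=True))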

@DarkLight1337 (Member) commented Apr 16, 2024

@Jianzhao-Huang It doesn't look like HuggingFace's transformers library actually defines a LlavaForCausalLM class, so I think both models should use LlavaForConditionalGeneration as the architecture name.
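
A quick check against the installed transformers package illustrates the point (assuming a recent transformers release; the snippet is only illustrative):

import transformers

print(hasattr(transformers, "LlavaForConditionalGeneration"))  # True on recent releases
print(hasattr(transformers, "LlavaForCausalLM"))               # False: no such class is exported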

In any case, I have just pushed a commit to #3978 which adds the 13b model to the LLaVA test case that checks its consistency against the native HuggingFace model.

@DarkLight1337 (Member) commented Apr 17, 2024

Hmm, it seems that GPU memory fails to be freed between testing each model. Does anyone know how to fix this?
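
For context, a common pattern for releasing GPU memory between test cases looks roughly like this (a sketch only: llm stands for the engine created by the previous test, and this is not necessarily the fix that was eventually adopted):

import gc
import torch

del llm                   # drop the last reference to the vLLM engine
gc.collect()              # let Python reclaim the wrapper objects
torch.cuda.empty_cache()  # return cached blocks to the CUDA allocator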

@DarkLight1337 (Member) commented Apr 18, 2024

I have investigated further and it seems that the CI/CD infrastructure cannot even load the 13B model into memory (I removed all other LLaVA models from the test and it still OOMed). Not sure what I can do about that...

@Iven2132

Any fix?

@DarkLight1337 (Member)

The incorrect architecture in config.json has been fixed on HuggingFace, as per this discussion.
