
Support internlm2 model #2527

Closed
wants to merge 4 commits into from

Conversation

@esmeetu (Collaborator) commented Jan 21, 2024

Model info: https://huggingface.co/internlm/internlm2-chat-7b

Currently this doesn't work with TP2, which outputs garbled text, and I haven't tested TP1 yet.

Test code:

from vllm import LLM, SamplingParams

prompts = [
    "<s><|im_start|>user\nhello<|im_end|>\n<|im_start|>assistant\n"
]
sampling_params = SamplingParams(temperature=0.0, max_tokens=64)

llm = LLM(model="internlm/internlm2-chat-7b", trust_remote_code=True, tensor_parallel_size=2, dtype="half", enforce_eager=True)
outputs = llm.generate(prompts, sampling_params)

# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

Result:

Prompt: '<s><|im_start|>user\nhello<|im_end|>\n<|im_start|>assistant\n', Generated text: ' that that that that that that that which which which which which which which which which which which  and and and and and������������� <|im_end|> 是� <|im_end|> 是���是� <|im_end|> 是����� <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|> <|im_end|>'

Official Result:

Hello! How can I assist you today?
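
For reference, the official result above can be reproduced with the upstream Transformers implementation, roughly following the snippet on the model card (the chat helper comes from the model's remote code; this sketch is untested here):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "internlm/internlm2-chat-7b", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "internlm/internlm2-chat-7b", torch_dtype=torch.float16,
    trust_remote_code=True).cuda().eval()

# chat() is provided by the model's remote code and applies the
# <|im_start|>/<|im_end|> template internally.
response, history = model.chat(tokenizer, "hello", history=[])
print(response)  # expected: "Hello! How can I assist you today?"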

I don't know why the output of self.wqkv(hidden_states) is not right. Could someone help, based on my current PR?
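
For context, here is a minimal sketch of how the packed wqkv weight appears to be laid out, judging from the rearrange calls in the upstream internlm2 modeling code: query, key and value rows are interleaved per key/value group, rather than concatenated as q, then k, then v the way vLLM's qkv_proj expects. All sizes below are illustrative (taken from internlm2-chat-7b's config); this is a hypothesis about where the mismatch comes from, not the fix.

import torch

# Illustrative sizes (assumed from internlm2-chat-7b's config):
# 32 query heads, 8 key/value heads, head_dim 128, hidden_size 4096.
num_heads, num_kv_heads, head_dim, hidden_size = 32, 8, 128, 4096
num_kv_groups = num_heads // num_kv_heads  # 4 query heads share each kv head

# The checkpoint packs rows as [q_0 .. q_{groups-1}, k, v] for every kv head.
wqkv = torch.randn(num_kv_heads * (num_kv_groups + 2) * head_dim, hidden_size)

# Un-interleave into the [q; k; v] concatenation that qkv_proj expects.
w = wqkv.view(num_kv_heads, num_kv_groups + 2, head_dim, hidden_size)
wq = w[:, :num_kv_groups].reshape(num_heads * head_dim, hidden_size)
wk = w[:, -2].reshape(num_kv_heads * head_dim, hidden_size)
wv = w[:, -1].reshape(num_kv_heads * head_dim, hidden_size)
qkv_proj_weight = torch.cat([wq, wk, wv], dim=0)

If this layout is right, one plausible explanation for the TP2 garbling is that the packed weight gets row-sharded across ranks before it is un-interleaved, so each rank ends up with a mix of heads that no longer line up.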

@esmeetu esmeetu changed the title [WIP] Support internlm2 Support internlm2 model Jan 28, 2024
@esmeetu esmeetu marked this pull request as ready for review January 28, 2024 07:46
@esmeetu (Collaborator, Author) commented Jan 28, 2024

Hi, @zhuohan123. This PR now works after some fixes to weight loading, but I have two questions:

  1. Could from einops import rearrange be replaced by torch functions, so that we can drop this dependency? (See the sketch after this list.)
  2. Do you have any ideas about the difference between the wqkv weight format in the checkpoint and vLLM's default qkv_proj weight layout, based on my weight-loading implementation?

Anyway, I will explore both of these further.
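
On question 1, here is a small sketch suggesting the two rearrange patterns used in the upstream modeling code can be expressed with plain view/reshape, since neither of them permutes axes. Shapes are illustrative and this has not been checked against the real checkpoint:

import torch
from einops import rearrange

b, q_len = 2, 5
num_kv_heads, num_kv_groups, head_dim = 8, 4, 128

x = torch.randn(b, q_len, num_kv_heads * (num_kv_groups + 2) * head_dim)

# einops pattern used upstream:
y_einops = rearrange(x, "b q (h gs d) -> b q h gs d",
                     gs=num_kv_groups + 2, d=head_dim)

# pure-torch equivalent: a single view, since no axes are permuted.
y_torch = x.view(b, q_len, num_kv_heads, num_kv_groups + 2, head_dim)
assert torch.equal(y_einops, y_torch)

# The follow-up pattern "b q h gs d -> b q (h gs) d" on the query slice
# is likewise just a reshape.
q = y_torch[..., :num_kv_groups, :]
q_flat = q.reshape(b, q_len, num_kv_heads * num_kv_groups, head_dim)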

@Leymore Leymore mentioned this pull request Jan 30, 2024
@esmeetu (Collaborator, Author) commented Jan 30, 2024

Closing this since there is better support in #2666.

@esmeetu esmeetu closed this Jan 30, 2024
@esmeetu esmeetu deleted the internlm2 branch February 14, 2024 09:42