Add Qwen pipeline and example #12292
Conversation
e8dde16 to 489d351 (Compare)
Others LGTM.
parser.add_argument(
    "--repo-id-or-model-path",
    type=str,
    default="Qwen/Qwen2-7B-Instruct",
If "Qwen/Qwen2.5-7B-Instruct" can directly run with current code, maybe use "Qwen/Qwen2.5-7B-Instruct" as default ?
We can also update this in next PR 😊
Sure, changed the default to 2.5 and renamed the file to qwen.py.
Will test Qwen2.5 soon.
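For reference, a minimal sketch of what the updated argument could look like with the new default; the parser setup and help wording here are illustrative and may not match the merged qwen.py exactly:

import argparse

# Sketch only: reflects the suggestion to make Qwen2.5 the default model.
parser = argparse.ArgumentParser()
parser.add_argument(
    "--repo-id-or-model-path",
    type=str,
    default="Qwen/Qwen2.5-7B-Instruct",  # was "Qwen/Qwen2-7B-Instruct"
    help="The huggingface repo id for the Qwen model to be downloaded"
         ", or the path to the huggingface checkpoint folder",
)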
default="Qwen/Qwen2-7B-Instruct", | ||
help="The huggingface repo id for the Baichuan2 model to be downloaded" | ||
", or the path to the huggingface checkpoint folder", | ||
) |
Maybe add parser.add_argument("--lowbit-path", type=str, ...) here.
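A rough sketch of what such an argument might look like, added to the same parser as in the hunks above; the default and help text below are assumptions for illustration, not the wording used in the repository:

# Assumed sketch of the suggested --lowbit-path argument; the real help text
# and semantics in the repository may differ.
parser.add_argument(
    "--lowbit-path",
    type=str,
    default="",
    help="Path for saving/loading the converted low-bit model; "
         "leave empty to convert from the original checkpoint on each run",
)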
parser.add_argument("--n-predict", type=int, default=32, help="Max tokens to predict") | ||
parser.add_argument("--max-context-len", type=int, default=1024) | ||
parser.add_argument("--max-prompt-len", type=int, default=960) | ||
parser.add_argument("--disable-transpose-value-cache", action="store_true", default=False) |
It looks like Qwen2 GW is already ready? If so, maybe add parser.add_argument("--quantization_group_size", type=int, default=0) here?
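If group-wise quantization does get enabled for this example, the flag could be added to the same parser roughly as below; treating default=0 as "no grouping" is an assumption, not something confirmed in this PR:

# Sketch of the suggested flag; interpreting 0 as "no group-wise quantization"
# is an assumption, and how the value is consumed downstream (e.g. passed to
# the model-loading call) would follow the other NPU examples.
parser.add_argument("--quantization_group_size", type=int, default=0)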
Not tested yet, will test and update in a following PR.
Please also update the README; others LGTM.
README updated.
Merge it first. Will fix any issues in a following PR.