
Support Qwen in model builder #739

Merged — 3 commits merged into microsoft:main on Aug 1, 2024
Conversation

BowenBao (Contributor) commented Jul 31, 2024

Resolves #718

cc @spandantiwari-amd

BowenBao (Contributor, Author)

cc @kunal-vaishnavi for review

kunal-vaishnavi (Contributor)

Thanks for the contribution! Can you also update the following places?

  1. Add the qwen2 model type that will show up in genai_config.json here
  2. Add Qwen to the README here to show that Qwen is now supported
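
For illustration, step 1 usually amounts to extending a lookup from the Hugging Face architecture name to the model type string written into genai_config.json. The mapping name, function, and dictionary contents below are a hypothetical sketch, not the actual model builder code from this PR:

```python
# Hypothetical sketch of the architecture -> genai_config.json "type" mapping.
# The dict name, entries, and function are illustrative assumptions, not the
# real onnxruntime-genai model builder source.
ARCH_TO_MODEL_TYPE = {
    "LlamaForCausalLM": "llama",
    "MistralForCausalLM": "mistral",
    "Qwen2ForCausalLM": "qwen2",  # the kind of entry this PR adds (illustrative)
}

def model_type_for(architecture: str) -> str:
    """Return the genai_config.json model type for a HF architecture name."""
    try:
        return ARCH_TO_MODEL_TYPE[architecture]
    except KeyError:
        raise NotImplementedError(f"Unsupported architecture: {architecture}")

print(model_type_for("Qwen2ForCausalLM"))  # qwen2
```

With an entry like this in place, a converted Qwen2 model would report `"type": "qwen2"` in its generated genai_config.json.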

BowenBao (Contributor, Author)

Thanks @kunal-vaishnavi, updated.

I realized we will also need to introduce the Qwen tokenizer for it to run e2e in OGA. I'm putting Qwen in the "Under Development" column for now.

yufenglee (Member)

> I realized we will need to introduce the Qwen tokenizer as well for it to run e2e in OGA. I'm putting Qwen in "Under Development" column for now.

What tokenizer does Qwen use? Does it use the same tokenizer as Llama?

yufenglee (Member)

> What tokenizer does Qwen use? Does it use same tokenizer as llama?

Synced up with @wenbingl. A small change is needed to support the Qwen tokenizer; he will add it.

wenbingl (Member)

> Synced up with @wenbingl. There needs a small change to support Qwen tokenizer. He will add the change.

A quick PR is here: microsoft/onnxruntime-extensions#781

BowenBao (Contributor, Author)

Thank you for the quick response @wenbingl, @yufenglee! I have validated that your fix enables Qwen generation e2e.

Should I wait for the PR in extensions to merge, and then include the commit update in this PR?

kunal-vaishnavi (Contributor)

> Should I wait for the PR in extension to merge, and then include the commit update in this PR?

Yes, let's wait so that Qwen can be supported end-to-end with this PR.

wenbingl (Member) commented Aug 1, 2024

> Should I wait for the PR in extension to merge, and then include the commit update in this PR?

The PR in ort-extensions was merged.

BowenBao (Contributor, Author) commented Aug 1, 2024

Updated. @kunal-vaishnavi PTAL.

@yufenglee yufenglee merged commit 1fb20f8 into microsoft:main Aug 1, 2024
11 of 13 checks passed