-
-
Notifications
You must be signed in to change notification settings - Fork 5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Model] Add support for Qwen2MoeModel #3346
Conversation
9962bfe
to
c900e04
Compare
@simon-mo hi, our codes are merged into HF transformers already. would you mind taking a look? |
@esmeetu Could you take a look? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left minor comments. And this look good to me.
@esmeetu it seems some unrelated tests failed, should we rerun it? or could you just merge this directly? |
@wenyujin333 There's a required check failed. Can you trigger this CI check again by submit a empty commit? |
Recently, the new codes of Qwen2MoE to Hugging Face Transformers, and thus I would love to contribute the new model to vLLM as well.
In this PR, I have provided the implementation of Qwen2MoE model and add some notes on it.