T5 model support #404
Comments
Any updates regarding this?
In #187 (comment), js8544 mentioned that his team would be working on it in November, but there have been no updates since then.
Closing as duplicate of #187.
jikunshang pushed a commit to jikunshang/vllm that referenced this issue on Oct 24, 2024:
…t#404) This PR removes the usage of custom HPU RotaryEmbedding modules, and adds a forward_hpu method to existing RotaryEmbedding, for reusing multiple derived implementations without the need of adding them to HPU extension. Mark_steps should not be needed within the test, but for whatever reason, if they are not there, PT bridge crashes. To be investigated later on. It does not affect actual model execution in any way I could test/observe.
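As a rough illustration of the pattern the commit message describes (one platform-specific method on the base class that derived rotary embeddings inherit), here is a minimal pure-Python sketch. It is not vLLM's code: the class names, the `platform` argument, the `_angle` helper, and the list-based math are all assumptions made for illustration, and `forward_hpu` simply reuses the reference math where a real implementation would call a device kernel.

```python
import math


class RotaryEmbedding:
    """Toy stand-in for a rotary embedding layer (hypothetical, not vLLM's class)."""

    def __init__(self, base=10000.0, platform="native"):
        self.base = base
        self.platform = platform  # hypothetical switch: "native" or "hpu"

    def _angle(self, position, pair_index, dim):
        # Rotation angle for one (even, odd) pair of vector components.
        return position / (self.base ** (2 * pair_index / dim))

    def forward_native(self, position, vec):
        # Reference path: rotate each consecutive pair of components.
        dim = len(vec)
        out = []
        for i in range(0, dim, 2):
            theta = self._angle(position, i // 2, dim)
            c, s = math.cos(theta), math.sin(theta)
            x, y = vec[i], vec[i + 1]
            out += [x * c - y * s, x * s + y * c]
        return out

    def forward_hpu(self, position, vec):
        # Placeholder for a device-specific path. In this sketch it only
        # delegates to the reference math; the point is that it lives on
        # the base class, so subclasses get it without extra code.
        return self.forward_native(position, vec)

    def forward(self, position, vec):
        # Dispatch on the (assumed) platform flag.
        if self.platform == "hpu":
            return self.forward_hpu(position, vec)
        return self.forward_native(position, vec)


class LinearScalingRotaryEmbedding(RotaryEmbedding):
    """Derived variant: only the angle computation changes."""

    def __init__(self, scaling_factor, **kwargs):
        super().__init__(**kwargs)
        self.scaling_factor = scaling_factor

    def _angle(self, position, pair_index, dim):
        # Scale the position down before computing the angle.
        return super()._angle(position / self.scaling_factor, pair_index, dim)
```

In this sketch, `LinearScalingRotaryEmbedding(2.0, platform="hpu").forward(4, vec)` goes through the inherited `forward_hpu` without the subclass defining it, which is the reuse the commit message is aiming for.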
Any plans on adding T5 models?
The FLAN-T5 models are particularly interesting.