-
-
Notifications
You must be signed in to change notification settings - Fork 4.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Model][OpenVINO] Fix regressions from #8346 #10045
Conversation
Signed-off-by: Peter Salas <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
🚀 |
Thanks for the quick fix! Can you also fix the dummy data for Molmo in this PR? It is reported by #10042. |
I'm heading off soon. @ywang96 can you help with this? |
Signed-off-by: Peter Salas <[email protected]>
Done! |
To save CI costs, I've cancelled fastcheck CI - only AWS CI has to pass for the PR to be eligible for merge. |
…m-project#10045)" This reverts commit ffc0f2b.
…t#10045) Signed-off-by: Peter Salas <[email protected]> Signed-off-by: Loc Huynh <[email protected]>
…t#10045) Signed-off-by: Peter Salas <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>
Fix #10025 by adding placeholder index maps to
OpenVINOAttentionMetadata
.Fix #10042 by returning
DummyData
instead of a tuple.(As per the comment that I copied from
AttentionMetadata
, this doesn't really belong here and should probably be onModelInputs
instead, but I didn't want to diverge the OpenVINO/non-OpenVINO paths further.)