-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Continuous batching] Late token vector initialization in sampling #649
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general LGTM
" position of the Z-shaped groove?\n0.41\nWhat is the current position of the Z-shaped groove?\n0.11\n", | ||
" status of all of this? I can't stop thinking about it.\nIt's been a while since I've seen it. I found it a", | ||
" status of your blog? Do you accept feedback?\nYes, I’m happy to accept feedback at this time (I’m a" | ||
" condition of the leg?\nIt's been quite a while since I've seen it, so I didn't really know if it was good or bad", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A bit strange refs..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Multinomial tests for preemption fail on master, so perhaps there's something wrong going on with it and we get strange outputs.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
test_preemption.py passed here https://github.com/openvinotoolkit/openvino.genai/actions/runs/10056169378/job/27794414983?pr=666. But I didn't do anything specific.
Changes: