[ CB ][ SD ] Support streaming with using stop_strings and include_stop_strings
#1382
Merged
iefode
merged 24 commits into
openvinotoolkit:master
from
iefode:streaming_stop_strings
Dec 20, 2024
iefode
changed the title
from: [ CB ][ SD ] Implement streaming with using
to: [ CB ][ SD ] Implement streaming with using stop_strings
Dec 13, 2024
github-actions
bot
added
category: continuous batching
Continuous batching
category: sampling
Sampling / Decoding algorithms
category: speculative decoding
Speculative decoding
category: samples
GenAI samples
labels
Dec 13, 2024
iefode
force-pushed
the
streaming_stop_strings
branch
from
72d9348
to
0e91fae
December 16, 2024 18:22
github-actions
bot
added
category: LLM
LLM pipeline (stateful, static)
no-match-files
and removed
category: samples
GenAI samples
labels
Dec 16, 2024
iefode
changed the title
from: [ CB ][ SD ] Implement streaming with using stop_strings
to: [ CB ][ SD ] Implement streaming with using stop_strings and include_stop_strings via streamer & generation handling
Dec 17, 2024
github-actions
bot
added
the
category: GenAI C++ API
Changes in GenAI C++ public headers
label
Dec 17, 2024
github-actions
bot
removed
category: LLM
LLM pipeline (stateful, static)
category: speculative decoding
Speculative decoding
category: GenAI C++ API
Changes in GenAI C++ public headers
labels
Dec 17, 2024
…enai into streaming_stop_strings
iefode
changed the title
from: [ CB ][ SD ] Implement streaming with using stop_strings and include_stop_strings via streamer & generation handling
to: [ CB ][ SD ] Support streaming with using stop_strings and include_stop_strings
Dec 20, 2024
samples/cpp/speculative_decoding_lm/speculative_decoding_lm.cpp
Outdated
ilya-lavrenov
approved these changes
Dec 20, 2024
github-merge-queue
bot
removed this pull request from the merge queue due to failed status checks
Dec 20, 2024
github-merge-queue
bot
removed this pull request from the merge queue due to failed status checks
Dec 20, 2024
github-merge-queue
bot
removed this pull request from the merge queue due to failed status checks
Dec 20, 2024
Labels
category: continuous batching
Continuous batching
category: LLM
LLM pipeline (stateful, static)
category: sampling
Sampling / Decoding algorithms
no-match-files
Details:
- stop_strings in CB-like pipelines
- stop_string_match logic to encode them only once per request
- stop_string (tests were changed slightly in this case because HF does not support excluding stop_strings)
Tickets:
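To illustrate the behaviour listed under Details, here is a minimal Python sketch of how a streamer could withhold text that might be the beginning of a stop string and then drop or keep the match depending on an include flag. It is not the code from this PR: `StopStringFilter`, `stream_text`, and the `include_stop_strings` parameter are hypothetical names chosen for the example, and it works on already-decoded text rather than token IDs.

```python
from typing import Iterable, List


class StopStringFilter:
    """Hypothetical helper: filters streamed text chunks against stop strings."""

    def __init__(self, stop_strings: Iterable[str], include_stop_strings: bool = False):
        self.stop_strings: List[str] = [s for s in stop_strings if s]
        self.include_stop_strings = include_stop_strings
        self.pending = ""      # text withheld because it may start a stop string
        self.finished = False  # set once a full stop string has been seen

    def _held_back_len(self) -> int:
        # Longest suffix of `pending` that is a proper prefix of some stop string.
        longest = 0
        for stop in self.stop_strings:
            for k in range(min(len(stop) - 1, len(self.pending)), 0, -1):
                if self.pending.endswith(stop[:k]):
                    longest = max(longest, k)
                    break
        return longest

    def put(self, chunk: str) -> str:
        """Feed one decoded chunk and return the text that is safe to emit now."""
        if self.finished:
            return ""
        self.pending += chunk
        # A full match ends the stream; the earliest match position wins.
        hits = [(self.pending.find(s), s) for s in self.stop_strings if s in self.pending]
        if hits:
            pos, stop = min(hits)
            self.finished = True
            end = pos + len(stop) if self.include_stop_strings else pos
            out, self.pending = self.pending[:end], ""
            return out
        # No full match: emit everything except a possible partial match at the tail.
        cut = len(self.pending) - self._held_back_len()
        out, self.pending = self.pending[:cut], self.pending[cut:]
        return out

    def end(self) -> str:
        """Flush withheld text when generation ends without a stop string."""
        out, self.pending = self.pending, ""
        return out


def stream_text(chunks: Iterable[str], stop_strings: Iterable[str],
                include_stop_strings: bool = False) -> str:
    """Run all chunks through the filter and return what a user would see."""
    flt = StopStringFilter(stop_strings, include_stop_strings)
    shown = []
    for chunk in chunks:
        shown.append(flt.put(chunk))
        if flt.finished:
            break
    shown.append(flt.end())
    return "".join(shown)
```

For example, `stream_text(["Hello", " wor", "ld<", "/s", ">ignored"], ["</s>"])` would show only the text before the stop string, while passing `include_stop_strings=True` would also show the stop string itself. Withholding the tail costs a small delay in output but avoids printing a fragment of a stop string that later has to be retracted.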