Min_tokens #688
Comments
I believe this is possible. You should be able to force the generation not to finish in this function: vllm/vllm/engine/llm_engine.py (line 419 at commit 79af7e9).
It would be great if you could contribute this feature!
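To illustrate the idea of forcing generation not to finish, here is a minimal, library-independent sketch. It is not vLLM's code: `suppress_eos`, `greedy_generate`, and the toy `next_logits` callback are hypothetical names, and it assumes a minimum length can be enforced by masking the end-of-sequence logit until enough tokens have been produced.

```python
import math
from typing import Callable, List


def suppress_eos(logits: List[float], num_generated: int,
                 min_tokens: int, eos_token_id: int) -> List[float]:
    """Return a copy of logits with EOS masked while below min_tokens."""
    masked = list(logits)
    if num_generated < min_tokens:
        # -inf guarantees EOS can never be the argmax (or be sampled).
        masked[eos_token_id] = -math.inf
    return masked


def greedy_generate(next_logits: Callable[[List[int]], List[float]],
                    min_tokens: int, max_tokens: int,
                    eos_token_id: int) -> List[int]:
    """Greedy decoding loop that cannot stop before min_tokens tokens."""
    tokens: List[int] = []
    while len(tokens) < max_tokens:
        logits = suppress_eos(next_logits(tokens), len(tokens),
                              min_tokens, eos_token_id)
        token = max(range(len(logits)), key=logits.__getitem__)
        if token == eos_token_id:
            break  # EOS is only reachable once min_tokens is satisfied
        tokens.append(token)
    return tokens


# Toy "model" (hypothetical): always prefers EOS (id 0), then token 1.
def toy_next_logits(tokens: List[int]) -> List[float]:
    return [5.0, 1.0, 0.5]
```

With this toy model, `greedy_generate(toy_next_logits, min_tokens=3, max_tokens=10, eos_token_id=0)` returns `[1, 1, 1]`, whereas `min_tokens=0` stops immediately and returns `[]`.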
So, has it been implemented now?
You could use a negative length penalty (lines 63 to 64 at commit 925f333).
This has now been implemented and will be in the next version: #3124
Hi! Thanks for your work.
A quick question: is it possible to have a min_token parameter?
Thanks