-
-
Notifications
You must be signed in to change notification settings - Fork 4.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support repetition_penalty #1424
Conversation
Hi @beginlner, Could you tell us how it is different from #1392? |
Hi, I think we implemented completely identical functions. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thank you for your contribution! Added a small style fix.
In huggingface, input_token_ids contains prompt tokens. |
I'm getting inconsistent results between HF and vllm with llama-7b @beginlner @WoosukKwon
|
Use repetition_penalty instead of frequency_penalty in vllm. And another defect is input_ids inconsistent. |
@resorcap Hi, the result of repetition_penalty is still inconsistent
|
It has the same behavior as this.