Add repetition_penalty aligned with huggingface #866
Conversation
I am waiting for this; I need some way to make HF and vLLM compatible when I use penalty arguments.
I have tested repetition_penalty under greedy search mode with Llama 7B and Baichuan models. The results are all aligned with Hugging Face Transformers.
Co-authored-by: Dong-Yong Lee <[email protected]>
This PR solved my problem. For my Llama 2 model, repetition_penalty is necessary; otherwise the result is incorrect.
Great! Looking forward to testing this, as my perception is that vLLM currently handles repetition very poorly, and that the existing repetition penalty params (frequency_penalty etc.) do little or nothing. Although part of that problem is that there's no per-request seed, something we also really need.
Beautiful!
Very nice
Hey, what's wrong? I did not find the repetition_penalty attribute in
@@ -162,30 +170,61 @@ def _apply_penalties(
        indices.append(i)

    # Return early if all sequences have zero penalties.
Is this comment misleading now?
logits[indices] -= frequency_penalties.unsqueeze(dim=1) * bin_counts
presence_mask = (bin_counts > 0.0).to(dtype=logits.dtype)
logits[indices] -= presence_penalties.unsqueeze(dim=1) * presence_mask
else:
Why `else`? Will `presence_penalty` and `repetition_penalty` work together?
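To illustrate the question, here is a minimal sketch of how frequency, presence, and repetition penalties could all apply to the same row of logits rather than being mutually exclusive. The function name, standalone form, and `bin_counts` argument are illustrative assumptions, not vLLM's actual code:

```python
import torch

def apply_penalties(logits: torch.Tensor,
                    bin_counts: torch.Tensor,
                    presence_penalty: float,
                    frequency_penalty: float,
                    repetition_penalty: float) -> torch.Tensor:
    # Frequency penalty: subtract in proportion to how often each token appeared.
    logits = logits - frequency_penalty * bin_counts
    # Presence penalty: subtract a flat amount for every token that appeared at all.
    presence_mask = (bin_counts > 0).to(logits.dtype)
    logits = logits - presence_penalty * presence_mask
    # Repetition penalty (CTRL-style, as in HF): rescale logits of seen tokens,
    # dividing positive values and multiplying negative ones.
    seen = bin_counts > 0
    rescaled = torch.where(logits > 0,
                           logits / repetition_penalty,
                           logits * repetition_penalty)
    return torch.where(seen, rescaled, logits)
```

Under a composition like this, no `else` branch is needed; whether the PR gates the penalties behind an `if`/`else` is exactly what this review question is probing.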
@WoosukKwon please check if the fix can be merged.
@Abraham-Xu @heshuguo This PR conflicts a bit with #1048, which refactors the sampler code. @zhuohan123 Can we merge this PR before yours?
Hi @Abraham-Xu, sorry for the late response. Could you update the PR? There are some merge conflicts because we refactored the sampler in #1048.
I want to know if
Supported with #1424. But again, thank you for your contribution! Let us know if there is any other issue.
Sorry for the late reply; I have been busy with work and other things recently. It is great to support repetition penalty in any way.
In the huggingface/transformers generate method:
https://github.com/huggingface/transformers/blob/ae320fa53f74cc4dfa0e4fc3c95b6129a86b0512/src/transformers/generation/utils.py#L1295
https://github.com/huggingface/transformers/blob/ae320fa53f74cc4dfa0e4fc3c95b6129a86b0512/src/transformers/generation/utils.py#L1540
https://github.com/huggingface/transformers/blob/ae320fa53f74cc4dfa0e4fc3c95b6129a86b0512/src/transformers/generation/utils.py#L2457
Specifically, repetition penalty is a frequently used pre-processing step in logits_processor. It prevents repetition through a penalty: this penalized sampling works by discounting the scores of previously generated tokens (https://arxiv.org/pdf/1909.05858.pdf).
I introduced the repetition penalty pre-processing step to sampler.py, aligned with the Hugging Face implementation:
https://github.com/huggingface/transformers/blob/ae320fa53f74cc4dfa0e4fc3c95b6129a86b0512/src/transformers/generation/logits_process.py#L328
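For reference, a sketch of the sign-dependent rescaling the linked processor performs. The standalone function form is an illustrative simplification; in Transformers this logic lives in `RepetitionPenaltyLogitsProcessor.__call__`:

```python
import torch

def apply_repetition_penalty(input_ids: torch.LongTensor,
                             scores: torch.FloatTensor,
                             penalty: float) -> torch.FloatTensor:
    # Gather the current scores of tokens that already appear in the context.
    score = torch.gather(scores, 1, input_ids)
    # penalty > 1 discourages repetition: negative scores become more negative,
    # positive scores shrink toward zero.
    score = torch.where(score < 0, score * penalty, score / penalty)
    # Write the rescaled scores back at the same vocabulary positions.
    return scores.scatter(1, input_ids, score)
```

A penalty of 1.0 is a no-op; values above 1.0 push previously seen tokens down, which is why positive scores are divided while negative ones are multiplied.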