Align with huggingface beam search #646

hsm1997 · 2023-08-02T11:39:49Z

main modifications

sampler
- always sample 2 * num_beams tokens, even if beam_width < num_beams
scheduler
- update: keep at-most num_beams finished seqs (beam_hyps) and at-most num_beams running seqs in seq_group.
- move _decode and _stop function to scheduler (from llm_engine).
sequence
- move "data" attributes from Sequence to SequenceData

example

prompt: "What is deep learning?"
sampling_params: SamplingParams(temperature = 0, use_beam_search = True, n = 5, max_tokens = 100)
model & tokenizer: A finetuned Llama
original beam search returns:

 * Learning is deep type of machine learning that involves using building and neural networks to model and solve complex problems.
 * Deep learning is a subset of artificial intelligence that involves the use of artificial neural networks to model and solve complex problems.
 * It's a type of Artificial allows training artificial neural networks to recognize patterns in data. problems that require
 * the not a word machine learning that involves building and training neural networks to model and generalize knowledge problems that Deep learning is inspired by the structure and function of the human brain and is used for a variety of applications including natural language processing, image and speech recognition, and decision making.
 * A is a the subfield of artificial intelligence that involves the use of artificial neural networks to model and solve complex problems. Deep learning is inspired by the structure and function of the human brain and is used for a variety of applications including natural language processing, image and speech recognition, and decision making.

modified beam search:

 * Deep learning is a subfield of artificial intelligence that involves the use of artificial neural networks to model and solve problems that require high-level processing, such as natural language processing, image and speech recognition, and decision making.
 * Deep learning is a subset of artificial intelligence that involves the use of artificial neural networks to model and solve complex problems. It is inspired by the structure and function of the human brain and is used for a variety of applications such as natural language processing, image and speech recognition, and decision making.
 * Deep learning is a subfield of artificial intelligence that involves the use of artificial neural networks to model and solve complex problems. Deep learning is inspired by the structure and function of the human brain and is used for a variety of applications including natural language processing, image and speech recognition, and decision making.
 * Deep learning is a subset of artificial intelligence that involves the use of artificial neural networks to model and solve complex problems. It is inspired by the structure and function of the human brain and is used for a variety of applications such as natural language processing, image and speech recognition, and autonomous decision making.
 * Deep learning is a subfield of artificial intelligence that involves the use of artificial neural networks to model and solve problems that require high-level processing, such as natural language processing, image and speech recognition, and decision making. Deep learning is inspired by the structure and function of the human brain and is used to model and solve complex problems.

hsm1997 · 2023-08-02T12:10:49Z

related issue: #344 #644

zhuohan123 · 2023-08-09T22:15:47Z

@hsm1997 Thank you for your great contribution! The changes you make are a bit complicated. Can we schedule a chat to discuss about this PR? I cannot find your email address. Can you send me an email at zhuohan[at]berkeley.edu? Thanks again!

leiwen83 · 2023-08-12T14:04:52Z

A minor fix with this PR:

diff --git a/vllm/core/scheduler.py b/vllm/core/scheduler.py
index e2ca127..11eea5e 100644
--- a/vllm/core/scheduler.py
+++ b/vllm/core/scheduler.py
@@ -342,7 +342,7 @@ class Scheduler:
                     continue

                 # schedule next-beam tasks
-                pending = pending[:sampling_params.n]
+                pending = pending[:sampling_params.best_of]
                 running_ids = [
                     seq.seq_id for seq in seq_group.get_seqs(
                         status=SequenceStatus.RUNNING)

We need to keep the history with best_of setting, or we may lose the highest score, since n in sampling_params only means for the output token, not in the searching stage.

zhuohan123 · 2023-09-05T00:30:38Z

Close this PR since #857 is merged. Thanks @hsm1997 again for finding the issue and the draft PR!

hsm1997 changed the title ~~align with huggingface beam search.~~ align with huggingface beam search Aug 2, 2023

hsm1997 force-pushed the better_beam_search branch from 1558da0 to 086b848 Compare August 3, 2023 04:20

hsm1997 closed this Aug 3, 2023

hsm1997 force-pushed the better_beam_search branch 2 times, most recently from 086b848 to aa84c92 Compare August 3, 2023 04:22

hsm1997 reopened this Aug 3, 2023

thisissum mentioned this pull request Aug 3, 2023

beam search bug #644

Closed

hsm1997 force-pushed the better_beam_search branch 2 times, most recently from 5eb3ab4 to 2374db0 Compare August 7, 2023 09:15

align with huggingface beam search.

f1e5c4a

hsm1997 force-pushed the better_beam_search branch from 2374db0 to f1e5c4a Compare August 9, 2023 08:06

hsm1997 changed the title ~~align with huggingface beam search~~ Align with huggingface beam search Aug 9, 2023

zhuohan123 self-requested a review August 12, 2023 02:51

zhuohan123 mentioned this pull request Aug 24, 2023

Align vLLM's beam search implementation with HF generate #857

Merged

4 tasks

zhuohan123 force-pushed the main branch from 3affdce to 0080d83 Compare August 30, 2023 09:26

This was referenced Aug 31, 2023

Add tests for models #922

Merged

Bump up the version to v0.1.5 #944

Merged

zhuohan123 closed this Sep 5, 2023

efsotr mentioned this pull request Mar 19, 2024

[Bug]: Under the beam search setting, the output is abnormal #3498

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Align with huggingface beam search #646

Align with huggingface beam search #646

hsm1997 commented Aug 2, 2023 •

edited

Loading

hsm1997 commented Aug 2, 2023 •

edited

Loading

zhuohan123 commented Aug 9, 2023

leiwen83 commented Aug 12, 2023 •

edited

Loading

zhuohan123 commented Sep 5, 2023

Align with huggingface beam search #646

Align with huggingface beam search #646

Conversation

hsm1997 commented Aug 2, 2023 • edited Loading

main modifications

example

hsm1997 commented Aug 2, 2023 • edited Loading

zhuohan123 commented Aug 9, 2023

leiwen83 commented Aug 12, 2023 • edited Loading

zhuohan123 commented Sep 5, 2023

hsm1997 commented Aug 2, 2023 •

edited

Loading

hsm1997 commented Aug 2, 2023 •

edited

Loading

leiwen83 commented Aug 12, 2023 •

edited

Loading