[Inference] Generate multiple sequences for single prompt #52

xwu99 · 2024-01-11T08:12:36Z

No description provided.

xwu99 · 2024-01-17T14:39:58Z

@KepingYan, could you update this to streamline the vLLM output after #20 is merged? There are two options:

Option 1:

A request that includes a single prompt returns a single output.
A request that includes multiple prompts returns the outputs as a list.

Option 2:

A request that includes either a single prompt or multiple prompts always returns the output(s) as a list.
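
To make the difference concrete, here is a minimal sketch of the two return conventions. The names `fake_model`, `generate_option1`, and `generate_option2` are hypothetical and only stand in for the real predictor; this is not the project's actual API.

```python
from typing import Callable, List, Union


def fake_model(prompt: str) -> str:
    # Hypothetical stand-in for the real predictor: upper-cases the prompt.
    return prompt.upper()


def generate_option1(
    prompts: Union[str, List[str]],
    model: Callable[[str], str] = fake_model,
) -> Union[str, List[str]]:
    # Option 1: the return type mirrors the input type.
    if isinstance(prompts, str):
        return model(prompts)
    return [model(p) for p in prompts]


def generate_option2(
    prompts: Union[str, List[str]],
    model: Callable[[str], str] = fake_model,
) -> List[str]:
    # Option 2: normalize to a list first, then always return a list.
    if isinstance(prompts, str):
        prompts = [prompts]
    return [model(p) for p in prompts]
```

With Option 1 the caller has to check whether the result is a string or a list; with Option 2 the caller can always iterate over the result, at the cost of unwrapping a one-element list for single prompts.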

Could you check how OpenAI deals with multiple prompts so that we can align with it?
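
For reference while checking, the sketch below shows the response shape as I understand it from OpenAI's legacy Completions API: `prompt` may be a string or a list of strings, `n` sets the number of completions per prompt, and the response always carries a `choices` list. The helper `build_choices`, the placeholder text, and the exact index ordering are assumptions for illustration and should be verified against the official API reference.

```python
from typing import Dict, List, Union


def build_choices(prompts: Union[str, List[str]], n: int = 1) -> Dict[str, list]:
    # Hypothetical helper: mimics a response that is always a list of choices,
    # whether the request carried one prompt or several.
    if isinstance(prompts, str):
        prompts = [prompts]
    choices = []
    for p_idx, prompt in enumerate(prompts):
        for k in range(n):
            choices.append(
                {
                    # Assumed ordering: choices grouped by prompt, then by sample.
                    "index": p_idx * n + k,
                    # Placeholder text; a real server would put generated text here.
                    "text": f"<completion {k} for {prompt!r}>",
                }
            )
    return {"choices": choices}
```

If that reading is right, aligning with OpenAI would correspond to Option 2, with `n` covering the "multiple sequences for a single prompt" case from the issue title.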

…atron DDP pretrainers (intel#52) * add eval loader Signed-off-by: Zhi Lin <[email protected]> * use step saved in checkpoint in megatron dataset Signed-off-by: Zhi Lin <[email protected]> --------- Signed-off-by: Zhi Lin <[email protected]>

xwu99 changed the title ~~[Inference] generate multiple sequences for single prompt~~ [Inference] Generate multiple sequences for single prompt Jan 11, 2024

xwu99 mentioned this issue Jan 17, 2024

Add vllm Predictor #20

Merged

xwu99 linked a pull request Jan 29, 2024 that will close this issue

[Serving] Support multiple prompts for generate #62

Merged

xwu99 closed this as completed Jan 29, 2024
