Add logits processors to enable logit_bias in OpenAI server #535
Conversation
```python
if logits_processors is not None:
    for logits_processor in logits_processors:
        logits = logits_processor(logits, output_tokens)
```
In this call, you pass output_tokens to logits_processor(). However, in the LogitsProcessor interface, the output_tokens parameter does not exist:

```python
def __call__(self, logits: torch.tensor) -> torch.tensor:
```

How does it work?
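For reference, a processor matching the two-argument call site above might look like the following. This is a minimal sketch assuming that interface; the base-class shape and the NoRepeatProcessor example are illustrative, not code from the PR:

```python
# Minimal sketch, assuming the two-argument interface implied by the call
# site above; class and parameter names are illustrative, not from the PR.
from typing import List

import torch


class LogitsProcessor:
    def __call__(self, logits: torch.Tensor,
                 output_tokens: List[int]) -> torch.Tensor:
        # Identity processor: return the logits unchanged.
        return logits


class NoRepeatProcessor(LogitsProcessor):
    """Example: forbid sampling any token that was already generated."""

    def __call__(self, logits: torch.Tensor,
                 output_tokens: List[int]) -> torch.Tensor:
        if output_tokens:
            # Set the logits of previously generated token ids to -inf.
            logits[list(set(output_tokens))] = -float("inf")
        return logits
```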
Thank you for your PR! Now that #1469 is merged with the logits_processors API, can you help rebase and use that instead?
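For context, the logits_processors API referenced above can be used roughly like this. This is a hedged sketch assuming a processor is a callable taking the generated token ids and the logits tensor; exact signatures may differ across vLLM versions, and the model name and ban_token helper are illustrative:

```python
# Sketch of the logits_processors API (assumption: a processor is a callable
# taking the generated token ids and the logits tensor; details may vary
# between vLLM versions).
from typing import List

import torch
from vllm import LLM, SamplingParams


def ban_token(token_ids: List[int], logits: torch.Tensor) -> torch.Tensor:
    # Make token id 42 impossible to sample.
    logits[42] = -float("inf")
    return logits


llm = LLM(model="facebook/opt-125m")
params = SamplingParams(max_tokens=16, logits_processors=[ban_token])
outputs = llm.generate(["Hello, my name is"], params)
print(outputs[0].outputs[0].text)
```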
@zacharyblank Any update? This will be great for advanced users. vllm/outlines is nice, but nothing beats a pure code-based state machine when it comes to performance.
Thanks for your contribution. Closing this PR, as this feature is now supported in #3027.
Co-authored-by: sang <[email protected]>
This PR makes it possible to define custom logits processors that alter the probability of token generation based on user-defined code. This also allows the vLLM OpenAI server to accept requests with logit_bias. The BiasLogitsProcessor is included specifically for the OpenAI server to handle requests with logit_bias.
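A minimal sketch of what such a bias processor could look like, assuming the two-argument processor interface used elsewhere in this PR; the PR's actual BiasLogitsProcessor may differ in detail:

```python
# Hypothetical sketch of a bias processor for OpenAI-style logit_bias
# requests; the PR's actual BiasLogitsProcessor may differ.
from typing import Dict, List

import torch


class BiasLogitsProcessor:
    """Adds a fixed additive bias to the logits of selected token ids.

    `biases` maps token id -> bias, typically in [-100, 100] as in the
    OpenAI API's logit_bias parameter.
    """

    def __init__(self, biases: Dict[int, float]):
        self.biases = biases

    def __call__(self, logits: torch.Tensor,
                 output_tokens: List[int]) -> torch.Tensor:
        for token_id, bias in self.biases.items():
            logits[token_id] += bias
        return logits
```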