Bug-fixing and interface update #2

vvchernov · 2023-12-27T13:49:17Z

Refactor for octoml#82 PR

Rebase to batch-serving
Additional update new OpenAI API for logprobs
Fix small issues
Fix mypy

* wip * wip * fixed * fix missing finish reason for EOS and do length check early * optimize stop sequence handling * fixed * use list for stopped seqs * Return FinishReason.Stop with last non-empty delta * add a lock around stopped_sequences processing * remove sequence_map entries after a request is finished * fix handler

* wip * bring back assert * fix overcounted partially shared token counts

* Add sampling penalties and logit bias This PR adds the sampling penlaties of frequency penalty and presence penalty. Also it adds the logit bias. * fix * fix lint * vectorized sampling * apply code review suggestions * fix

This PR fixes the `SamplingParams.logit_bias_index` and `SamplingParams.logit_bias_value` if `SamplingParams.logit_bias` is `None`.

* remove mlc_llm dependency * use staging engine by default * wip * done

This PR fixes the frequency/presence penalty for first token.

add repetition penalty

* Update exception chaining for mlc-serve * Add more support for using contextvars

surpress noisy logging

* Fix async_connector.py * Fix * --amend

Init with tests. Server working. Major fix, serve working great. Minor fix and tests. Remove extra line. fix log_softmax use constant for number of top logprobs small clean upstream to new OpenAI API Co-authored-by: Valery Chernov <[email protected]>

…update_logprob

zxybazh

Thanks @vvchernov for sending in this patch! LGTM.

masahi and others added 17 commits December 19, 2023 19:32

Simplify allocated token counts management (mlc-ai#127)

56562e9

* wip * bring back assert * fix overcounted partially shared token counts

Add sampling penalties and logit bias (mlc-ai#125)

624a99a

* Add sampling penalties and logit bias This PR adds the sampling penlaties of frequency penalty and presence penalty. Also it adds the logit bias. * fix * fix lint * vectorized sampling * apply code review suggestions * fix

Fix logit_bias index and value (mlc-ai#128)

d10a452

This PR fixes the `SamplingParams.logit_bias_index` and `SamplingParams.logit_bias_value` if `SamplingParams.logit_bias` is `None`.

[Refactor] Misc Code Improvements (mlc-ai#129)

e87f690

* remove mlc_llm dependency * use staging engine by default * wip * done

Fix frequency/presence penalty (mlc-ai#130)

7de88ee

This PR fixes the frequency/presence penalty for first token.

Add repetition penalty (mlc-ai#131)

3270d50

add repetition penalty

Update exception chaining for mlc-serve (mlc-ai#132)

3d7adc3

A few more logging changes including context vars. (mlc-ai#133)

f2035c3

* Update exception chaining for mlc-serve * Add more support for using contextvars

Fix length discrepancy for decode after cache eviction (mlc-ai#134)

7517c02

Surppress noisy logging (mlc-ai#135)

b9ef62d

surpress noisy logging

Fix async_connector.py (mlc-ai#136)

6d79c1d

* Fix async_connector.py * Fix * --amend

fixes from Iliya Kozulin

01e5bb2

fix logprobs with zero dimension

1d38559

Fix token decoding for top logprobs in response

513a86b

update logprob response classes

214e610

vvchernov force-pushed the vc/update_logprob branch from 97d0a1b to 214e610 Compare December 28, 2023 11:49

Merge branch 'feature/2023-11-22/enable-mlc-server-logprobs' into vc/…

43b4625

…update_logprob

vvchernov force-pushed the vc/update_logprob branch 7 times, most recently from ea7a66c to 901b16e Compare December 29, 2023 07:07

several fixes and clean to align mypy

cac70ac

vvchernov force-pushed the vc/update_logprob branch from 901b16e to cac70ac Compare December 29, 2023 08:59

vvchernov mentioned this pull request Dec 29, 2023

Enable Logprobs in MLC Batch Serving octoml/mlc-llm#82

Merged

zxybazh approved these changes Dec 31, 2023

View reviewed changes

zxybazh merged commit 5bb9fd2 into zxybazh:feature/2023-11-22/enable-mlc-server-logprobs Dec 31, 2023
1 check passed

vvchernov deleted the vc/update_logprob branch January 4, 2024 09:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug-fixing and interface update #2

Bug-fixing and interface update #2

vvchernov commented Dec 27, 2023 •

edited

Loading

zxybazh left a comment

Bug-fixing and interface update #2

Bug-fixing and interface update #2

Conversation

vvchernov commented Dec 27, 2023 • edited Loading

zxybazh left a comment

Choose a reason for hiding this comment

vvchernov commented Dec 27, 2023 •

edited

Loading