Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Revert "[BugFix] Fix tokenizer out of vocab size" #3740

Closed
wants to merge 1 commit into from

Conversation

simon-mo
Copy link
Collaborator

Reverts #3685

This impacted the performance of vLLM.

this commit cause the L4 opt benchmark to drop performance
before: https://buildkite.com/vllm/ci/builds/3695
after: https://buildkite.com/vllm/ci/builds/3716

before
Avg latency: 0.446163778666687 seconds
Throughput: 21.25 requests/s, 10879.19 tokens/s

after
Avg latency: 4.369133323666698 seconds
Throughput: 0.93 requests/s, 474.10 tokens/s

@simon-mo simon-mo enabled auto-merge (squash) March 30, 2024 00:35
@WoosukKwon WoosukKwon disabled auto-merge March 30, 2024 00:43
@simon-mo
Copy link
Collaborator Author

Replaced by #3741

@simon-mo simon-mo closed this Mar 30, 2024
@youkaichao youkaichao deleted the revert-3685-fix-tokenizer branch March 30, 2024 02:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant