Skip to content

[Core][2/N] Model runner refactoring part 2. Combine prepare prefill / decode to a single API#4681

Merged
rkooo567 merged 49 commits intovllm-project:mainfrom rkooo567:model-runner-refactoring-coelsceMay 15, 2024

Commits

Commits on May 8, 2024

Commits on May 9, 2024

Commits on May 10, 2024

Commits on May 13, 2024

Commits on May 14, 2024

Commits on May 15, 2024