Skip to content

Actions: KuntaiDu/vllm

yapf

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
25 workflow runs
25 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

October 21, 2024 21:56 2m 32s
[torch.compile] generic decorators (#9258)
yapf #24: Commit e00c094 pushed by KuntaiDu
October 10, 2024 23:03 2m 40s main
October 10, 2024 23:03 2m 40s
[misc] add forward context for attention (#9029)
yapf #23: Commit 9aaf14c pushed by KuntaiDu
October 3, 2024 21:29 2m 40s main
October 3, 2024 21:29 2m 40s
[Core] CUDA Graphs for Multi-Step + Chunked-Prefill (#8645)
yapf #22: Commit afb050b pushed by KuntaiDu
October 2, 2024 21:15 2m 28s main
October 2, 2024 21:15 2m 28s
[Core] Priority-based scheduling in async engine (#8850)
yapf #21: Commit bd429f2 pushed by KuntaiDu
September 28, 2024 01:00 2m 23s main
September 28, 2024 01:00 2m 23s
[bugfix] [AMD] add multi-step advance_step to ROCmFlashAttentionMetad…
yapf #20: Commit 9e5ec35 pushed by KuntaiDu
September 20, 2024 04:54 2m 16s main
September 20, 2024 04:54 2m 16s
[Bugfix] [Encoder-Decoder] Bugfix for encoder specific metadata const…
yapf #19: Commit 3118f63 pushed by KuntaiDu
September 19, 2024 03:46 2m 16s main
September 19, 2024 03:46 2m 16s
[Bugfix][Model] Fix Python 3.8 compatibility in Pixtral model by upda…
yapf #18: Commit 3724d5f pushed by KuntaiDu
September 15, 2024 17:10 1m 58s main
September 15, 2024 17:10 1m 58s
[misc][ci] fix cpu test with plugins (#7489)
yapf #17: Commit ea49e6a pushed by KuntaiDu
August 14, 2024 03:14 1m 47s main
August 14, 2024 03:14 1m 47s
[CI/Build] Minor refactoring for vLLM assets (#7407)
yapf #16: Commit 86ab567 pushed by KuntaiDu
August 12, 2024 04:55 1m 43s main
August 12, 2024 04:55 1m 43s
[LoRA] ReplicatedLinear support LoRA (#7081)
yapf #15: Commit 99d7cab pushed by KuntaiDu
August 3, 2024 06:46 1m 42s main
August 3, 2024 06:46 1m 42s
July 27, 2024 21:20 1m 39s
[Bugfix] Add synchronize to prevent possible data race (#6788)
yapf #13: Commit 95db75d pushed by KuntaiDu
July 25, 2024 18:40 1m 32s main
July 25, 2024 18:40 1m 32s
[Bugfix] Fix illegal memory access in FP8 MoE kernel (#6382)
yapf #12: Commit 75f64d8 pushed by KuntaiDu
July 12, 2024 22:50 1m 32s main
July 12, 2024 22:50 1m 32s
[Doc] Reorganize Supported Models by Type (#6167)
yapf #11: Commit 175c43e pushed by KuntaiDu
July 6, 2024 07:31 1m 25s main
July 6, 2024 07:31 1m 25s
[Bugfix] Fix compute_logits in Jamba (#6093)
yapf #10: Commit 7cd2ebb pushed by KuntaiDu
July 3, 2024 07:34 1m 28s main
July 3, 2024 07:34 1m 28s
June 18, 2024 22:39 1m 14s
June 18, 2024 00:15 1m 30s
June 9, 2024 07:49 1m 22s
[Kernel] Retune Mixtral 8x22b configs for FP8 on H100 (#5294)
yapf #5: Commit abe855d pushed by KuntaiDu
June 6, 2024 19:21 1m 13s main
June 6, 2024 19:21 1m 13s
[Misc] Make Serving Benchmark More User-friendly (#5044)
yapf #4: Commit f17a1a8 pushed by KuntaiDu
May 27, 2024 21:38 1m 14s main
May 27, 2024 21:38 1m 14s
Remove marlin warning (#4918)
yapf #3: Commit da5a0b5 pushed by KuntaiDu
May 20, 2024 17:28 1m 12s main
May 20, 2024 17:28 1m 12s
[Bugfix] Add logs for all model dtype casting (#4717)
yapf #2: Commit be0c518 pushed by KuntaiDu
May 9, 2024 21:05 1m 6s main
May 9, 2024 21:05 1m 6s
May 8, 2024 21:57 1m 3s