Actions: zhouyu5/vllm-fork

clang-format

10 workflow runs

[BUGFIX]fix FP8 failing issue on habana_main [PatchedVLLMKVCache fwd …
clang-format #10: Commit c79982d pushed by zhouyu5
November 19, 2024 00:27 15s habana_main
Create scorecard.yml (#431)
clang-format #9: Commit 6643aa6 pushed by zhouyu5
November 3, 2024 23:56 41s habana_main
Fix default value for FSDPA (#448)
clang-format #8: Commit 94858b5 pushed by zhouyu5
October 30, 2024 13:21 25s habana_main
HPU: offload logits processing to CPU (#358)
clang-format #7: Commit 3203bd9 pushed by zhouyu5
October 29, 2024 07:27 19s habana_main
Remove redundant set_active_loras call during warmup (#413)
clang-format #6: Commit 3af4b6c pushed by zhouyu5
October 23, 2024 07:05 14s habana_main
Remove CPU sync before Sampler (#414)
clang-format #5: Commit 0cf5261 pushed by zhouyu5
October 22, 2024 11:44 49s habana_main
Make weights_load_device not change EngineArgs.create_load_config() (…
clang-format #4: Commit 9111a80 pushed by zhouyu5
September 25, 2024 02:16 54s habana_main
Compile mode bug fix for LoRA (#196)
clang-format #3: Commit fdf3fd8 pushed by zhouyu5
August 22, 2024 08:17 50s habana_main
split gptbigcode forward (#194)
clang-format #2: Commit f7dd91d pushed by zhouyu5
August 19, 2024 13:57 15s habana_main
Fix logger initialization in ops.py (#178)
clang-format #1: Commit c098433 pushed by zhouyu5
August 14, 2024 06:56 31s habana_main