Skip to content

Actions: huggingface/text-generation-inference

Automatic Documentation for Launcher

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,911 workflow runs
1,911 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Use GPTQ-Marlin for supported GPTQ configurations
Automatic Documentation for Launcher #113: Pull request #2111 synchronize by danieldk
June 27, 2024 07:53 1m 21s feature/use-gptq-marlin-for-gptq
June 27, 2024 07:53 1m 21s
Fixing prom leak by upgrading.
Automatic Documentation for Launcher #112: Pull request #2129 opened by Narsil
June 27, 2024 06:05 1m 16s fix_prom_leak
June 27, 2024 06:05 1m 16s
feat: use model name as adapter id in chat endpoints
Automatic Documentation for Launcher #111: Pull request #2128 opened by drbh
June 26, 2024 23:25 1m 20s enable-adapter-id-in-chat
June 26, 2024 23:25 1m 20s
fix: prefer serde structs over custom functions
Automatic Documentation for Launcher #110: Pull request #2127 opened by drbh
June 26, 2024 22:57 1m 22s prefer-chat-object-enum
June 26, 2024 22:57 1m 22s
Fixing AMD CI
Automatic Documentation for Launcher #109: Pull request #2109 synchronize by fxmarty
June 26, 2024 13:57 1m 17s ci_amd3
June 26, 2024 13:57 1m 17s
Using new cache.
Automatic Documentation for Launcher #108: Pull request #2125 opened by Narsil
June 26, 2024 13:22 1m 17s ci2
ci2
June 26, 2024 13:22 1m 17s
Ci test
Automatic Documentation for Launcher #107: Pull request #2124 opened by glegendre01
June 26, 2024 12:25 1m 17s ci-test
June 26, 2024 12:25 1m 17s
Fixing AMD CI
Automatic Documentation for Launcher #106: Pull request #2109 synchronize by fxmarty
June 26, 2024 10:58 1m 17s ci_amd3
June 26, 2024 10:58 1m 17s
Fixing AMD CI
Automatic Documentation for Launcher #105: Pull request #2109 synchronize by fxmarty
June 26, 2024 10:44 1m 23s ci_amd3
June 26, 2024 10:44 1m 23s
Fixing AMD CI
Automatic Documentation for Launcher #104: Pull request #2109 synchronize by fxmarty
June 26, 2024 10:08 1m 20s ci_amd3
June 26, 2024 10:08 1m 20s
Use symmetric quantization in the quantize subcommand
Automatic Documentation for Launcher #102: Pull request #2120 opened by danieldk
June 26, 2024 07:57 1m 16s bugfix/quantize-use-sym
June 26, 2024 07:57 1m 16s
fix: simplify kserve endpoint and fix imports
Automatic Documentation for Launcher #101: Pull request #2119 opened by drbh
June 25, 2024 21:17 1m 19s kserve-interface-and-update-adjustments
June 25, 2024 21:17 1m 19s
Enable multiple LoRa adapters
Automatic Documentation for Launcher #100: Pull request #2010 synchronize by drbh
June 25, 2024 16:23 1m 17s lora-internal
June 25, 2024 16:23 1m 17s
Fix CI .
Automatic Documentation for Launcher #99: Pull request #2118 opened by Narsil
June 25, 2024 15:28 1m 26s fix_ci
June 25, 2024 15:28 1m 26s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #98: Pull request #1940 synchronize by Narsil
June 25, 2024 15:07 1m 21s flashdecoding
June 25, 2024 15:07 1m 21s
Support AWQ quantization with bias
Automatic Documentation for Launcher #97: Pull request #2117 synchronize by danieldk
June 25, 2024 15:02 1m 17s bugfix/awq-with-bias
June 25, 2024 15:02 1m 17s
Support AWQ quantization with bias
Automatic Documentation for Launcher #96: Pull request #2117 opened by danieldk
June 25, 2024 14:59 1m 17s bugfix/awq-with-bias
June 25, 2024 14:59 1m 17s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #95: Pull request #1940 synchronize by Narsil
June 25, 2024 14:49 1m 16s flashdecoding
June 25, 2024 14:49 1m 16s
Fix nccl regression on PyTorch 2.3 upgrade
Automatic Documentation for Launcher #93: Pull request #2099 synchronize by fxmarty
June 25, 2024 14:28 1m 16s fix-nccl-regression
June 25, 2024 14:28 1m 16s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #92: Pull request #1940 synchronize by Narsil
June 25, 2024 14:24 1m 18s flashdecoding
June 25, 2024 14:24 1m 18s
Add pytest release marker
Automatic Documentation for Launcher #91: Pull request #2114 synchronize by danieldk
June 25, 2024 13:32 1m 16s ci/release-tests
June 25, 2024 13:32 1m 16s
Idefics2: sync added image tokens with transformers
Automatic Documentation for Launcher #90: Pull request #2080 synchronize by danieldk
June 25, 2024 13:13 1m 26s bugfix/idefics2-no-image-splitting
June 25, 2024 13:13 1m 26s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #89: Pull request #1940 synchronize by Narsil
June 25, 2024 13:10 1m 20s flashdecoding
June 25, 2024 13:10 1m 20s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #88: Pull request #1940 synchronize by Narsil
June 25, 2024 12:24 1m 19s flashdecoding
June 25, 2024 12:24 1m 19s
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #87: Pull request #1940 synchronize by Narsil
June 25, 2024 12:21 1m 20s flashdecoding
June 25, 2024 12:21 1m 20s
ProTip! You can narrow down the results and go further in time using created:<2024-06-25 or the other filters available.