Skip to content

Actions: huggingface/text-generation-inference

Server Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,459 workflow runs
2,459 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

CI for: fix crash in torch2.6 if TP=1
Server Tests #3618: Pull request #2898 opened by danieldk
January 10, 2025 15:01 7m 31s distributed-fix
January 10, 2025 15:01 7m 31s
CI for: chore: Update jsonschema to 0.28.0
Server Tests #3617: Pull request #2893 synchronize by danieldk
January 10, 2025 11:47 9m 0s update-jsonschema
January 10, 2025 11:47 9m 0s
Update to marlin-kernels 0.3.7
Server Tests #3616: Pull request #2882 synchronize by danieldk
January 9, 2025 15:29 8m 22s marlin-kernels-0.3.7
January 9, 2025 15:29 8m 22s
Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu
Server Tests #3615: Pull request #2815 synchronize by sywangyi
January 9, 2025 13:27 Action required sywangyi:flash_decoding
January 9, 2025 13:27 Action required
CI for: chore: Update jsonschema to 0.28.0
Server Tests #3613: Pull request #2893 opened by danieldk
January 9, 2025 09:16 4m 42s update-jsonschema
January 9, 2025 09:16 4m 42s
Basic flashinfer 0.2 support
Server Tests #3612: Pull request #2862 synchronize by danieldk
January 9, 2025 08:28 8m 9s flashinfer-0.2
January 9, 2025 08:28 8m 9s
Basic flashinfer 0.2 support
Server Tests #3611: Pull request #2862 synchronize by danieldk
January 8, 2025 14:50 8m 18s flashinfer-0.2
January 8, 2025 14:50 8m 18s
Improve vlm support (add idefics3 support)
Server Tests #3610: Pull request #2437 synchronize by drbh
January 8, 2025 13:49 6m 27s improve-vlm-support
January 8, 2025 13:49 6m 27s
Basic flashinfer 0.2 support
Server Tests #3609: Pull request #2862 synchronize by danieldk
January 8, 2025 10:06 8m 17s flashinfer-0.2
January 8, 2025 10:06 8m 17s
feat: improve qwen2-vl startup
Server Tests #3608: Pull request #2802 synchronize by drbh
January 7, 2025 22:36 6m 33s improve-qwen2-vl-warmup
January 7, 2025 22:36 6m 33s
Improve vlm support (add idefics3 support)
Server Tests #3607: Pull request #2437 synchronize by drbh
January 7, 2025 22:25 6m 28s improve-vlm-support
January 7, 2025 22:25 6m 28s
Improve vlm support (add idefics3 support)
Server Tests #3606: Pull request #2437 synchronize by drbh
January 7, 2025 22:19 5m 41s improve-vlm-support
January 7, 2025 22:19 5m 41s
Improve vlm support (add idefics3 support)
Server Tests #3605: Pull request #2437 synchronize by drbh
January 7, 2025 22:07 6m 21s improve-vlm-support
January 7, 2025 22:07 6m 21s
Improve vlm support (add idefics3 support)
Server Tests #3604: Pull request #2437 synchronize by drbh
January 7, 2025 22:05 1m 10s improve-vlm-support
January 7, 2025 22:05 1m 10s
Add fp8 kv cache for ROCm
Server Tests #3603: Pull request #2856 synchronize by mht-sharma
January 7, 2025 07:20 8m 17s fp8_kvcache_rocm
January 7, 2025 07:20 8m 17s
fix crash in torch2.6 if TP=1
Server Tests #3602: Pull request #2885 opened by sywangyi
January 7, 2025 06:01 Action required sywangyi:torch2.6_pg
January 7, 2025 06:01 Action required
feat: improve star coder to support multi lora layers
Server Tests #3601: Pull request #2883 opened by drbh
January 7, 2025 00:23 8m 27s startcoder-support-multi-lora
January 7, 2025 00:23 8m 27s
Basic flashinfer 0.2 support
Server Tests #3600: Pull request #2862 synchronize by danieldk
January 6, 2025 16:08 7m 7s flashinfer-0.2
January 6, 2025 16:08 7m 7s
Update to marlin-kernels 0.3.7
Server Tests #3599: Pull request #2882 opened by danieldk
January 6, 2025 16:03 8m 26s marlin-kernels-0.3.7
January 6, 2025 16:03 8m 26s
Enable qwen2vl video
Server Tests #3598: Pull request #2756 synchronize by drbh
January 3, 2025 16:01 8m 35s enable-qwen2vl-video
January 3, 2025 16:01 8m 35s
Add fp8 kv cache for ROCm
Server Tests #3597: Pull request #2856 synchronize by mht-sharma
January 3, 2025 11:58 8m 30s fp8_kvcache_rocm
January 3, 2025 11:58 8m 30s
Enable qwen2vl video
Server Tests #3595: Pull request #2756 synchronize by drbh
December 23, 2024 18:47 9m 2s enable-qwen2vl-video
December 23, 2024 18:47 9m 2s
Improve vlm support (add idefics3 support)
Server Tests #3594: Pull request #2437 synchronize by drbh
December 23, 2024 14:40 8m 46s improve-vlm-support
December 23, 2024 14:40 8m 46s
Basic flashinfer 0.2 support
Server Tests #3593: Pull request #2862 synchronize by danieldk
December 22, 2024 13:04 6m 51s flashinfer-0.2
December 22, 2024 13:04 6m 51s
Basic flashinfer 0.2 support
Server Tests #3592: Pull request #2862 opened by danieldk
December 22, 2024 12:24 9m 2s flashinfer-0.2
December 22, 2024 12:24 9m 2s