Partial preemption for groups with multiple sequences #1312
causal_lm_cpp.yml
on: pull_request
cpp-multinomial-greedy_causal_lm-ubuntu: 9m 35s
cpp-greedy_causal_lm-windows: 18m 51s
cpp-beam_search_causal_lm-Qwen-7B-Chat: 11m 31s
cpp-beam_search_causal_lm-Qwen1_5-7B-Chat: 9m 10s
cpp-beam_search_causal_lm-Phi-2: 6m 10s
cpp-beam_search_causal_lm-notus-7b-v1: 8m 29s
cpp-speculative_decoding_lm-ubuntu: 14m 41s
cpp-prompt_lookup_decoding_lm-ubuntu: 7m 31s
cpp-Phi-1_5: 6m 52s
cpp-greedy_causal_lm-redpajama-3b-chat: 12m 33s
cpp-chat_sample-ubuntu: 9m 16s
Matrix: cpp-beam_search_causal_lm-ubuntu
Annotations: 13 warnings