Add speculative decoding params to lm_bench #4630
Triggered via pull request
November 20, 2024 05:51
Status
Success
Total duration
41m 37s
Artifacts
–
causal_lm_cpp.yml
on: pull_request
Matrix: cpp-beam_search_causal_lm-ubuntu
cpp-multinomial-greedy_causal_lm-ubuntu
17m 37s
cpp-greedy_causal_lm-windows
26m 6s
cpp-greedy_causal_lm-Qwen-7B-Chat
10m 22s
cpp-beam_search_causal_lm-Qwen1_5-7B-Chat
14m 19s
cpp-beam_search_causal_lm-Phi-2
10m 13s
cpp-beam_search_causal_lm-notus-7b-v1
31m 5s
cpp-speculative_decoding_lm-ubuntu
12m 33s
cpp-prompt_lookup_decoding_lm-ubuntu
12m 37s
cpp-Phi-1_5
7m 39s
cpp-greedy_causal_lm-redpajama-3b-chat
14m 31s
cpp-chat_sample-ubuntu
14m 5s
visual_language_chat_sample-ubuntu-minicpm_v2_6
6m 46s
visual_language_chat_sample-ubuntu-llava_1_5
/
visual_language_chat_sample-ubuntu-llava
13m 35s
visual_language_chat_sample-ubuntu-llava_next
/
visual_language_chat_sample-ubuntu-llava
35m 12s
visual_language_chat_sample-ubuntu-internvl2
40m 19s
cpp-continuous-batching-ubuntu
14m 58s
cpp-continuous-batching-windows
23m 0s
cpp-continuous-batching-macos
21m 54s
ci/gha_overall_status_causal_lm
0s
Annotations
20 warnings