Skip to content

Actions: intel-analytics/ipex-llm

LLM Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
6,094 workflow runs
6,094 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add experimental ov backend to NPU model
LLM Unit Tests #6642: Pull request #11383 synchronize by yangw1234
June 21, 2024 05:36 7m 33s yangw1234:ov_npu
June 21, 2024 05:36 7m 33s
Fix vLLM CPU api_server params (#11384)
LLM Unit Tests #6641: Commit b30bf76 pushed by xiangyuT
June 21, 2024 05:00 1h 34m 51s main
June 21, 2024 05:00 1h 34m 51s
Add GLM-4V example (#11343)
LLM Unit Tests #6640: Commit 21fc781 pushed by Oscilloscope98
June 21, 2024 04:54 1h 17m 6s main
June 21, 2024 04:54 1h 17m 6s
add glm-4v-9b example
LLM Unit Tests #6639: Pull request #11343 synchronize by JinBridger
June 21, 2024 02:44 3h 2m 55s glm-4v-9b
June 21, 2024 02:44 3h 2m 55s
add glm-4v-9b example
LLM Unit Tests #6638: Pull request #11343 synchronize by JinBridger
June 21, 2024 02:25 19m 42s glm-4v-9b
June 21, 2024 02:25 19m 42s
Support PP inference for chatglm3 (#11375)
LLM Unit Tests #6637: Commit 4ba8219 pushed by plusbang
June 21, 2024 01:59 3h 23m 22s main
June 21, 2024 01:59 3h 23m 22s
LLM: Refactor Pipeline-Parallel-FastAPI example
LLM Unit Tests #6636: Pull request #11319 synchronize by xiangyuT
June 21, 2024 01:37 3h 21m 37s xiangyuT:pp_example_merge_0614
June 21, 2024 01:37 3h 21m 37s
LLM: Refactor Pipeline-Parallel-FastAPI example
LLM Unit Tests #6635: Pull request #11319 synchronize by xiangyuT
June 21, 2024 01:36 5m 37s xiangyuT:pp_example_merge_0614
June 21, 2024 01:36 5m 37s
Fix vLLM CPU api_server params
LLM Unit Tests #6634: Pull request #11384 opened by xiangyuT
June 21, 2024 00:59 3h 34m 59s xiangyuT:fix_vllm_cpu_entrypoint_0621
June 21, 2024 00:59 3h 34m 59s
Fix 1383 Llama model on transformers=4.41[WIP]
LLM Unit Tests #6633: Pull request #11280 synchronize by songhappy
June 21, 2024 00:41 3h 29m 17s songhappy:fix_1383
June 21, 2024 00:41 3h 29m 17s
Fix 1383 Llama model on transformers=4.41[WIP]
LLM Unit Tests #6632: Pull request #11280 synchronize by songhappy
June 21, 2024 00:38 3m 45s songhappy:fix_1383
June 21, 2024 00:38 3m 45s
Fix 1383 Llama model on transformers=4.41[WIP]
LLM Unit Tests #6631: Pull request #11280 synchronize by songhappy
June 21, 2024 00:33 4m 44s songhappy:fix_1383
June 21, 2024 00:33 4m 44s
Add experimental ov backend to NPU model
LLM Unit Tests #6630: Pull request #11383 synchronize by yangw1234
June 20, 2024 22:08 5h 38m 30s yangw1234:ov_npu
June 20, 2024 22:08 5h 38m 30s
Add experimental ov backend to NPU model
LLM Unit Tests #6629: Pull request #11383 synchronize by yangw1234
June 20, 2024 21:04 1h 4m 11s yangw1234:ov_npu
June 20, 2024 21:04 1h 4m 11s
Add experimental ov backend to NPU model
LLM Unit Tests #6628: Pull request #11383 opened by yangw1234
June 20, 2024 21:02 2m 22s yangw1234:ov_npu
June 20, 2024 21:02 2m 22s
Fix LLAVA example on CPU
LLM Unit Tests #6627: Pull request #11271 synchronize by jenniew
June 20, 2024 18:22 24m 7s jenniew:fix_llava
June 20, 2024 18:22 24m 7s
Fix LLAVA example on CPU
LLM Unit Tests #6626: Pull request #11271 synchronize by jenniew
June 20, 2024 18:20 2m 27s jenniew:fix_llava
June 20, 2024 18:20 2m 27s
Optimize qwen 1.5 14B batch performance (#11370)
LLM Unit Tests #6623: Commit f0fdfa0 pushed by MeouSker77
June 20, 2024 09:23 15h 57m 48s main
June 20, 2024 09:23 15h 57m 48s
Add more examples for pipeline parallel inference
LLM Unit Tests #6622: Pull request #11372 synchronize by sgwhat
June 20, 2024 09:14 15h 43m 11s sgwhat:pp_add_examples
June 20, 2024 09:14 15h 43m 11s
Support PP inference for chatglm3
LLM Unit Tests #6621: Pull request #11375 synchronize by plusbang
June 20, 2024 08:11 16h 3m 51s plusbang:support-glm3-pp
June 20, 2024 08:11 16h 3m 51s
Support PP inference for chatglm3
LLM Unit Tests #6620: Pull request #11375 synchronize by plusbang
June 20, 2024 07:23 31m 35s plusbang:support-glm3-pp
June 20, 2024 07:23 31m 35s
Support PP inference for chatglm3
LLM Unit Tests #6619: Pull request #11375 opened by plusbang
June 20, 2024 07:16 8m 19s plusbang:support-glm3-pp
June 20, 2024 07:16 8m 19s
Add qwen-moe batch1 to nightly perf (#11369)
LLM Unit Tests #6618: Commit c0e86c5 pushed by hkvision
June 20, 2024 06:17 52m 52s main
June 20, 2024 06:17 52m 52s
Add more examples for pipeline parallel inference
LLM Unit Tests #6616: Pull request #11372 opened by sgwhat
June 20, 2024 06:10 36m 16s sgwhat:pp_add_examples
June 20, 2024 06:10 36m 16s