Actions: intel-analytics/ipex-llm

LLM Unit Tests

6,094 workflow runs

Add precision option in PP inference examples (#11440)
LLM Unit Tests #6690: Commit 508c364 pushed by plusbang
June 27, 2024 01:24 3m 13s main
optimize qwen2 gpu memory usage again (#11435)
LLM Unit Tests #6687: Commit 2a0f808 pushed by MeouSker77
June 26, 2024 08:52 31m 19s main
FIX: Qwen1.5-GPTQ-Int4 inference error (#11432)
LLM Unit Tests #6686: Commit ab9f7f3 pushed by liu-shaojun
June 26, 2024 07:36 31m 31s main
Fix error while using pipeline parallism (#11434)
LLM Unit Tests #6685: Commit 99cd16e pushed by gc-fu
June 26, 2024 07:33 27m 41s main
Fix error while using pipeline parallism
LLM Unit Tests #6683: Pull request #11434 opened by gc-fu
June 26, 2024 06:15 34m 35s gc-fu:fix-vllm-tp-enabling
FIX: Qwen1.5-GPTQ-Int4 inference error
LLM Unit Tests #6682: Pull request #11432 synchronize by liu-shaojun
June 26, 2024 05:15 27m 52s liu-shaojun:qwen2
Fix LLAVA example on CPU (#11271)
LLM Unit Tests #6681: Commit 40fa235 pushed by jenniew
June 26, 2024 03:05 36m 35s main
optimize npu llama perf again (#11431)
LLM Unit Tests #6680: Commit ca0e69c pushed by MeouSker77
June 26, 2024 02:52 6m 44s main
FIX: Qwen1.5-GPTQ-Int4 inference error
LLM Unit Tests #6679: Pull request #11432 synchronize by liu-shaojun
June 26, 2024 02:50 31m 57s liu-shaojun:qwen2
FIX: Qwen1.5-GPTQ-Int4 inference error
LLM Unit Tests #6678: Pull request #11432 opened by liu-shaojun
June 26, 2024 02:35 15m 30s liu-shaojun:qwen2
optimize npu llama perf again
LLM Unit Tests #6677: Pull request #11431 opened by MeouSker77
June 26, 2024 02:18 30m 50s MeouSker77:optimize-npu-llama-again
optimize llama npu perf (#11426)
LLM Unit Tests #6675: Commit 9f6e5b4 pushed by MeouSker77
June 25, 2024 09:43 45m 1s main
optimize llama npu perf
LLM Unit Tests #6674: Pull request #11426 opened by MeouSker77
June 25, 2024 09:37 31m 50s MeouSker77:optimize-llama-npu-perf
Add more qwen1.5 and qwen2 support for pipeline parallel inference (#…
LLM Unit Tests #6673: Commit e473b8d pushed by plusbang
June 25, 2024 07:49 31m 38s main
Fix shape error when run qwen1.5-14b using deepspeed autotp (#11420)
LLM Unit Tests #6671: Commit aacc1fd pushed by plusbang
June 25, 2024 05:48 3m 34s main
update npu examples (#11422)
LLM Unit Tests #6670: Commit 3b23de6 pushed by MeouSker77
June 25, 2024 05:32 8m 12s main
LLM: Refactor Pipeline-Parallel-FastAPI example (#11319)
LLM Unit Tests #6669: Commit 8ddae22 pushed by xiangyuT
June 25, 2024 05:30 51m 36s main
update npu examples
LLM Unit Tests #6668: Pull request #11422 opened by MeouSker77
June 25, 2024 05:30 28m 56s MeouSker77:update-npu-example