Actions: bittersweet1999/opencompass

deploy

273 workflow runs

[Feature] Add S3Eval Dataset (#916)
deploy #173: Commit 862044f pushed by bittersweet1999
May 8, 2024 06:27 2s main
fix cmb dataset
deploy #172: Commit e113256 pushed by bittersweet1999
April 28, 2024 14:54 2s fix_cmb
fix prompt template
deploy #170: Commit 1659965 pushed by bittersweet1999
April 28, 2024 13:46 2s fix_flames
adapt to lmdeploy v0.4.0 (#1073)
deploy #169: Commit 1013dce pushed by bittersweet1999
April 28, 2024 13:12 2s main
support arenahard
deploy #168: Commit 86e745d pushed by bittersweet1999
April 26, 2024 07:12 2s arena_hard
support math llm judge
deploy #167: Commit 00b8d63 pushed by bittersweet1999
April 26, 2024 06:43 3s math_openai
support math llm judge
deploy #166: Commit 398411e pushed by bittersweet1999
April 26, 2024 06:41 2s math_openai
support arenahard
deploy #165: Commit 8db3ede pushed by bittersweet1999
April 26, 2024 06:17 2s arena_hard
support arenahard
deploy #164: Commit 5f5b1e0 pushed by bittersweet1999
April 26, 2024 06:03 2s arena_hard
support openai math evaluation
deploy #163: Commit 8ba874c pushed by bittersweet1999
April 25, 2024 15:48 2s math_openai
Add humaneval prompt from simple_evals, openai (#1076)
deploy #162: Commit 41196c4 pushed by bittersweet1999
April 25, 2024 10:54 3s main
[Feature] Add TheoremQA with 5-shot (#1048)
deploy #161: Commit 004ed79 pushed by bittersweet1999
April 23, 2024 02:15 2s main
fix
deploy #160: Commit 35c8965 pushed by bittersweet1999
April 16, 2024 09:49 2s duoxiu
fix multiround
deploy #159: Commit 7e992ba pushed by bittersweet1999
April 12, 2024 07:31 2s duoxiu
[Fix] Update setup.py install_requires (#1036)
deploy #158: Commit bd7c11b pushed by bittersweet1999
April 11, 2024 07:20 3s main
April 2, 2024 07:36 2s
support multi-judge-model
deploy #156: Commit 9383258 pushed by bittersweet1999
April 2, 2024 03:38 3s new_moe
add moe judge
deploy #155: Commit 06e12e6 pushed by bittersweet1999
April 1, 2024 17:27 2s new_moe
[Feature] Support AlpacaEval_V2 (#1006)
deploy #154: Commit 02e7eec pushed by bittersweet1999
April 1, 2024 10:25 3s main
update docs
deploy #153: Commit d8b568a pushed by bittersweet1999
March 28, 2024 08:29 1s alpaca_kaifa
update docs
deploy #152: Commit a621ffb pushed by bittersweet1999
March 28, 2024 08:19 1s alpaca_kaifa
support alpacaeval
deploy #151: Commit a70aa0b pushed by bittersweet1999
March 28, 2024 08:10 2s alpaca_kaifa
support alpacaeval_v2
deploy #150: Commit f265aca pushed by bittersweet1999
March 27, 2024 16:03 2s alpaca_kaifa
[Feature] update needlebench and configs (#986)
deploy #149: Commit 0a6a03f pushed by bittersweet1999
March 26, 2024 02:32 2s main