Skip to content

Commit

Permalink
results update; common hf pipeline
Browse files Browse the repository at this point in the history
  • Loading branch information
yuchenlin committed Jul 15, 2024
1 parent af24bf7 commit a5614a2
Show file tree
Hide file tree
Showing 14 changed files with 1,774 additions and 1,343 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -225,11 +225,11 @@ To analyze the correlation between WildBench (v2) and human evaluation, we consi
## Todos

### Models pending to test

- [ ] openchat/openchat-3.6-8b-20240522
- [ ] gemma-2
- [ ] SimPO-v0.2
- [ ] Qwen2-7B-Chat
- [ ] LLM360/K2-Chat
- [x] LLM360/K2-Chat
- [x] DeepSeek-V2-Code
- [x] Yi-large-preview
- [x] THUDM/glm-4-9b-chat
Expand Down
Loading

0 comments on commit a5614a2

Please sign in to comment.