Generative Artificial Intelligence Research Lab (GAIR)
Pinned Loading
Repositories
Showing 10 of 28 repositories
- math-evaluation-harness Public Forked from ZubinGou/math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
GAIR-NLP/math-evaluation-harness’s past year of commit activity - OlympicArena Public
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
GAIR-NLP/OlympicArena’s past year of commit activity - weak-to-strong-reasoning Public
GAIR-NLP/weak-to-strong-reasoning’s past year of commit activity