Josh Purtell JoshuaPurtell

💭

Working

AI Agent Research

41 followers · 104 following

Pinned

Apropos Public

A framework for rapidly building compound AI systems

Python 4

craftaxlm Public

A wrapper around the Craftax agent benchmark, for evaluating digital agents over extremely long time horizons

Python 1

LRCBench Public

Evals of language models' ability to reason over long contexts.

Python 8

SmallBench Public

Small, simple agent task environments for training and evaluation

Python 17

icl-bench Public

Evaluating Language Models' Ability to Learn In Context

Python

jazyk Public

Simple LM API for production

Python