[Feature] Run OpenAI compatible API server for eval harness #30

PicoCreator · 2023-09-14T02:10:21Z

RWKV-infctx-trainer uses its own inference code within its own codebase here:

https://github.com/RWKV/RWKV-infctx-trainer/blob/main/RWKV-v5/dragon_test.py

This intentionally avoid the rwkv python inference project, as it is meant to be usable for testing/debugging architecture changes (which we may not port over to the inference project). While being terrible unoptimized (its ok!)

For example in rwkv-x-playground branch : https://github.com/RWKV/RWKV-infctx-trainer/tree/rwkv-x-playground
Every single v5XYZ is an experiment, some which will be abandoned.

What we need is to integrate this for evals, as such the following is recommended

creating a script to setup an openAI server
example notebook to run humaneval , eleutherAI evals, or AGIEval
related projects: https://github.com/xiaol/rwkv-hf-lm-evaluation-harness/tree/big-refactor

Ideally we should have options to run only subset of the evals, so we can "fan out the eval runs" across multiple nodes if needed via github CI down the line.

PicoCreator · 2023-09-14T02:12:23Z

The long term goal, is to allow all the top harness to be automatically ranned against a trained model in a CI pipeline

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Run OpenAI compatible API server for eval harness #30

[Feature] Run OpenAI compatible API server for eval harness #30

PicoCreator commented Sep 14, 2023 •

edited

Loading

PicoCreator commented Sep 14, 2023

[Feature] Run OpenAI compatible API server for eval harness #30

[Feature] Run OpenAI compatible API server for eval harness #30

Comments

PicoCreator commented Sep 14, 2023 • edited Loading

PicoCreator commented Sep 14, 2023

PicoCreator commented Sep 14, 2023 •

edited

Loading