tiny_stories_rl

Uses reinforcement learning to encourage roneneldan/TinyStories-33M to generate stories with alliteration

Docs are here

Installtion

If you install uv, it'll get the dependencies.

Backup plan: ./build.sh and ./run.sh will build and run a Docker container that has uv, in case your system is weird (like my NixOS laptop) and doesn't work with uv. Once you're in the container, you can run the commands in the following sections.

Usage

uv run src/tiny_stories_rl/train.py

The KL penalty coeffient is configurable via --kl-coefficient; see here for more.

Running tests

uv run pytest tests

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
docs		docs
src/tiny_stories_rl		src/tiny_stories_rl
tests		tests
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build.sh		build.sh
docker_name		docker_name
pyproject.toml		pyproject.toml
run.sh		run.sh
run_tb.sh		run_tb.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tiny_stories_rl

Installtion

Usage

Running tests

About

Releases

Packages

Languages

License

TheodoreEhrenborg/tiny_stories_rl

Folders and files

Latest commit

History

Repository files navigation

tiny_stories_rl

Installtion

Usage

Running tests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages