vaibkumr / RL_from_scratch Public

Notifications You must be signed in to change notification settings
Fork 6
Star 8

This is where I write RL related stuff from scratch

8 stars 6 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
model_dependent		model_dependent
model_independent		model_independent
README.md		README.md
random_walk.py		random_walk.py

Repository files navigation

Reinforcement learning

This is where I write various RL algorithms in python.
Will keep updating as I keep learning more.
Will try to document as well as I can

Currently my only source for learning is "Reinforcement Learning: An Introduction book by Andrew Barto and Richard S. Sutton"

Progress

random walk from Sutton and Barto's book
Model dependent
- Value iteration
- Policy iteration
Model independent
- Q-learning (TD0)
- SARSA (TD0)
- Expected SARSA (TD0)
- TD-lambda methods
- k-step SARSA

TODO

Gauss-Seidel Value iteration
Asynchronous Value iteration
More from here

About

This is where I write RL related stuff from scratch

Report repository

Releases

No releases published

Packages

No packages published

Languages