- Example: Markov Decision Processes solved with Value Iteration and Policy Iteration (in R)
- Example: Solving a Maze using RL (Value Iteration) (in R)
- Example: A Q-Learning Agent (in R)
- Connection to playing games (Chapter 5): Learning to Play Tic-Tac-Toe with Q-Learning implements a simple table-based Q-learning algorithm to play the game. (Python)
- Example: Solving a Maze using RL (Q-Learning) (in R)
These examples implement methods described in the book Reinforcement Learning: An Introduction by Sutton and Barto (2020).
- Example: Monte Carlo Control (in R)
- Example: TD Control with Sarsa, Q-Learning and Expected Sarsa (in R)
- R package: markovDP
- Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms.
- CleanRL is a Deep Reinforcement Learning library.
All code and documents in this repository is provided under Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License