Name		Name	Last commit message	Last commit date
parent directory ..
_AIMA-4x3-gridworld_files		_AIMA-4x3-gridworld_files
figures		figures
.Rhistory		.Rhistory
MC-Control.html		MC-Control.html
MC-Control.qmd		MC-Control.qmd
MDP.html		MDP.html
MDP.qmd		MDP.qmd
QLearning.html		QLearning.html
QLearning.qmd		QLearning.qmd
README.md		README.md
RL-Maze.html		RL-Maze.html
RL-Maze.qmd		RL-Maze.qmd
RL.Rproj		RL.Rproj
TD-Control.html		TD-Control.html
TD-Control.qmd		TD-Control.qmd
_AIMA-4x3-gridworld.html		_AIMA-4x3-gridworld.html
_AIMA-4x3-gridworld.qmd		_AIMA-4x3-gridworld.qmd
tictactoe.py		tictactoe.py
tictactoe_RL.ipynb		tictactoe_RL.ipynb

README.md

Chapters 17 and 22: Reinforcement Learning and MDPs

Chapter 17: MDPs

Example: Markov Decision Processes solved with Value Iteration and Policy Iteration (in R)
Example: Solving a Maze using RL (Value Iteration) (in R)

Chapter 22: Reinforcement Learning

Example: A Q-Learning Agent (in R)
Connection to playing games (Chapter 5): Learning to Play Tic-Tac-Toe with Q-Learning implements a simple table-based Q-learning algorithm to play the game. (Python)
Example: Solving a Maze using RL (Q-Learning) (in R)

More on Reinforcement Learning

These examples implement methods described in the book Reinforcement Learning: An Introduction by Sutton and Barto (2020).

Example: Monte Carlo Control (in R)
Example: TD Control with Sarsa, Q-Learning and Expected Sarsa (in R)
R package: markovDP

Other Software (Python)

Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms.
CleanRL is a Deep Reinforcement Learning library.

License

All code and documents in this repository is provided under Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License