This repository provides an introductory series on reinforcement learning (RL), walking through how to code different RL techniques.
A quick background review of RL is available here.
- Tutorial 1: Q-learning
- Tutorial 2: SARSA
- Tutorial 3: Exploring OpenAI gym
- Tutorial 4: Q-learning in OpenAI gym
- Tutorial 5: Deep Q-learning (DQN)
- Tutorial 6: Deep Convolutional Q-learning
- Tutorial 7: Reinforcement Learning with ROS and Gazebo
- Tutorial 8: Reinforcement Learning in DOOM (unfinished)
- Tutorial 9: Deep Deterministic Policy Gradients (DDPG)
- Tutorial 10: Guided Policy Search (GPS) (unfinished)
- Tutorial 11: A review of different AI techniques for RL (WIP)
- Tutorial 12: Reviewing Policy Gradient methods
- Tutorial 13: Continuous-state spaces with DQN (merged)
- Tutorial 14: Benchmarking RL techniques
- Tutorial 15: Reviewing Vanilla Policy Gradient (VPG) (failed miserably)
- Chris Watkins, Learning from Delayed Rewards, Cambridge, 1989 (thesis)
- Awesome Reinforcement Learning repository
- Reinforcement learning CS9417ML, School of Computer Science & Engineering, UNSW Sydney
- Reinforcement learning blog posts
- OpenAI gym docs
- Vincent Bons implementations
- David Silver's Deep Reinforcement Learning talk
- Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., & Zaremba, W. (2016). OpenAI Gym. arXiv preprint arXiv:1606.01540.