In Stockholm AI reinforcement learning(RL) group, we discuss RL algorithms and their applications. In order to understand the algorithms fully, we focus on the theoratical and practical details of each algorithm including classic ones and deep ones. This repo is for our internal communication purpose.
We implemented our algorithms using gym and keras and after which we tested the algorithms on the simple cartpole example. The repo is implemented for people to understand the algorithm rather then receiving a good performance on this specific problem. We try to write the code in a clear way.
python cartpole.py
- 0.2.2
- Implemented the Proximal Policy Optimization method
- 0.2.1
- Implemented Monte Carlo method
- 0.2.0
- Fixed SARSA initialization problem
- 0.1.1
- Implemented SARSA
- 0.1.0
- Implemented the tabular Q learning
- 0.0.1
- Work in progress
Alex Yuan Gao – @alexyuangao – [email protected]
Distributed under the MIT license. See LICENSE
for more information.