Experiments and implementations of various reinforcement learning algorithms. References Sutton & Barto.
An experiment to demonstrate the difficulties that sample-average methods have for non-stationary problems.
Sutton, Richard S., and Andrew G. Barto. Reinforcement learning: An introduction