v0.2.0 - baseline reinforcement learning algorithms Milestone · GitHub

New issue

v0.2.0 - baseline reinforcement learning algorithms

No due date 0% complete

(envs) Possibly split reinforce into third-party dependencies (ie: -envs-gym, -envs-atari)
(envs) Introduce some kind of History monad to cleanly seperate debug logging from agent or environment reporting
(algos) write out more baseline algorithms in reinforce-algorithms (in progress)
(algos) modify project structure to so that reinforce and reinforce-al…

(envs) Possibly split reinforce into third-party dependencies (ie: -envs-gym, -envs-atari)
(envs) Introduce some kind of History monad to cleanly seperate debug logging from agent or environment reporting
(algos) write out more baseline algorithms in reinforce-algorithms (in progress)
(algos) modify project structure to so that reinforce and reinforce-algorithms are siblings

No hackage release with this. This would basically prototype a reinforce-algorithms project.