Reinforcement_Learning

Implementation of basic RL algorithms:

Policy iteration and value iteration are implemented for the board below.

policy.out and value.out contain the output of running the executables policy and value, which are compiled from policy_iteration.cc and value_iteration.cc, respectively.
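
For readers who want a quick picture of the two algorithms, here is a minimal sketch in C++. It is not the code from policy_iteration.cc or value_iteration.cc. It assumes a hypothetical 4x4 board with terminal cells in two opposite corners, a reward of -1 per move, and deterministic moves that leave the agent in place when they would go off the board. The names IsTerminal, NextState, Backup, ValueIteration and PolicyIteration, and the theta and gamma parameters, are illustrative choices only.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cmath>
#include <limits>
#include <vector>

// Hypothetical 4x4 board (an assumption, not the repository's board):
// cells 0 and 15 are terminal, every move yields reward -1, and a move
// that would leave the board keeps the agent where it is.
constexpr int kSize = 4;
constexpr int kStates = kSize * kSize;
constexpr int kActions = 4;
constexpr std::array<int, kActions> kDRow = {-1, 1, 0, 0};  // up, down, left, right
constexpr std::array<int, kActions> kDCol = {0, 0, -1, 1};

bool IsTerminal(int s) { return s == 0 || s == kStates - 1; }

// Deterministic transition: successor of taking action a in state s.
int NextState(int s, int a) {
  const int r = s / kSize + kDRow[a];
  const int c = s % kSize + kDCol[a];
  if (r < 0 || r >= kSize || c < 0 || c >= kSize) return s;
  return r * kSize + c;
}

// One-step lookahead: reward plus discounted value of the successor.
double Backup(const std::vector<double>& v, int s, int a, double gamma) {
  return -1.0 + gamma * v[NextState(s, a)];
}

// Value iteration: repeat Bellman optimality backups until the largest
// change in a sweep drops below theta.
std::vector<double> ValueIteration(double gamma, double theta = 1e-9) {
  assert(gamma > 0.0 && gamma < 1.0);
  std::vector<double> v(kStates, 0.0);
  double delta;
  do {
    delta = 0.0;
    for (int s = 0; s < kStates; ++s) {
      if (IsTerminal(s)) continue;
      double best = -std::numeric_limits<double>::infinity();
      for (int a = 0; a < kActions; ++a) best = std::max(best, Backup(v, s, a, gamma));
      delta = std::max(delta, std::fabs(best - v[s]));
      v[s] = best;
    }
  } while (delta >= theta);
  return v;
}

// Policy iteration: alternate iterative policy evaluation and greedy
// policy improvement until no action changes.
std::vector<double> PolicyIteration(double gamma, std::vector<int>* policy_out = nullptr,
                                    double theta = 1e-9) {
  assert(gamma > 0.0 && gamma < 1.0);  // gamma < 1 keeps evaluation finite
  std::vector<double> v(kStates, 0.0);
  std::vector<int> pi(kStates, 0);  // initial policy: always move up
  bool stable = false;
  while (!stable) {
    double delta;
    do {  // policy evaluation for the current policy
      delta = 0.0;
      for (int s = 0; s < kStates; ++s) {
        if (IsTerminal(s)) continue;
        const double updated = Backup(v, s, pi[s], gamma);
        delta = std::max(delta, std::fabs(updated - v[s]));
        v[s] = updated;
      }
    } while (delta >= theta);
    stable = true;
    for (int s = 0; s < kStates; ++s) {  // policy improvement
      if (IsTerminal(s)) continue;
      double best_q = Backup(v, s, pi[s], gamma);
      for (int a = 0; a < kActions; ++a) {
        const double q = Backup(v, s, a, gamma);
        if (q > best_q + 1e-6) {  // switch only on a clear improvement
          best_q = q;
          pi[s] = a;
          stable = false;
        }
      }
    }
  }
  if (policy_out != nullptr) *policy_out = pi;
  return v;
}
```

Calling ValueIteration(0.9) and PolicyIteration(0.9) on this assumed board should give the same state values up to the tolerance theta. Swapping in the actual board would mean replacing IsTerminal, NextState and the reward inside Backup.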
