Implementations of basic reinforcement learning (RL) algorithms:
- Policy Iteration in C++
- Value Iteration in C++
Both policy iteration and value iteration are implemented for the board shown below.
policy.out and value.out contain the output of running the executables policy and value, which are compiled from policy_iteration.cc and value_iteration.cc, respectively.
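
For readers new to the algorithms, here is a minimal sketch of value iteration. It is not the code in value_iteration.cc and does not use this repository's board. It assumes a hypothetical 4x4 gridworld with terminal cells in two opposite corners, a reward of -1 per move, and deterministic moves. The names kSize, is_terminal, next_state, and value_iteration are all made up for illustration.

```cpp
#include <algorithm>
#include <array>
#include <cmath>
#include <limits>
#include <vector>

// Hypothetical 4x4 gridworld (NOT the repository's board): states are numbered
// row by row, cells 0 and 15 are terminal, every move yields a reward of -1,
// and a move that would leave the grid keeps the agent where it is.
constexpr int kSize = 4;
constexpr int kStates = kSize * kSize;

bool is_terminal(int s) { return s == 0 || s == kStates - 1; }

// Deterministic transition. Actions: 0 = up, 1 = down, 2 = left, 3 = right.
int next_state(int s, int action) {
    static const std::array<int, 4> d_row = {-1, 1, 0, 0};
    static const std::array<int, 4> d_col = {0, 0, -1, 1};
    int row = s / kSize + d_row[action];
    int col = s % kSize + d_col[action];
    if (row < 0 || row >= kSize || col < 0 || col >= kSize) return s;
    return row * kSize + col;
}

// Value iteration: sweep over all states, applying the Bellman optimality
// backup V(s) <- max_a [ r + gamma * V(s') ] in place, until the largest
// change in a sweep is no greater than theta.
std::vector<double> value_iteration(double gamma = 1.0, double theta = 1e-9) {
    std::vector<double> v(kStates, 0.0);
    double delta;
    do {
        delta = 0.0;
        for (int s = 0; s < kStates; ++s) {
            if (is_terminal(s)) continue;  // terminal values stay at 0
            double best = std::numeric_limits<double>::lowest();
            for (int a = 0; a < 4; ++a) {
                best = std::max(best, -1.0 + gamma * v[next_state(s, a)]);
            }
            delta = std::max(delta, std::fabs(best - v[s]));
            v[s] = best;
        }
    } while (delta > theta);
    return v;
}
```

Calling `value_iteration()` from a `main` function returns one value per cell. Under the assumptions above, each value is the negative of the number of moves needed to reach the nearest terminal corner. Policy iteration differs in that it alternates between evaluating a fixed policy and making that policy greedy with respect to the resulting values.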