Skip to content

Simple Reinforcement learning agent that learns balance a pole on cart by moving left or right.

Notifications You must be signed in to change notification settings

VivekChugh/cart_pole

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

I have put together this project over weekend.

Simple Reinforcement learning agent that learns balance a pole on cart by moving left or right.

  • It implements a reinforcement learning agent.
  • learning agent learns to take action (moving cart left or right) based to it prior experiences.
  • It samples batch of previous experiences from a replay memory.
  • It uses primitive two nural-nets, one as policy network and other target network.
  • Follows Epsilon greedy strategy to choose between exploration and exploitation at each time step
  • Updates weights and biases of target net from policy net every 10th episode

NOTE: while sampling experience batch from replay memory and creating state tensors/action tensors, I may have messed up their dimensions before of passing to policy net which causing program to break after 256 time steps in total (when main program accesses the policy net for first time with sampled batch of state-action pairs from replay memory). I'll will be working on it.

About

Simple Reinforcement learning agent that learns balance a pole on cart by moving left or right.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages