Implemented DQN on a self balancing simulated bot using pybullet and openai gym, baselines. The environment is made using ROS URDF and DQN is an off-policy reinforcement learning algorithm which can be used for discrete-action space.
How to run:
- Download the zip
- Create a conda environment and run "pip install -e ."
- To run : python balancebot_task.py