DDPG

Reimplementing DDPG from Continuous Control with Deep Reinforcement Learning based on OpenAI Gym and Tensorflow

It is still a problem to implement Batch Normalization on the critic network. However the actor network works well with Batch Normalization.

Some Mujoco environments are still unsolved on OpenAI Gym.

Some Evaluations

git clone https://github.com/songrotek/DDPG.git
cd DDPG
python gym_ddpg.py

If you want to change the Gym environment, change ENV_NAME in gym_ddpg.py.

If you want to change the Network type, change import in ddpg.py such as

from actor_network_bn import ActorNetwork
to
from actor_network import ActorNetwork

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE.txt		LICENSE.txt
README.md		README.md
actor_network.py		actor_network.py
actor_network_bn.py		actor_network_bn.py
critic_network.py		critic_network.py
critic_network_bn.py		critic_network_bn.py
ddpg.py		ddpg.py
filter_env.py		filter_env.py
gym_ddpg.py		gym_ddpg.py
ou_noise.py		ou_noise.py
replay_buffer.py		replay_buffer.py