Skip to content

PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)

Notifications You must be signed in to change notification settings

flxh/pytorch-rdpg

 
 

Repository files navigation

Recurrent Deterministic Policy Gradient (RDPG)

Overview

PyTorch implementation of Recurrent Deterministic Policy Gradient from the paper Memory-based control with recurrent neural networks

Run

  • Training:

    • Pendulum-v0
    $ python main.py --env Pendulum-v0 --max_episode_length 1000 --trajectory_length 10 --debug
  • Testing (TODO)

References:

About

PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%