Skip to content

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Notifications You must be signed in to change notification settings

tgangwani/RL-Indirect-imitation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repo contains code for our paper State-only Imitation with Transition Dynamics Mismatch published at ICLR 2020.

The code heavily uses the RL machinery from this awesome repository with RL algorithms implemented in PyTorch. We also use some functionality from OpenAI baselines. The code was tested with the following packages:

  • python 3.6.6
  • pytorch 0.4.1
  • gym 0.10.8

Running command

To run MuJoCo experiments, use the command below. Update the path to the directory with the expert demonstrations. The demonstrations used in our experiments can be downloaded from this Google drive link. Edit default_config.yaml to change the hyperparameters.

python main.py --env-name "Hopper-v2" --config-file "default_config.yaml" --experts-dir <add-path-here> --seed=$RANDOM

Credits

  1. ikostrikov/pytorch-a2c-ppo-acktr-gail
  2. OpenAI baselines

About

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published