This is the PyTorch implementation of Simplified Temporal Consistency Reinforcement Learning (TCRL).
TCRL shows that a simple representation learning approach, relying only on a latent dynamics model trained with latent temporal consistency, is sufficient for high-performance RL. This holds both when planning purely with a dynamics model conditioned on the representation and when using the representation as policy and value function features in model-free RL. In experiments, our approach learns an accurate dynamics model that solves challenging high-dimensional locomotion tasks with online planners while training 4.1× faster than ensemble-based methods. With model-free RL without planning, especially on high-dimensional tasks such as the DeepMind Control Suite Humanoid and Dog tasks, our approach outperforms model-free methods by a large margin and matches model-based methods' sample efficiency while training 2.4× faster.
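To make the core idea concrete, here is a minimal sketch of a one-step latent temporal-consistency objective: a dynamics model's prediction in latent space is pulled toward an EMA target encoder's embedding of the next observation. The module names (`enc`, `dyn`, `enc_tgt`) are illustrative assumptions, not this repo's exact API, and the paper applies the loss over multi-step latent rollouts rather than a single step.

```python
import torch
import torch.nn.functional as F

def temporal_consistency_loss(enc, dyn, enc_tgt, obs, action, next_obs):
    """One-step latent consistency sketch (illustrative, not the repo's code):
    the latent dynamics prediction should match the target encoder's
    embedding of the next observation."""
    z = enc(obs)                      # online latent state
    z_pred = dyn(z, action)          # predicted next latent
    with torch.no_grad():
        z_next = enc_tgt(next_obs)   # EMA-target latent of the next observation
    # Cosine-similarity consistency loss on normalized latents
    z_pred = F.normalize(z_pred, dim=-1)
    z_next = F.normalize(z_next, dim=-1)
    return -(z_pred * z_next).sum(dim=-1).mean()
```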
conda env create -f environment.yaml
conda activate tcrl
python main.py task=walker_walk
To log metrics with wandb:
python main.py task=walker_walk use_wandb=true
You can also save videos, the replay buffer, the trained model, and logs by setting save_<video/buffer/model/logging>=true. All tested tasks are listed in cfgs/task
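For example, to save evaluation videos and the trained model on the walker_walk task:
python main.py task=walker_walk save_video=true save_model=true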
Results are saved in results/tcrl.csv. The results/plot.ipynb notebook can be used to plot results.
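If you prefer to plot outside the notebook, a minimal sketch is below; the column names `step` and `episode_reward` are assumptions, so check the actual schema of results/tcrl.csv first.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Assumed column names; inspect results/tcrl.csv for the actual schema.
df = pd.read_csv("results/tcrl.csv")
df.plot(x="step", y="episode_reward")
plt.xlabel("environment steps")
plt.ylabel("episode reward")
plt.show()
```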
If you use this repo in your research, please consider citing our paper:
@article{zhao2023simplified,
  title={Simplified Temporal Consistency Reinforcement Learning},
  author={Zhao, Yi and Zhao, Wenshuai and Boney, Rinu and Kannala, Juho and Pajarinen, Joni},
  journal={arXiv preprint arXiv:2306.09466},
  year={2023}
}
We thank the TD-MPC, SAC-PyTorch, and DrQv2 authors for their high-quality source code.