Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

This is the repository for the paper Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.

We empirically examine the representation complexity of the model, the optimal policy, and the optimal value function in various simulated MuJoCo environments.

Figure 1: Main results on various MuJoCo environments.

Installation

Create the conda environment from env.yaml.

conda env create -f env.yaml

Usage

Train the oracle policy with TD3.

bash ./train_TD3.sh

Generate the dataset by rolling out the trained policies.

bash ./rollout.sh
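
The contents of rollout.py are not reproduced in this README. As a rough illustration of what rolling out a policy to build a dataset involves, the sketch below uses a hypothetical one-dimensional environment (`ToyPointEnv`) and a hand-written `policy` in place of MuJoCo and a trained TD3 actor. The tuple layout `(state, action, reward, next_state, done)` is also an assumption, not the repository's actual format.

```python
import random


class ToyPointEnv:
    """Hypothetical 1-D stand-in for a MuJoCo task: the state is a position."""

    def __init__(self, horizon=50, seed=0):
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.pos = 0.0
        self.t = 0

    def reset(self):
        # Start each episode from a random position in [-1, 1].
        self.pos = self.rng.uniform(-1.0, 1.0)
        self.t = 0
        return self.pos

    def step(self, action):
        # The action pushes the point; the reward penalizes distance from 0.
        self.pos += 0.1 * action
        self.t += 1
        reward = -abs(self.pos)
        done = self.t >= self.horizon
        return self.pos, reward, done


def policy(state):
    """Hypothetical fixed policy standing in for a trained actor."""
    return max(-1.0, min(1.0, -2.0 * state))


def rollout(env, policy, num_episodes):
    """Run the policy and record every transition it visits."""
    dataset = []
    for _ in range(num_episodes):
        state, done = env.reset(), False
        while not done:
            action = policy(state)
            next_state, reward, done = env.step(action)
            dataset.append((state, action, reward, next_state, done))
            state = next_state
    return dataset


dataset = rollout(ToyPointEnv(), policy, num_episodes=3)
print(len(dataset))  # 150 = 3 episodes x 50 steps
```

A dataset of this shape provides regression targets for a transition model (state and action to next state and reward) and for a policy (state to action). Targets for a value function would need extra bookkeeping that is not shown here.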

Compute the representation error.

bash ./repr_error.sh
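
What repr_error.py computes is not spelled out in this README. As a minimal sketch, assuming "representation error" means the lowest mean squared error that a function class of fixed capacity attains on a target function, the example below uses piecewise-constant functions with `num_bins` equal-width bins as the function class. The helper `piecewise_constant_mse` and both target functions are hypothetical and chosen only to show that a harder target leaves a larger error at the same capacity.

```python
import math


def piecewise_constant_mse(xs, ys, num_bins):
    """Lowest MSE of a piecewise-constant fit with equal-width bins on [0, 1)."""
    sums = [0.0] * num_bins
    counts = [0] * num_bins
    bin_of = []
    for x, y in zip(xs, ys):
        b = min(int(x * num_bins), num_bins - 1)
        bin_of.append(b)
        sums[b] += y
        counts[b] += 1
    # The per-bin mean is the MSE-optimal constant for that bin.
    means = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    return sum((y - means[b]) ** 2 for y, b in zip(ys, bin_of)) / len(ys)


xs = [i / 1000 for i in range(1000)]
smooth = [x for x in xs]                     # easy target: a straight line
wiggly = [math.sin(40 * x) for x in xs]      # harder target: fast oscillation

for name, ys in [("smooth", smooth), ("wiggly", wiggly)]:
    errors = {k: piecewise_constant_mse(xs, ys, k) for k in (2, 8, 32)}
    print(name, errors)
```

Comparing the errors of two targets at the same `num_bins` gives a simple capacity-matched notion of which one is harder to represent. A neural network of fixed width and depth could play the role of the function class instead, at the cost of an optimization step.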

Acknowledgement

This repository was built upon TD3.

Citation

If you find the content of this repo useful, please consider citing it as follows:

@article{feng2023rethinking,
  title={Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity},
  author={Feng, Guhao and Zhong, Han},
  journal={arXiv preprint arXiv:2312.17248},
  year={2023}
}

