This is a collection of research papers for Diffusion Model in RL. And the repository will be continuously updated to track the frontier of Diffusion RL.
Welcome to follow and star!
The Diffusion Model in RL was introduced by “Planning with Diffusion for Flexible Behavior Synthesis” by Janner, Michael, et al. It casts trajectory optimization as a diffusion probabilistic model that plans by iteratively refining trajectories.
There is another way: "Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning" by Wang, Z. proposed Diffusion Model as policy-optimization in offline RL, et al. Specifically, Diffusion-QL forms policy as a conditional diffusion model with states as the condition from the offline policy-optimization perspective.
- Bypass the need for bootstrapping for long term credit assignment.
- Avoid undesirable short-sighted behaviors due to the discounting future rewards.
- Enjoy the diffusion models widely used in language and vision, which are easy to scale and adapt to multi-modal data.
format:
- [title](paper link) [links]
- author1, author2, and author3...
- publisher
- key
- code
- experiment environment
-
MADiff: Offline Multi-agent Learning with Diffusion Models
- Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang
- Key: Multi-agent, Offline RL, Classifier-free
- ExpEnv: MPE, SMAC, Multi-Agent Trajectory Prediction (MATP)
-
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
- Suzan Ece Ada, Erhan Oztop, Emre Ugur
- Key: Offline RL, OOD Generalization
- ExpEnv: 2D-Multimodal Contextual Bandit, D4RL
-
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
- Cheng Chi, Siyuan Feng, Yilun Du, Zhenjia Xu, Eric Cousineau, Benjamin Burchfiel, Shuran Song
- Key: Robot Manipulation
- ExpEnv: Robomimic, Push-T, Multimodal Block Pushing, Franka Kitchen
-
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
-
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
- Moritz Reuss, Maximilian Li, Xiaogang Jia, Rudolf Lioutikov
- Key: Planning, Generalizability, Classifier-guided
- ExpEnv: Maze2D, MuJoCo, KUKA Robot
-
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
- Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo
- Key: Goal-Conditioned Imitation Learning, Robotics, Classifier-free
- ExpEnv: CALVIN, Block-Push, Relay Kitchen
-
Optimizing DDPM Sampling with Shortcut Fine-Tuning
- Ying Fan, Kangwook Lee
- Publisher: ICML 2023
- Key: Training Diffusion with RL, Online RL, Sampling Optimization
- Code: official
- ExpEnv: CIFAR10, CelebA
-
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
- Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang
- Publisher: ICML 2023
- Key: Offline meta-RL, Conditional Trajectory Generation, Generalization, Classifier-guided
- ExpEnv: MuJoCo
-
Is Conditional Generative Modeling all you need for Decision-Making?
-
Imitating Human Behaviour with Diffusion Models
- Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin
- Publisher: ICLR 2023
- Key: Offline RL, Policy Optimization, Imitation Learning, Classifier-free
- ExpEnv: Claw, Kitchen, CSGO
-
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
- Guided Conditional Diffusion for Controllable Traffic Simulation
- Ziyuan Zhong, Davis Rempe, Danfei Xu, Yuxiao Chen, Sushant Veer, Tong Che, Baishakhi Ray, Marco Pavone
- Publisher: ICRA 2023
- Key: Traffic Simulation, Multi-Agent, Classifier-free
- ExpEnv: nuScenes
- Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
- Zhendong Wang, Jonathan J Hunt, Mingyuan Zhou
- Publisher: NeurIPS Deep RL Workshop 2022
- Key: Offline RL, Policy Optimization
- Code: official, unofficial
- ExpEnv: MuJoco, D4RL
Our purpose is to make this repo even better. If you are interested in contributing, please refer to HERE for instructions in contribution.
Awesome Diffusion Model in RL is released under the Apache 2.0 license.