BRAC+: Improved Behavior Regularized Offline Reinforcement Learning

This repository is the official implementation of BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning.

Requirements

We highly recommend creating a new Python environment to test our code.

Conda Environment

conda create -n bracp python=3.8

To install requirements:

Python package

pip install -r requirements.txt

D4RL library

pip install git+https://github.com/rail-berkeley/d4rl@master#egg=d4rl

rlutils library

pip install rlutils-python==0.0.3

Training

python d4rl_bracp.py train --env_name halfcheetah-medium-v0 --seed 110
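To repeat the command above over several seeds, a small wrapper like the following sketch may help. It uses only the `train` subcommand and the `--env_name` and `--seed` flags shown above. The helper names `build_train_command` and `run_seeds` and the extra seed values are hypothetical and not part of this repository.

```python
import subprocess


def build_train_command(env_name, seed):
    """Return the README training command as an argument list."""
    return ["python", "d4rl_bracp.py", "train",
            "--env_name", env_name, "--seed", str(seed)]


def run_seeds(env_name, seeds, dry_run=True):
    """Print (dry_run=True) or execute one training run per seed, in order."""
    commands = [build_train_command(env_name, seed) for seed in seeds]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops the sweep if a run exits with an error.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Seeds other than 110 are arbitrary choices made for illustration.
    run_seeds("halfcheetah-medium-v0", [110, 111, 112], dry_run=True)
```

With `dry_run=True` the commands are only printed, so they can be inspected before launching the actual runs.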

The training script first pretrains the behavior policy, and then pretrains an initial policy that minimizes the KL divergence from the behavior policy.
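As a rough illustration of the quantity minimized during pretraining, the sketch below computes the closed-form KL divergence between two diagonal Gaussian action distributions. It is an assumption that both policies are diagonal Gaussians, and the direction of the divergence used by the repository is not stated here. The function name `gaussian_kl` is hypothetical and does not come from this codebase.

```python
import math


def gaussian_kl(mu_p, std_p, mu_q, std_q):
    """KL(p || q) for diagonal Gaussians, summed over action dimensions.

    Each argument is a sequence with one entry per action dimension.
    """
    kl = 0.0
    for mp, sp, mq, sq in zip(mu_p, std_p, mu_q, std_q):
        # Per-dimension closed form:
        # log(sq / sp) + (sp^2 + (mp - mq)^2) / (2 * sq^2) - 1/2
        kl += math.log(sq / sp) + (sp ** 2 + (mp - mq) ** 2) / (2 * sq ** 2) - 0.5
    return kl


# Identical distributions have zero divergence.
same = gaussian_kl([0.0, 0.0], [1.0, 1.0], [0.0, 0.0], [1.0, 1.0])

# Shifting the mean by one standard deviation in a single dimension gives 0.5.
shifted = gaussian_kl([1.0], [1.0], [0.0], [1.0])
```

An initial policy pretrained this way starts close to the behavior policy, so the divergence is small at the beginning of training.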

Logging

Logs are written to data/d4rl_results/.