PyTorch implementation of the Soft Actor-Critic algorithm
Soft Actor-Critic (SAC) is an off-policy actor-critic deep RL algorithm that optimizes stochastic policies over continuous action spaces within the maximum entropy framework.
Paper: Haarnoja, T., Zhou, A., Abbeel, P., & Levine, S. (2018), Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
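The core idea of the paper is that the policy is trained to maximize expected return plus an entropy bonus. A minimal, self-contained PyTorch sketch of the SAC actor objective is below; the toy `actor` and `critic` networks and the temperature `alpha` are illustrative placeholders, not this repository's actual classes.

```python
import torch

# Illustrative sketch of the SAC actor objective (not the repo's code).
# The policy minimizes  E[ alpha * log pi(a|s) - Q(s, a) ],
# i.e. it maximizes expected Q-value plus policy entropy.

torch.manual_seed(0)
obs_dim, act_dim, alpha = 3, 1, 0.2  # assumed toy dimensions and temperature

# Toy stand-ins for the actor and critic networks.
actor = torch.nn.Linear(obs_dim, 2 * act_dim)   # outputs mean and log-std
critic = torch.nn.Linear(obs_dim + act_dim, 1)  # Q(s, a)

state = torch.randn(8, obs_dim)
mean, log_std = actor(state).chunk(2, dim=-1)
std = log_std.clamp(-5, 2).exp()

# Reparameterized sample with tanh squashing, as in the SAC paper.
normal = torch.distributions.Normal(mean, std)
u = normal.rsample()
action = torch.tanh(u)
# Log-probability with the change-of-variables correction for tanh.
log_prob = (normal.log_prob(u)
            - torch.log(1 - action.pow(2) + 1e-6)).sum(-1, keepdim=True)

q = critic(torch.cat([state, action], dim=-1))
actor_loss = (alpha * log_prob - q).mean()
actor_loss.backward()  # gradients flow through the reparameterized sample
```

The reparameterization trick (`rsample`) is what lets the actor loss be differentiated through the sampled action, which is central to SAC's low-variance policy updates.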
- Clone the repository

  ```bash
  git clone https://github.com/domingoesteban/pytorch_sac
  ```

- Install the required Python packages

  ```bash
  cd pytorch_sac
  pip install -r requirements.txt
  ```
- Run (train) the algorithm in an OpenAI Gym environment with continuous observation and action spaces (e.g., Pendulum-v0)

  ```bash
  python train.py -e Pendulum-v0 -i 30
  ```
- Plot the results of the previous training process

  ```bash
  python eval.py -p PATH_TO_LOG_DIRECTORY
  ```
- Evaluate the resulting policy

  ```bash
  python eval.py PATH_TO_LOG_DIRECTORY
  ```