A PyTorch implementation of "Human-level control through deep reinforcement learning" (DQN).
├── agents
|   └── dqn.py               # the main training agent for the DQN
├── graphs
|   ├── models
|   |   └── dqn.py           # the DQN network model
|   └── losses
|       └── huber_loss.py    # contains the Huber loss definition
├── datasets                 # contains all data loaders for the project
├── utils                    # utilities: input extraction, replay memory, config parsing, etc.
|   ├── assets
|   ├── replay_memory.py     # experience replay buffer
|   └── env_utils.py         # environment helpers (input extraction)
├── main.py                  # entry point
└── run.sh                   # launch script (sh run.sh)
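For reference, the Huber loss used by DQN is quadratic for small temporal-difference errors and linear for large ones. Below is a minimal sketch of such a loss in PyTorch; the function name and signature are illustrative and not necessarily what `graphs/losses/huber_loss.py` defines.

```python
import torch

def huber_loss(pred, target, delta=1.0):
    # Quadratic penalty for small errors, linear penalty for large ones,
    # which keeps Q-learning updates stable against outlier TD errors.
    err = torch.abs(pred - target)
    quadratic = torch.clamp(err, max=delta)
    linear = err - quadratic
    return (0.5 * quadratic ** 2 + delta * linear).mean()
```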
Training plots: loss during training, and durations per episode.
- To run the project, add your configuration file to the `configs/` folder, then start training with `sh run.sh`.
- To run on a GPU, enable CUDA in the config file (see the sketch below).
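As a rough illustration of what the CUDA flag controls (the config structure below is hypothetical; check `configs/` for the actual schema), enabling it simply decides which device the model and tensors are placed on:

```python
import torch
import torch.nn as nn

# Minimal sketch: a "cuda" flag from the config decides the device.
# The dict below is a stand-in for the real parsed config.
config = {"cuda": True}

use_cuda = config["cuda"] and torch.cuda.is_available()
device = torch.device("cuda" if use_cuda else "cpu")

# The model and batches are then moved to that device before training.
model = nn.Linear(4, 2).to(device)
print("Running on:", device)
```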
- PyTorch: 0.4.0
- torchvision: 0.2.1
- tensorboardX: 0.8
See `requirements.txt` for the full list.
- Test DQN on a more complex environment such as Ms. Pac-Man (see the sketch below).
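A hedged sketch of what swapping in an Atari environment might look like with Gym; the env id `MsPacman-v0` and the extra `gym[atari]` dependency are assumptions and not part of this repo's requirements, and both the id and the reset API vary across Gym versions.

```python
import gym

# Hypothetical: create a more complex Atari environment such as Ms. Pac-Man.
# Requires gym with the Atari extras installed.
env = gym.make("MsPacman-v0")
obs = env.reset()
print(env.action_space, env.observation_space)
env.close()
```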
- PyTorch official DQN tutorial: https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html
- pytorch-dqn: https://github.com/transedward/pytorch-dqn/blob/master/dqn_learn.py
This project is licensed under the MIT License - see the LICENSE file for details.