Universal representation learning for faster RL

Alex Nam and Alex Loia (CS 230 Fall 2021 Project)

Info

The default_curl_v3.ipynb notebook includes our current implementation of the CURL architecture (namely, the image encoder) along with our own data augmentations.
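
For readers unfamiliar with CURL, the sketch below illustrates the contrastive objective that trains the image encoder: two augmented views of the same observation are encoded into a query and a key, and an InfoNCE loss with a bilinear similarity pulls matching pairs together. This is a minimal NumPy illustration, not the notebook's code. The function name info_nce_loss, the batch and latent sizes, and the identity matrix W are assumptions made for the example. In the published CURL method the keys come from a momentum-averaged copy of the encoder and W is learned.

```python
import numpy as np

def info_nce_loss(queries, keys, W):
    """InfoNCE loss with bilinear similarity q^T W k.

    queries, keys: arrays of shape (batch, dim). Row i of each is assumed
    to come from two augmentations of the same observation (a positive pair);
    every other row in the batch acts as a negative.
    W: (dim, dim) similarity matrix (learned in practice, fixed here).
    """
    logits = queries @ W @ keys.T                          # (batch, batch)
    logits = logits - logits.max(axis=1, keepdims=True)    # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # The positive for query i is key i, i.e. the diagonal entries.
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
z = rng.normal(size=(4, 8))      # hypothetical batch of 4 latents, 8 dims each
W = np.eye(8)                    # assumption: identity instead of a learned matrix

aligned = info_nce_loss(z, z, W)         # keys match their queries
shuffled = info_nce_loss(z, z[::-1], W)  # keys paired with the wrong queries
```

The loss is lower when each query is paired with its own key than when the keys are reordered, which is the signal the encoder is trained on.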

The resnet_pretrained.ipynb notebook uses the same RL architecture as above but replaces the image encoder with a ResNet model pretrained on ImageNet, which serves as a point of comparison for the CURL encoder.

The curl+some_state-cartpole.ipynb notebook is similar to the default CURL implementation, except that some velocity state is added to the image encodings to test whether doing so is functionally equivalent to frame stacking.
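
As a rough illustration of this idea, the sketch below appends velocity values to an image encoding before it is passed to the Q-network. The function name augment_encoding, the 50-dimensional latent, and the choice of cart velocity plus pole angular velocity are assumptions for the example. The notebook may select or combine the state differently.

```python
import numpy as np

def augment_encoding(encoding, velocity_state):
    """Concatenate an image encoding with low-dimensional velocity state."""
    encoding = np.asarray(encoding, dtype=np.float32)
    velocity_state = np.asarray(velocity_state, dtype=np.float32)
    return np.concatenate([encoding, velocity_state], axis=-1)

encoding = np.zeros(50, dtype=np.float32)   # hypothetical 50-dim image latent
velocity = [0.1, -0.3]                      # assumed: cart velocity, pole angular velocity
features = augment_encoding(encoding, velocity)
print(features.shape)
```

A single frame cannot reveal which way the cart or pole is moving, so the extra entries supply the motion information that frame stacking would otherwise have to provide.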

The frame-stacked variants are similar to the above but use two frames per observation instead of one, in the hope of natively encoding velocity information between timesteps.
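
Frame stacking can be sketched as a small buffer that keeps the most recent frames and returns them as one observation. The class name FrameStacker, the 84x84 frame size, and the choice to repeat the first frame on reset are assumptions for the example, not details taken from the notebooks.

```python
from collections import deque

import numpy as np

class FrameStacker:
    """Keeps the last num_frames frames and stacks them along a new axis."""

    def __init__(self, num_frames=2):
        self.frames = deque(maxlen=num_frames)

    def reset(self, first_frame):
        # At the start of an episode there is no history, so repeat the frame.
        self.frames.clear()
        for _ in range(self.frames.maxlen):
            self.frames.append(first_frame)
        return self.observation()

    def step(self, frame):
        # Appending to a full deque drops the oldest frame automatically.
        self.frames.append(frame)
        return self.observation()

    def observation(self):
        return np.stack(list(self.frames), axis=0)

stacker = FrameStacker(num_frames=2)
frame_t0 = np.zeros((84, 84), dtype=np.float32)   # hypothetical frame size
frame_t1 = np.ones((84, 84), dtype=np.float32)
obs0 = stacker.reset(frame_t0)   # shape (2, 84, 84), both slices equal frame_t0
obs1 = stacker.step(frame_t1)    # slice 0 is frame_t0, slice 1 is frame_t1
```

With two consecutive frames in one observation, the difference between the slices carries the velocity information that a single frame lacks.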

The pixel_baseline.ipynb notebook is a DQN implementation without a separate encoder; it serves as a baseline for learning directly from pixels.
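
For context, the core of a DQN update is the temporal-difference target that the Q-network is regressed toward. The sketch below computes that target for a small batch. The function name dqn_targets and the discount of 0.9 are assumptions for the example. The notebook may add details such as a separate target network or reward clipping.

```python
import numpy as np

def dqn_targets(rewards, next_q_values, dones, gamma=0.99):
    """One-step TD targets: r + gamma * max_a Q(s', a), with no bootstrap at episode end."""
    rewards = np.asarray(rewards, dtype=np.float64)
    dones = np.asarray(dones, dtype=np.float64)       # 1.0 where the episode ended
    max_next_q = np.max(next_q_values, axis=1)        # greedy value of the next state
    return rewards + gamma * (1.0 - dones) * max_next_q

targets = dqn_targets(
    rewards=[1.0, 1.0],
    next_q_values=np.array([[0.5, 2.0], [3.0, 1.0]]),  # Q(s', a) for two actions
    dones=[False, True],
    gamma=0.9,
)
# targets is approximately [2.8, 1.0]: the second transition is terminal,
# so its target is the reward alone.
```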

Setup

Run conda env create -f environment.yml, then conda activate cs230proj. Next, run jupyter notebook to open the Jupyter Notebook web interface. Open the different notebooks and enjoy!

Runs on Linux only. May require the xvfb package on headless systems.

