Advanced Deep Learning Project

This project aims to analyze the performance of state-of-the-art architectures for long context tasks using the Long Range Arena (LRA) dataset. The focus is on three network architectures: LSTM, Transformer, and State Space Models (S4), and three training strategies: direct training, pretraining on an external dataset, and pretraining on an LRA subset.

Installation

To set up the project, follow these steps:

# 1. Unzip the project folder
tar -xvzf ADV_ML_HW_1.tar.gz
cd ADV_ML_HW_1

# 2. Create a virtual environment and activate it.

python3 -m venv venv
source venv/bin/activate
  
# 3. Install the required packages.
pip install -r requirements.txt

Dataset Download

To download the LRA dataset, follow these steps:

# 1. Download the dataset.
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
mkdir -p ./data/raw
tar -xvzf lra_release.gz --strip-components=2 -C ./data/raw lra_release/lra_release/listops-1000
rm lra_release.gz

To download the MathQA dataset, follow these steps:

# 1. Download the dataset.
wget https://math-qa.github.io/math-QA/data/MathQA.zip
mkdir -p ./data/raw
unzip MathQA.zip -d ./data/raw/mathqa
rm MathQA.zip

Directory Structure

.
├── src/
│   ├── configs/ # Contains the configuration files for the experiments.
│   │   ├── try1.py
│   │   └── ...
│   ├── datasets/ # Contains the dataset classes.
│   │   ├── base_dataset.py # Base class for the datasets.
│   │   ├── listops_dataset.py # Main dataset
│   │   ├── mathqa_dataset.py # External dataset
│   │   └── retrieval_dataset.py # LRA secondary dataset
│   ├── models/ # Contains the model classes.
│   │   ├── architecture.py # Base class for the models.
│   │   ├── lstm.py
│   │   ├── s4.py
│   │   └── transformer.py
│   └── utils/
│       ├── config_types.py
│       ├── experiment_runner.py
│       └── metrics.py
├── requirements.txt
└── main_notebook.py # Main notebook to run the experiments.

Running the Experiment

Follow installation and download instructions.
Modify the configuration file in the configs directory.
Run the main notebook, choose the configuration file, and run the experiment.
View the results using tensorboard to visualize the metrics.

tensorboard --logdir=tensorboard

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
scripts		scripts
src		src
.gitignore		.gitignore
Advanced Deep Learning Assignment 1.pdf		Advanced Deep Learning Assignment 1.pdf
README.md		README.md
requirements.txt		requirements.txt
requirements_cuda.txt		requirements_cuda.txt
train_all.py		train_all.py
train_one.py		train_one.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Advanced Deep Learning Project

Table of Contents

Installation

Dataset Download

Directory Structure

Running the Experiment

About

Releases

Packages

Contributors 2

Languages

nirendy/sequence-models-comparisons

Folders and files

Latest commit

History

Repository files navigation

Advanced Deep Learning Project

Table of Contents

Installation

Dataset Download

Directory Structure

Running the Experiment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages