This repo contains the code for the zero-shot baseline built on top of CLIP [Paper, Code] embeddings, from the paper "MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions" [ArXiv Preprint].
Clone the repository and move to the baseline folder:
git clone https://github.com/Soldelli/MAD.git
cd MAD/baselines/0ShotClip/
Install the environment:
conda env create -f environment.yml
If the installation fails, please follow the instructions in doc/environment.md.
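To sanity-check the environment after activating it, you can try importing its main dependencies. The short sketch below is not part of the repo; the package names (numpy, h5py, torch, clip) are assumptions about what environment.yml provides, so adjust the list to the actual file.

# env_check.py -- informal sanity check, run after "conda activate 0ShotCLIP".
# The packages listed here are assumed from environment.yml, not confirmed.
import importlib

for pkg in ("numpy", "h5py", "torch", "clip"):
    try:
        importlib.import_module(pkg)
        print(f"{pkg}: OK")
    except ImportError as err:
        print(f"{pkg}: MISSING ({err})")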
First, request access to the data by following the instructions on the main page of the repository. Once you have obtained the data, place the files according to the folder structure shown below (a small layout-check sketch follows the tree), then proceed with the rest of the README.
The folder structure should be as follows:
.
├── datasets
│   └── MAD
│       ├── annotations
│       │   ├── MAD_train.json
│       │   ├── MAD_val.json
│       │   └── MAD_test.json
│       └── features
│           ├── CLIP_language_features_MAD_test.h5
│           └── CLIP_frames_features_5fps.h5
└── doc
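Before running the baseline, you may want to verify that everything is in place. The sketch below simply checks that the annotation and feature files from the tree above exist; the root path datasets/MAD is taken from that tree and should be adjusted if your data lives elsewhere.

# check_layout.py -- illustrative layout check, not part of the official repo.
from pathlib import Path

ROOT = Path("datasets/MAD")  # adjust if your data is stored elsewhere

EXPECTED = [
    ROOT / "annotations" / "MAD_train.json",
    ROOT / "annotations" / "MAD_val.json",
    ROOT / "annotations" / "MAD_test.json",
    ROOT / "features" / "CLIP_language_features_MAD_test.h5",
    ROOT / "features" / "CLIP_frames_features_5fps.h5",
]

missing = [p for p in EXPECTED if not p.is_file()]
if missing:
    print("Missing files:")
    for p in missing:
        print(f"  {p}")
else:
    print("All expected MAD files found.")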
Simply run the following commands:
conda activate 0ShotCLIP
python 0_shot_baseline.py
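For reference, the core idea of the zero-shot baseline is to score video frames by the cosine similarity between their CLIP visual features and the CLIP feature of the query sentence, and to ground the query around the highest-scoring region. The snippet below is only a minimal sketch of that idea under assumptions: the HDF5 key layout (one dataset per movie / per query) and the mean-pooling of language token features are guesses, and the single-frame argmax is a simplification of the actual evaluation in 0_shot_baseline.py.

# sketch_zero_shot.py -- minimal illustration of CLIP-based zero-shot grounding.
# HDF5 schema and pooling are assumptions; see 0_shot_baseline.py for the real code.
import h5py
import numpy as np

FPS = 5.0  # frame features are extracted at 5 fps (CLIP_frames_features_5fps.h5)

def cosine_scores(frame_feats: np.ndarray, query_feat: np.ndarray) -> np.ndarray:
    """Cosine similarity between every frame feature and one query feature."""
    frame_feats = frame_feats / np.linalg.norm(frame_feats, axis=1, keepdims=True)
    query_feat = query_feat / np.linalg.norm(query_feat)
    return frame_feats @ query_feat

with h5py.File("datasets/MAD/features/CLIP_frames_features_5fps.h5", "r") as vid_h5, \
     h5py.File("datasets/MAD/features/CLIP_language_features_MAD_test.h5", "r") as lang_h5:
    movie_id = list(vid_h5.keys())[0]   # hypothetical: pick one movie
    query_id = list(lang_h5.keys())[0]  # hypothetical: pick one query
    frames = np.asarray(vid_h5[movie_id])          # assumed shape (num_frames, dim)
    query = np.asarray(lang_h5[query_id]).mean(0)  # assumed token features, mean-pooled

    scores = cosine_scores(frames, query)
    best = int(scores.argmax())
    print(f"Best-matching frame {best}, ~{best / FPS:.1f}s into the movie.")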
If any part of our paper or code is helpful to your work, please cite us with:
@inproceedings{soldan2021mad,
  title={MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions},
  author={Soldan, Mattia and Pardo, Alejandro and Alc{\'a}zar, Juan Le{\'o}n and Heilbron, Fabian Caba and Zhao, Chen and Giancola, Silvio and Ghanem, Bernard},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2022}
}

@inproceedings{radford2021learning,
  title={Learning transferable visual models from natural language supervision},
  author={Radford, Alec and Kim, Jong Wook and Hallacy, Chris and Ramesh, Aditya and Goh, Gabriel and Agarwal, Sandhini and Sastry, Girish and Askell, Amanda and Mishkin, Pamela and Clark, Jack and others},
  booktitle={International Conference on Machine Learning},
  pages={8748--8763},
  year={2021},
  organization={PMLR}
}