Skip to content

Latest commit

 

History

History
81 lines (73 loc) · 2.51 KB

README.md

File metadata and controls

81 lines (73 loc) · 2.51 KB

Dataset Preparation

Below, you find instructions for retrieving the individual datasets. Once downloaded, The path to the datasets should be specified in .env:

DATASETS_PATH=/path/to/datasets

It is assumed that datasets follow the structure

dataset_name
|_ data
|  |_ [video name 0]
|  |  |_ [video name 0]_000001.jpg
|  |  |_ [video name 0]_000002.jpg
|  |  |_ ...
|  |_ [video name 1]
|     |_ [video name 1]_000001.jpg
|     |_ [video name 1]_000002.jpg
|     |_ ...
|_ splits
|  |_ train.csv
|  |_ val.csv

Kinetics-400

Kinetics is a large-scale dataset for Trimmed Human Activity Recognition, consisting of 10 second videos collected from YouTube, ranging over 400 classes. Due to it's origin, a direct download of the complete dataset is not possible. Instead, a list of videos and corresponding labels can be downloaded here, and a YouTube Crawler can subsequently be employed to collect the videos one by one. Note: this process may take multiple days.

It is assumed that the dataset has the following structure:

kinetics400
|_ data
|  |_ validate
|  |  |_ abseiling/
|  |  |_ air drumming/
|  |  |  |_ 3caPS4FHFF8_000036_000046.mp4*
|  |  |  |_ 3yaoNwz99xM_000062_000072.mp4*
|  |  |  |_ ...
|  |  |_ ...
|  |_ test
|  |  |_ ...
|  |_ train
|  |  |_ ...
|_ splits
|  |_ test.csv*
|  |_ test.json*
|  |_ train.csv*
|  |_ train.json*
|  |_ validate.csv*
|  |_ validate.json*

Charades

Charades is multi-label action classification dataset comprised of 9848 videos of daily indoors activities collected through Amazon Mechanical Turk. Steps to prepare dataset.

  1. Please download the Charades RGB frames from the dataset provider.

  2. Download the frame list from the following links: (train, val).

It is assumed that the dataset has the following structure:

charades
|_ data
|  |_ Charades_v1_rgb
|  |  |_ 001YG/
|  |  |  |_ 001YG-000001.jpg
|  |  |  |_ 001YG-000002.jpg
|  |  |  |_ ...
|  |  |_ 003WS/
|  |  |  |_ ...
|  |  |_ ...
|_ splits
|  |_ train.csv
|  |_ val.csv
|  |_ Charades_v1_test.csv
|  |_ Charades_v1_train.csv
|  |_ ...