Create two folders imagenet_info
and text_info
in the current project directory
/path/to/DeCLIP/
├── docs/
├── experiments/
├── linklink/
├── prototype/
├── text_info/
├── imagenet_info/
...
-
First Download our YFCC15M label file - Google Driver and put it into
imagenet_info
dir -
Download Image data, You have two ways to download Image data:
- DownLoad by labels: Crawl the image by the url in label dirctely.
- Filter by label: Download offical YFCC100M data, and Prepare the YFCC15M subset metadata pickle by the label.
- Download our vocab file for Text encoder Google Driver
- put it into
text_info
dir
- DownLoad offical ImageNet Dataset
- DownLoad our ImageNet validation label file - Google Driver
- put it into
imagenet_info
dir