-
Do you think object detection datasets with a single label could be useful for DALL-E? The information content seems quite low, but the model could still learn object names, various views of each object, and their typical backgrounds.
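One way such single-label data might still be turned into image/caption pairs is to derive a caption from the class name. The sketch below is only an illustration: the helper names `caption_from_class` and `write_class_captions`, the `<root>/<class>/<image>` folder layout, the "a photo of a ..." template, and the idea of storing each caption in a same-named `.txt` file next to the image are all assumptions, not something specified in this thread.

```shell
# Hypothetical helper: turn a class label such as "golden_retriever"
# into a caption. The caption template is an assumption.
caption_from_class() {
  # Underscores in the label become spaces.
  printf 'a photo of a %s\n' "$(printf '%s' "$1" | tr '_' ' ')"
}

# Hypothetical helper: for a <root>/<class>/<image> layout, write one
# caption file (same stem, .txt extension) next to every .jpg/.png image.
# Runs in a subshell so its variables do not leak into the caller.
write_class_captions() (
  root=$1
  for class_dir in "$root"/*/; do
    [ -d "$class_dir" ] || continue          # glob matched nothing
    label=$(basename "$class_dir")
    for img in "$class_dir"*.jpg "$class_dir"*.png; do
      [ -e "$img" ] || continue              # skip unmatched patterns
      caption_from_class "$label" > "${img%.*}.txt"
    done
  done
)
```

Calling `write_class_captions ./my_dataset` would then leave a caption file beside each image. The fixed template is crude (it produces "a apple" for labels starting with a vowel), so it is best treated as a starting point.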
-
YFCC100M could also be added to the list (#110 (comment)), as well as the Conceptual Captions dataset, which consists of 3,318,333 image/caption pairs: https://ai.google.com/research/ConceptualCaptions/
-
@afiaka87 good point about the copyright issue. I will also add the license agreements for any new datasets I find:
2.5 million images from 205 scene categories under the Creative Commons license.
One million labeled images for each of 10 scene categories and 20 object categories (maybe a little bit too reductionist). I didn't find any information on license agreements, only "If you find LSUN dataset useful in your research, please consider citing"...
2,686,419 AI-generated faces, e.g. "A white male with long black hair facing left" or "A black woman facing right with short white hair." Bulk download request under: [email protected]
8,456,240 (or 6,464,018 cleaned) images of 94,682 celebrities under GNU General Public License v3.0.
-
Here is the resized COCO dataset as a torrent: https://academictorrents.com/details/eea5a532dd69de7ff93d5d9c579eac55a41cb700
-
https://github.com/rom1504/kaggle-fashion-dalle/releases/tag/1.0.0 contains a version of https://www.kaggle.com/paramaggarwal/fashion-product-images-dataset preprocessed for dalle.
-
We'll need lots of data to train dalle-pytorch to the level OpenAI has reached with DALL-E. If you find any new or interesting datasets that are either captioned or could have captions generated for them (from class names, etc.), please post here and I'll update the list:
We'll download these with aria2c, gdown and wget (for the WIT links). Make sure to install them first (on Debian/Ubuntu the aria2c binary comes from the aria2 system package):
apt install wget aria2
python3 -m pip install gdown
COCO 2014 Resized to 256x256
Edit: If someone could rehost this for me I'd appreciate it. That Drive account isn't exactly production ready.
gdown --fuzzy "https://drive.google.com/file/d/1d7_N0Uxf4xYSSS-VcIVt4lBXFsUIbjvP/view"
Visual Genome
aria2c https://academictorrents.com/download/1bfe6871046860a2ff8c0cc1414318beb35dc916.torrent;
ImageNet
aria2c https://academictorrents.com/download/96816a530ee002254d29bf7a61c0c158d3dedc3b.torrent;
STL-10
aria2c https://academictorrents.com/download/a799a2845ac29a66c07cf74e2a2838b6c5698a6a.torrent;
Food-101
aria2c https://academictorrents.com/download/470791483f8441764d3b01dbc4d22b3aa58ef46f.torrent;
indoor CVPR
aria2c https://academictorrents.com/download/59aa0ad684e5d849f68bad9a6d43a9000a927164.torrent;
SVHN
aria2c https://academictorrents.com/download/6f4caf3c24803d114c3cae3ab9cb946cd23c7213.torrent;
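The torrent commands above all follow the same URL pattern, so they could be batched. The following is a minimal sketch, not a tested pipeline: `torrent_url` and `fetch_all_torrents` are hypothetical helper names, the `datasets` default directory is an assumption, and `--seed-time=0` (stop seeding once the download completes) is a choice you may want to drop if you are willing to seed for others.

```shell
# Info hashes copied from the aria2c commands above, in the same order:
# Visual Genome, ImageNet, STL-10, Food-101, indoor CVPR, SVHN.
TORRENT_HASHES="
1bfe6871046860a2ff8c0cc1414318beb35dc916
96816a530ee002254d29bf7a61c0c158d3dedc3b
a799a2845ac29a66c07cf74e2a2838b6c5698a6a
470791483f8441764d3b01dbc4d22b3aa58ef46f
59aa0ad684e5d849f68bad9a6d43a9000a927164
6f4caf3c24803d114c3cae3ab9cb946cd23c7213
"

# Hypothetical helper: build the .torrent URL for one info hash.
torrent_url() {
  printf 'https://academictorrents.com/download/%s.torrent\n' "$1"
}

# Hypothetical helper: download every torrent into one directory
# (default "datasets"), one after another.
fetch_all_torrents() {
  dest=${1:-datasets}
  for hash in $TORRENT_HASHES; do
    aria2c --dir="$dest" --seed-time=0 "$(torrent_url "$hash")"
  done
}
```

Nothing is downloaded until you call `fetch_all_torrents` (optionally with a target directory); these datasets are large, so check free disk space first.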
OpenImagesV6 (only downloads the 256 px versions)
WIT
Here are the links to download the 10 files.
wit_v1.train.all-00000-of-00010.tsv.gz
wit_v1.train.all-00001-of-00010.tsv.gz
wit_v1.train.all-00002-of-00010.tsv.gz
wit_v1.train.all-00003-of-00010.tsv.gz
wit_v1.train.all-00004-of-00010.tsv.gz
wit_v1.train.all-00005-of-00010.tsv.gz
wit_v1.train.all-00006-of-00010.tsv.gz
wit_v1.train.all-00007-of-00010.tsv.gz
wit_v1.train.all-00008-of-00010.tsv.gz
wit_v1.train.all-00009-of-00010.tsv.gz
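Since the ten shard names differ only in their index, they can be generated instead of typed out. This is a rough sketch under stated assumptions: `wit_shard_names` and `fetch_wit` are hypothetical helpers, and the base URL must be supplied by you as the first argument because it is not included in the list above.

```shell
# Hypothetical helper: print the ten WIT training shard names listed above.
wit_shard_names() {
  i=0
  while [ "$i" -lt 10 ]; do
    printf 'wit_v1.train.all-%05d-of-00010.tsv.gz\n' "$i"
    i=$((i + 1))
  done
}

# Hypothetical helper: download every shard with wget.
#   $1 = base URL the shards are hosted under (placeholder, supply your own)
#   $2 = target directory (default "wit", an assumption)
fetch_wit() {
  base_url=$1
  dest=${2:-wit}
  mkdir -p "$dest"
  wit_shard_names | while read -r name; do
    # --continue resumes a partially downloaded shard on re-run
    wget --continue --directory-prefix="$dest" "$base_url/$name"
  done
}
```

Running `wit_shard_names` on its own just prints the file names, which is a cheap way to check the list before starting a download.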