DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Authors: Jaemin Cho, Abhay Zala, and Mohit Bansal (UNC Chapel Hill)
Paper

Visual Reasoning

Please see ./paintskills for our DETR-based visual reasoning skill evaluation.

(Optional) Please see https://github.com/aszala/PaintSkills-Simulator for our 3D Simulator implementation.

Social Bias

Please see ./biases for our social (gender and skin tone) bias evaluation.

Image Quality & Image-Text Alignment

Please see ./quality for our image quaity evaluation based on FID score.

Please see ./retrieval for our image-text alignment evaluation with CLIP-based R-precision.

Please see ./captioning for our image-text alignment evaluation with VL-T5 captioning.

Models

We provide inference scripts for DALLE-small (DALLE-pytorch), minDALL-E, X-LXMERT, and Stable Diffusion.

Acknowledgments

We thank the developers of DETR, DALLE-pytorch, minDALL-E, X-LXMERT, and Stable Diffusion for their public code release.

Reference

Please cite our paper if you use our dataset in your works:

@inproceedings{Cho2023DallEval,
  title         = {DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models},
  author        = {Jaemin Cho and Abhay Zala and Mohit Bansal},
  year          = {2023},
  booktitle     = {ICCV},
}

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
assets		assets
biases		biases
captioning		captioning
models		models
paintskills		paintskills
quality		quality
retrieval		retrieval
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Visual Reasoning

Social Bias

Image Quality & Image-Text Alignment

Models

Acknowledgments

Reference

About

Releases

Packages

Languages

License

j-min/DallEval

Folders and files

Latest commit

History

Repository files navigation

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Visual Reasoning

Social Bias

Image Quality & Image-Text Alignment

Models

Acknowledgments

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages