TGIF: Text-Guided Inpainting Forgery Dataset

This dataset contains approximately 75k fake images, manipulated by text-guided inpainting methods (SD2, SDXL, and Adobe Firefly). The authentic images originate from MS-COCO, with a CC BY 4.0 license, and have resolutions up to 1024x1024 px. We provide both the manipulated image where the inpainted area is spliced in the original image (SD2-sp, PS-sp), as well as the fully-regenerated image (SD2-fr, SDXL-fr), when possible.

The dataset corresponds to the paper "TGIF: Text-Guided Inpainting Forgery Dataset", which was accepted at the IEEE International Workshop on Information Forensics & Security 2024.

We distribute this dataset under the CC BY-SA 4.0 license.

Visual explanation of our insights

Did the authors really go skiing on Greece's iconic Mt. Athos?

The image above is fake - the skis were added using text-guided inpainting. Can current forensic methods detect this manipulation?

Find out in our blog post, where we explain our insights in a simple and visual way.

Dataset specifications

How we created 75k fake images using SD2, SDXL, and Adobe Photoshop/Firefly. We used 2 types of masks, and differentiate between spliced and fully regenerated inpainted images. Note seen in the diagram: each inpainting operation creates 3 variations in batch.

Manipulation types
# masks	2 (segmentation & bounding box)
# variations (num_images_per_prompt)	3 per generation (in batch)
# sub-datasets	4 (SD2-sp, PS-sp, SD2-fr, SDXL-fr)
Total # manipulated images per authentic image	2 * 3 * 4 = 24

Dataset size	Training	Validation	Testing	Total
# authentic images	2 440	341	343	3 124
# manipulated images	58 560	8 184	8 232	74 976

Download

Download all images

The downloads are organized in masks, original, SD2-sp, PS-sp, SD2-fr, SDXL-fr. And each of those are separated in training, validation, and testing, respectively. The manipulated image filenames are as follows:

Metadata (incl. NIMA, GIQA & ITM scores) is available in this repository (metadata), as well as the benchmark results (benchmark-results).

Code to perform text-guided inpainting with SD2, SDXL & Adobe Photoshop/Firefly is added in the code folder of this repository, as well as code to calculate NIMA, GIQA, and ITM scores, and to compress images using JPEG and WEBP.

Filenaming

The files are named as follows:

orig:
- {coco_id}_orig.png
- {coco_id}_orig_{crop_size}.png
masks:
- {coco_id}_mask_{crop_size}.png
- {coco_id}_mask_{mask_type}.png
- - {coco_id}_mask_{mask_type}_{crop_size}.png
- {coco_id}_mask_{mask_type}.png_ps_mask.png - Photoshop adaptation of mask (extra border)
SD2-sp: {coco_id}_mask_{mask_type}.png_ps_mask.png_sd2_{var_id}.png
PS-sp: {coco_id}_mask_{mask_type}.png_ps_{var_id}.png
SD2-fr: {coco_id}_mask_{mask_type}.png_sd2-512_{var_id}.png
SDXL-fr: {coco_id}_mask_{mask_type}.png_sdxl-1024_{var_id}.png

With

crop_size: 512 or 1024
mask_type: bbox or segm
var_id: 0, 1, or 2

Reference

This work will be presented in the IEEE International Workshop on Information Forensics & Security 2024. The preprint can be downloaded on arXiv.

@InProceedings{mareen2024tgif,
  author="Mareen, Hannes and Karageorgiou, Dimitrios and Van Wallendael, Glenn and Lambert, Peter and Papadopoulos, Symeon",
  title="TGIF: Text-Guided Inpainting Forgery Dataset",
  booktitle="Proc. Int. Workshop on Information Forensics and Security (WIFS) 2024",
  year="2024"
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
benchmark-results		benchmark-results
code		code
metadata		metadata
readme-images		readme-images
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TGIF: Text-Guided Inpainting Forgery Dataset

Visual explanation of our insights

Dataset specifications

Download

Filenaming

Reference

About

Releases

Packages

Languages

License

IDLabMedia/tgif-dataset

Folders and files

Latest commit

History

Repository files navigation

TGIF: Text-Guided Inpainting Forgery Dataset

Visual explanation of our insights

Dataset specifications

Download

Filenaming

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages