This plugin is a Python plugin that streamlines image deduplication workflows!
With this plugin, you can:
- Find exact duplicate images using a hash function
- Find near duplicate images using an embedding model and similarity threshold
- View and interact with duplicate images in the App
- Remove all duplicates, or keep a representative image from each duplicate set
fiftyone plugins download https://github.com/jacobmarks/image-deduplication-plugin
This operator finds near-duplicate images in a dataset using a specified similarity index paired with either a distance threshold or a fraction of samples to mark as duplicates.
This operator finds exact duplicate images in a dataset using a hash function.
This operator displays the images in a dataset that are near-duplicates of each other, grouped together.
This operator displays the images in a dataset that are exact duplicates of each other, grouped together.
This operator removes all near-duplicate images from a dataset.
This operator removes all exact duplicate images from a dataset.
This operator removes near-duplicate images from a dataset, keeping a representative image from each duplicate set.
This operator removes exact duplicate images from a dataset, keeping a representative image from each duplicate set.