`ImageRetriever` #2445

ZanSara · 2022-04-21T15:50:36Z

[Part of #2418]

What
ImageRetriever is a node that is capable of performing two operations:

Creating comparable embeddings for text and images
Comparing such embeddings to retrieve images that correspond to a given description.

How
ImageRetriever could be based on CLIP, which is available through sentence-transformers.

Given that ImageRetriever has clearly two responsibilities, we can consider splitting them between ImageEmbedder (only creates and stores the embeddings) and ImageRetriever (computes the embeddings for the query only and uses the pre-existing image embeddings from the document stores) #2403

Expected results

A pipeline should be capable to take a string as input and output a list of paths to images.
All document stores should be able to support image embeddings

NOT expected (yet):

Pipelines capable of returning mixed images and text results with comparable scores.

This new node is likely to be implemented in a single PR.

The text was updated successfully, but these errors were encountered:

julian-risch · 2022-05-11T11:26:11Z

@ZanSara I think we could make use of Data2Vec from transformers: https://huggingface.co/docs/transformers/main/en/model_doc/data2vec That would allow to do the implementation of the embedding calculation in the EmbeddingRetriever in just a few lines of code.

julian-risch · 2022-05-11T11:30:32Z

Data2VecVision has been recently added: https://github.com/huggingface/transformers/pull/16760/files
and Data2VecText and Data2VecAudio are already there.

julian-risch · 2022-05-18T15:04:56Z

@ZanSara There was a transformers release and Data2VecVision has also been part of the release: https://github.com/huggingface/transformers/releases

masci · 2022-07-20T13:17:05Z

Superseded by #2418

ZanSara added type:feature New feature or request topic:document_store topic:retriever topic:images labels Apr 21, 2022

ZanSara self-assigned this Apr 21, 2022

ZanSara mentioned this issue Apr 21, 2022

Add support for images #2418

Closed

8 tasks

ZanSara mentioned this issue Jun 22, 2022

Simplify language_modeling.py and tokenization.py #2703

Merged

2 tasks

ZanSara mentioned this issue Jul 20, 2022

Load CLIP Models into a Retriever for image retrieval #2857

Closed

masci closed this as completed Jul 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`ImageRetriever` #2445

`ImageRetriever` #2445

ZanSara commented Apr 21, 2022

julian-risch commented May 11, 2022 •

edited

Loading

julian-risch commented May 11, 2022

julian-risch commented May 18, 2022

masci commented Jul 20, 2022

ImageRetriever #2445

ImageRetriever #2445

Comments

ZanSara commented Apr 21, 2022

julian-risch commented May 11, 2022 • edited Loading

julian-risch commented May 11, 2022

julian-risch commented May 18, 2022

masci commented Jul 20, 2022

`ImageRetriever` #2445

`ImageRetriever` #2445

julian-risch commented May 11, 2022 •

edited

Loading