Verify compatibility between `Data2VecVision` models and existing retrievers #2865

ZanSara · 2022-07-21T14:18:02Z

Context

Part of Add support for images #2418
After the simplification of language_model.py and tokenization.py, adding new supported model types in Haystack has been heavily simplified
The entire framework is still oriented heavily towards question answering on text, and this assumption is embedded into the code in many parts of the stack

Goal

Verify if any existing retriever can load a image retrieval model such as Data2VecVision with minor changes along the way
- If it can, consider a small refactoring to make the code paths more generic (change get_tokenizer into get_feature_extractor and so on)
- If it cannot in its current state, even with minor adaptation, consider creating a separate ImageRetriever class that can do that. Also evaluate if the underlying stack (Inferencer, Processor, AdaptiveModel etc) can be leveraged or not, and to which degree.

The text was updated successfully, but these errors were encountered:

ZanSara · 2022-07-27T10:16:22Z

An attempt to generalize TableTextRetriever to work with images quickly proved too complex for the scope of this issue.

Rather than modifying an existing Retriever with the risk of breaking working code, I opted for cloning TableTextRetriever and its stack of supporting classes and perform the changes needed to support N models rather than just 3 (query, text and tables).

The goal of this issue then changes to the following:

Create a multi modal retriever called MultiModalRetriever by generalizing the concepts introduced by TableTextRetriever
It introduces a stack of new subclasses to support such retriever, such as:
- MultiAdaptiveModel (from TriAdaptiveModel)
- EmbeddingSimilarityHead (from TextSimilarityHead)
- MultiModalSimilarityProcessor (from TableTextSimilarityProcessor)

Note that this Retriever will NOT be tested for working in pipelines, but only to work in isolation. It will also, most likely, stay undocumented. See #2418 for the rationale.

ZanSara · 2022-08-30T08:39:44Z

Continues in #2857

ZanSara added type:feature New feature or request topic:modeling type:refactor Not necessarily visible to the users topic:retriever topic:images labels Jul 21, 2022

ZanSara self-assigned this Jul 21, 2022

ZanSara mentioned this issue Jul 21, 2022

Add support for images #2418

Closed

8 tasks

ZanSara changed the title ~~Verify compatibility between Data2VecVision models and Embedding Retriever~~ Verify compatibility between Data2VecVision models and existing retrievers Jul 21, 2022

ZanSara mentioned this issue Jul 27, 2022

feat: MultiModalRetriever #2891

Merged

3 tasks

ZanSara closed this as completed Aug 30, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify compatibility between `Data2VecVision` models and existing retrievers #2865

Verify compatibility between `Data2VecVision` models and existing retrievers #2865

ZanSara commented Jul 21, 2022 •

edited

Loading

ZanSara commented Jul 27, 2022

ZanSara commented Aug 30, 2022

Verify compatibility between Data2VecVision models and existing retrievers #2865

Verify compatibility between Data2VecVision models and existing retrievers #2865

Comments

ZanSara commented Jul 21, 2022 • edited Loading

ZanSara commented Jul 27, 2022

ZanSara commented Aug 30, 2022

Verify compatibility between `Data2VecVision` models and existing retrievers #2865

Verify compatibility between `Data2VecVision` models and existing retrievers #2865

ZanSara commented Jul 21, 2022 •

edited

Loading