Introduce SentenceTransformer Reranker #1810

machatschek · 2024-03-30T19:13:51Z

Description

This PR adds support for document reranking with SentenceTransformer cross-encoders. The goal is a lightweight reranking step that improves both the quality and the speed of the model's responses.

Motivation

The integration of a reranking feature addresses the need for more relevant and accurate responses by pre-filtering documents before answer generation. The choice of SentenceTransformer's cross-encoder as the reranker is motivated by its efficiency and effectiveness in identifying the most relevant documents, compared to traditional LLM-based reranking methods.

Changes Made

Added an optional SentenceTransformerRerank node_postprocessor to ChatService, which facilitates the reranking process using the SentenceTransformer cross-encoder.
Updated settings to include rerank-specific configuration, so users can enable and customize reranking. The parameters include model and top_n; top_n sets how many of the top-ranked documents are passed on to final response generation.
Updated the documentation to guide users through enabling reranking, installing the necessary dependencies (poetry install --extras rerank-sentence-transformers), and configuring the rerank settings.

The reranking feature is disabled by default to accommodate the additional dependencies and the need for users to adjust configurations based on their unique use cases.
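
To illustrate how the settings above interact, here is a minimal sketch in plain Python. It is not the code from this PR: the `enabled` field name, the example model id, and the `token_overlap_score` stand-in scorer are assumptions made for illustration. A real setup would score each (query, document) pair with the cross-encoder instead.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class RerankSettings:
    # Only `model` and `top_n` are named in the PR text; `enabled` is a
    # hypothetical field name standing in for the on/off switch.
    enabled: bool = False  # reranking is off by default
    model: str = "cross-encoder/ms-marco-MiniLM-L-2-v2"  # example id, an assumption
    top_n: int = 1  # how many documents survive reranking


def token_overlap_score(query: str, document: str) -> float:
    """Stand-in scorer: fraction of query tokens that appear in the document.

    A cross-encoder would instead score the (query, document) pair jointly.
    """
    query_tokens = set(query.lower().split())
    if not query_tokens:
        return 0.0
    doc_tokens = set(document.lower().split())
    return len(query_tokens & doc_tokens) / len(query_tokens)


def rerank(
    query: str,
    documents: Sequence[str],
    settings: RerankSettings,
    score_fn: Callable[[str, str], float] = token_overlap_score,
) -> List[str]:
    """Return the documents to pass on to answer generation."""
    if not settings.enabled:
        # Feature disabled: retrieved documents pass through unchanged.
        return list(documents)
    scored = [(score_fn(query, doc), doc) for doc in documents]
    # Highest score first; Python's sort is stable, so ties keep retrieval order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[: settings.top_n]]
```

With `enabled=False` the retrieved documents are returned untouched, which mirrors the default-off behaviour described above. Raising `top_n` trades a larger context for less aggressive filtering.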

Future Considerations

Looking ahead, there's potential to further refine the reranking functionality by:

Creating a dedicated reranker component: This would facilitate the incorporation of different reranker types, enhancing flexibility and customization for users.
Expanding reranker support: Exploring and integrating additional reranker methods beyond the SentenceTransformer cross-encoder could provide users with more options to tailor the reranking process to their specific requirements.

pabloogc

nice job, useful feature too 👏

imartinez

Nice contribution!

danielgallegovico

Neat contribution!

icsy7867 · 2024-05-02T00:33:03Z

Does llama-cpp-python need to be rebuilt for GPU? I would assume so. I've offloaded all of my GPU stuff, so I might have to update my pipeline if so @machatschek

machatschek · 2024-05-02T05:36:58Z

Does llama-cpp-python need to be rebuilt for GPU? I would assume so. I've offloaded all of my GPU stuff, so I might have to update my pipeline if so @machatschek

The SentenceTransformer reranker does not rely on llama-cpp-python. If you want to run the reranker on GPU, you need to install a GPU-enabled PyTorch build. SentenceTransformer will then use the GPU by default if one is available. If you want to override this behaviour, you can set the device parameter in SentenceTransformerRerank to "cpu".
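
As a rough sketch of the device choice described above (the helper name `pick_rerank_device` is hypothetical and not part of this PR), something like the following picks a device string that could then be passed as the device parameter. It only checks for CUDA, a simplification of what SentenceTransformer does on its own.

```python
from typing import Optional


def pick_rerank_device(override: Optional[str] = None) -> str:
    """Return a device string for the reranker.

    `override` lets the caller force a device such as "cpu";
    otherwise CUDA is used only when a GPU-enabled PyTorch can see a GPU.
    """
    if override is not None:
        return override
    try:
        import torch  # optional dependency; absent means CPU only
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


# Assumed usage, matching the device parameter mentioned above:
#   SentenceTransformerRerank(model=..., top_n=..., device=pick_rerank_device("cpu"))
```

Leaving the device unset keeps the default behaviour; the explicit "cpu" override is mainly useful when the GPU is reserved for the LLM.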

machatschek added 5 commits March 30, 2024 16:46

add SentenceTransformerRerank as optional node_postprocessor in ContextChatEngine

efb79a2

add RerankSettings to RagSettings

1f055e9

update settings.yaml

ba85ab7

add optional reranker dependencies

5d1d0d6

add docs for reranker

11b1aca

pabloogc previously approved these changes Apr 1, 2024

fix spelling error in docs.yml

9e870fe

machatschek dismissed pabloogc’s stale review via 9e870fe April 1, 2024 11:13

danielgallegovico self-requested a review April 1, 2024 14:56

imartinez approved these changes Apr 1, 2024

danielgallegovico approved these changes Apr 2, 2024

imartinez merged commit 83adc12 into zylon-ai:main Apr 2, 2024
7 of 8 checks passed

github-actions bot mentioned this pull request Apr 2, 2024

chore(main): release 0.5.0 #1708

Merged

mrepetto-certx pushed a commit to mrepetto-certx/privateGPT that referenced this pull request Apr 18, 2024

feat(RAG): Introduce SentenceTransformer Reranker (zylon-ai#1810)

5fa2f25
