-
Notifications
You must be signed in to change notification settings - Fork 5.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Issue]: QdrantRetrieveUserProxyAgent is missing support for text-embedding-ada-002 embedding model #1282
Comments
We are currently understaffed on the RAG front. Would you be willing to submit a PR to fix this issue? |
We've come up with a complete pull request for this issue using any general embedding function that returns a list of embeddings. We'll post our pull request in the next few hours. |
Is this implemented? |
We'll submit the draft pull request for review. |
I have also just tried to use ada-embedding by performing the vectorization of chunks outside of Autogen, using LlamaIndex. And then trying to query the populated QDrant database using the notebook example at https://github.com/microsoft/autogen/blob/main/notebook/agentchat_qdrant_RetrieveChat.ipynb However, the error message is that |
Close as no active response for a long time. Please reopen it or create a new issue if needed. |
Describe the issue
Issue Overview:
In this GitHub issue, the proposal for implementing QdrantRetrieveUserProxyAgent has been successfully executed. However, upon attempting to use the feature, it was discovered that the current implementation relies on the qdrant_client, which in turn depends on fastembedding. Consequently, only a specific set of models listed in SUPPORTED_EMBEDDING_MODELS are supported.
SUPPORTED_EMBEDDING_MODELS: Dict[str, Tuple[int, models.Distance]] = { "BAAI/bge-base-en": (768, models.Distance.COSINE), "sentence-transformers/all-MiniLM-L6-v2": (384, models.Distance.COSINE), "BAAI/bge-small-en": (384, models.Distance.COSINE), "BAAI/bge-small-en-v1.5": (384, models.Distance.COSINE), "BAAI/bge-base-en-v1.5": (768, models.Distance.COSINE), "intfloat/multilingual-e5-large": (1024, models.Distance.COSINE), }
.Enhancement Proposal:
It is suggested that the support for additional models be extended beyond the current list. A reference implementation, inspired by the approach taken in Issue 253, is provided below:
This adds dependencies on litellm but I think your contribution to this enhancement would greatly benefit the community by expanding the scope of supported models and enhancing the overall utility of QdrantRetrieveUserProxyAgent.
Steps to reproduce
No response
Screenshots and logs
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered: