RAG configs #4621
Replies: 3 comments 5 replies
-
Today I tried this on another PC, the outcome has been the same. Also, the file once added if I create a new chat or if I bring down the container and back up again, if I attach the same file again, the LLM do not get the context attached at all, even though you can see the file has been attached in the conversation. Could someone please help me in this, if I am missing anything? Thanks! |
Beta Was this translation helpful? Give feedback.
-
Do you have env var |
Beta Was this translation helpful? Give feedback.
-
What you are experiencing with deleted files still showing up in file manager in name only is that they are not "successfully" deleted. I would check your logs for rag_api delete http response should be 200 but if not then the file names and info don't get removed even if removed from database. |
Beta Was this translation helpful? Give feedback.
-
Here is my setup:
uncommented
EMBEDDINGS_PROVIDER=openai
EMBEDDINGS_MODEL=text-embedding-3-small
This is the setup and all the containers come up fine without any issues and I can get response from all OpenAI models and DeepSeek models.
Now, coming to file upload and embeddings.
Lobechat and embeddings:
I have been testing lobechat in parallel with Librechat for file uploads/embeddings with similar setup, OpenAI/DeepSeek.
When it comes to lobechat, I tried with a 5MB PDF file and upload was quick, chunking + embedding everything finished in less than 15-20 seconds. My system CPU barely went up in this case.
Librechat and embeddings (and issues and question):
I tried the exact same PDF file with librechat:
I tried a few more PDF's with both lobechat and librechat and steps 1-5 happened repeatedly. Lobechat would take few seconds, whereas librechat would take 10-15 mins per file. And the chat context of the PDF was available only in the chat when it was initially added. Anytime I referenced the files in another chat, LLM's had no clue about the details of the files.
I have used librechat previously (until the first time RAG was introduced, I remember file uploads was not this slow and I could add files to any number of chat contexts). Ya, IIRC we had to add config in rag.yml file back then? I am not sure what I am missing now?
Could someone please help me with the configs. Am I doing it correctly? Seems like I am missing something.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions