Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not for merge: Demo - Deep Search + InstructLab RAG integration #44

Closed
wants to merge 2 commits into from

Conversation

nerdalert
Copy link
Member

@nerdalert nerdalert commented Jul 8, 2024

Example 1 (Components: Deep Search RAG + InstructLab 4-bit quantized)

  • Perform a RAG query against a realtime collection of documents of data not included in the trained model.
  • This could also represent private organizational documents that need to stay on premise, air-gapped or any other data privacy policy since all components are local.
    -Select the collection corresponding to the topic or have a generic data collection for all topics depending on the desired data policy.
  • Without RAG, the model attempts to answer the question but its a hallucination since it is not trained on US Q1Y2024 economic data.
rag-demo1-economic-data.mp4

Example 2 (Components: Deep Search RAG + InstructLab 4-bit quantized)

  • Perform a RAG query against a collection of Annual Report filings.
  • The question is left generic to show the ability to query an entire corpus of documents vs drilling down into the collection and querying a specific document.
  • The first query will use the top result (chunk) and feed it back to InstructLab for answering which happens to be Disney's filing.
  • In the second query, we choose the exact document(s) for querying which happens to be Amazon's filing, thus returning information from that filing.
  • There are many patterns to optimize RAG with sending larger and/or more chunks but this is a simple example exhibiting the importance of managing data sourcing and documents.
rag-demo2-accounting-data.mp4

- See videos in the PR comments

Signed-off-by: Brent Salisbury <[email protected]>
@nerdalert nerdalert marked this pull request as draft July 8, 2024 03:06
@vishnoianil vishnoianil added the Demo PR that contains Demo related changes label Aug 22, 2024
@nerdalert
Copy link
Member Author

Closing since there is a newer demo PR with docling running as a standalone service.

@nerdalert nerdalert closed this Oct 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Demo PR that contains Demo related changes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants