In this work we run Microsoft Phi3 Mini (3.8B parameters) on the ONNX CPU runtime. We build a PDF Q/A system with nomic-embed-text-v1 as the embedding model and faiss as the vector DB.
- pre_processing.py: Contains code for parsing PDF file, creating Embedding and Vector DB.
- application.ipynb: Notebook that builds the PDF Q/A pipeline.
- app.py: Code for the Gradio application. The app is hosted on HF Space.
- LLM: Phi3 Mini
- Embedding Model: nomic-embed-text-v1
- Vector DB: faiss
- Application: Gradio
- Install libraries with `make install`
- Prepare Phi3 Mini with the ONNX CPU Runtime on Linux with `make phi3_dependency`
- To run the app, execute `python app.py`
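
As a rough illustration of the retrieval step that the files above are described as performing (split the parsed PDF text, embed the chunks, search for the closest ones, build a prompt), the sketch below uses only the Python standard library. It is not the repository's code: `toy_embed` is a hypothetical hashed bag-of-words stand-in for nomic-embed-text-v1, the brute-force `search` stands in for a faiss index, and `chunk_text`, `build_prompt`, the chunk sizes, and the prompt wording are all assumptions made for illustration.

```python
import hashlib
import math

DIM = 64  # toy vector size, chosen only for this sketch


def chunk_text(text, size=40, overlap=10):
    """Split text into chunks of `size` words that overlap by `overlap` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    if not words:
        return []
    step = size - overlap
    last_start = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, last_start, step)]


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: hashed bag of words, L2-normalised."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        word = word.strip(".,;:!?()")
        if not word:
            continue
        # md5 keeps the bucket stable across runs (built-in hash() is salted per process)
        bucket = int(hashlib.md5(word.encode("utf-8")).hexdigest(), 16) % DIM
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def search(query, chunks, vectors, k=2):
    """Brute-force inner-product search over normalised vectors (cosine similarity)."""
    q = toy_embed(query)
    scores = [sum(a * b for a, b in zip(q, v)) for v in vectors]
    ranked = sorted(range(len(chunks)), key=lambda i: scores[i], reverse=True)
    return [(chunks[i], scores[i]) for i in ranked[:k]]


def build_prompt(question, hits):
    """Join the retrieved chunks into a prompt; the wording is an assumption."""
    context = "\n\n".join(chunk for chunk, _ in hits)
    return (
        "Answer the question using only this context.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


if __name__ == "__main__":
    text = "faiss stores the chunk vectors. " * 30 + "gradio serves the web interface."
    chunks = chunk_text(text)
    vectors = [toy_embed(c) for c in chunks]
    hits = search("What serves the web interface?", chunks, vectors, k=1)
    print(build_prompt("What serves the web interface?", hits))
```

In the actual pipeline, the embedding and the nearest-neighbour search would be delegated to the embedding model and to faiss, and the resulting prompt would be passed to Phi3 Mini for the answer.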
- Microsoft for the open source quantized Phi3 Mini along with ONNX Runtime support.
- Hugging Face for all the educational and open source resources.