MojoQA is a RAG (Retrieval Augmented Generation) based LLM application that can answer queries about the Mojo programming language.
Introduced in 2023, Mojo is a new programming language that combines Python syntax with systems programming and metaprogramming features, aiming to bridge the gap between research and production.
Let's see what a Llama 2 7B model knows about the advantages of Mojo over Python!
As you can see, the response is neither accurate nor particularly helpful.
Now let's see how the MojoQA bot performs.
Now that looks better!!
To build the MojoQA bot, I extracted the official Mojo documentation and created a vector store of the corresponding embeddings. To answer a query, the most similar documents are retrieved and passed to the LLM as context. Check out the high-level overview diagram below.
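To make the flow concrete, here is a minimal sketch of the retrieve-then-generate step. The embedding model (sentence-transformers) and vector store (FAISS) are illustrative assumptions, not necessarily what this project uses internally:

```python
# Minimal retrieve-then-generate sketch. Library choices here
# (sentence-transformers, FAISS) are assumptions for illustration.
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

docs = [
    "Mojo supports compile-time metaprogramming.",
    "Mojo is designed to be a superset of Python syntax.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")
doc_vecs = embedder.encode(docs, normalize_embeddings=True)

# Inner product on normalized vectors == cosine similarity.
index = faiss.IndexFlatIP(doc_vecs.shape[1])
index.add(np.asarray(doc_vecs, dtype="float32"))

query = "What are the advantages of Mojo over Python?"
q_vec = embedder.encode([query], normalize_embeddings=True)
_, ids = index.search(np.asarray(q_vec, dtype="float32"), 2)

# Stitch the retrieved chunks into the prompt as context.
context = "\n".join(docs[i] for i in ids[0])
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}\nAnswer:"
# `prompt` is then passed to the LLM (see the model-loading sketch further below).
```

Grounding the prompt in retrieved documentation is what lets the bot answer from the Mojo docs rather than from the model's pretraining.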
pip install -r requirements.txt
Please refer to this link for installation instructions for llama-cpp with OpenBLAS / cuBLAS / CLBlast support. The dependency in the requirements.txt file is for a CPU-only installation.
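For example, at the time of writing a cuBLAS-enabled build of llama-cpp-python could be installed roughly as follows; verify the current flags against the llama-cpp-python documentation before using this:

CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python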
pip install -e .
For this project, I used a 4-bit quantized Llama 2 7B model (via llama-cpp). For better text generation results, prefer the chat variant of the model. Download the model and place it in the ./models directory of this project. You can use other models too.
Models used in this project:
4-bit quantized Llama-2-7B-Chat
To use a different model, update the model path parameters in the config/config.yaml and mojoqa/config/conf.py files.
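As a rough sketch, loading and querying the quantized model with llama-cpp-python looks like the following; the model filename is illustrative, so substitute the file you actually downloaded and keep the path in sync with config/config.yaml:

```python
from llama_cpp import Llama

# Illustrative filename; use the quantized model file you placed in ./models.
llm = Llama(model_path="./models/llama-2-7b-chat.Q4_K_M.gguf", n_ctx=2048)

response = llm(
    "Q: What are the advantages of Mojo over Python? A:",
    max_tokens=256,
    stop=["Q:"],  # stop before the model starts a new question
)
print(response["choices"][0]["text"])
```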
python ./scripts/main.py
streamlit run ./streamlit/app.py