LLM-based Voice Assistant

This is an AI Voice Assistant based on Large Language Models. A user can interact with the Voice Assistant in natural language, currently English.

The implementation brings various deep learning models together:

Large Language Model (GPT-4 or Alpaca, can be chosen)
Speech-To-Text Model (Wave2Vec2-Large)
Text-To-Speech Model (Microsoft SpeechT5)

The speech module is interfaced with the local microphone to create live transcription via a VAD Process. A transcription is sent to the chosen LLM for processing based on wake words.

Once the LLM generates a response, speech module also saves the audio file and generates a speech output using a TTS model.

The User Interface is built using Streamlit and provides a familiar Chat-like experience.

Demo

Installation

Install project dependencies

pip install -r requirements

If using GPT Models, create a .env file with environment variables for OPENAI_API_KEY and OPENAI_API_BASE.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.streamlit		.streamlit
speech_module		speech_module
.gitignore		.gitignore
README.md		README.md
brains.py		brains.py
demo.png		demo.png
interface.py		interface.py
requirements.txt		requirements.txt
session_manager.py		session_manager.py
system_prompt.txt		system_prompt.txt
wake_words.txt		wake_words.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM-based Voice Assistant

Demo

Installation

About

Releases

Packages

Languages

avsrma/LLM-based-AI-Assistant

Folders and files

Latest commit

History

Repository files navigation

LLM-based Voice Assistant

Demo

Installation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages