GitHub - NeverBeLazyG/AiVoicer: AiVoicer - AI-Powered Voice Transcription and Text Processing Tool

Overview

AiVoicer is a powerful desktop application that provides seamless voice transcription and AI-powered text processing. With customizable hotkeys, you can quickly transcribe audio or process text using OpenAI's advanced language models. Even the UI is German it works in any Language. English UI will come in next release.

Features

Voice recording with global hotkey (can be changed in Settings)
AI-powered transcription using Whisper (local Modell with CPU or GPU) in every Language
Text processing capabilities:
- Error correction
- Rephrasing
- Translation (to English)
- Summarization
Systray integration
Customizable hotkeys

Prerequisites

Python 3.8+
pip (Python package manager)
CUDA-compatible GPU recommended (for faster processing)

Installation and Running

Automatic Setup (Windows)

Double-click start_aivoicer.bat
- This script will:
  - Create a virtual environment
  - Install dependencies
  - Launch the application

Manual Setup

Clone the repository:

git clone https://github.com/NeverBeLazyG/AiVoicer.git
cd AiVoicer

Create a virtual environment:

python -m venv .venv
.venv\Scripts\activate  # On Windows

Install dependencies: bash

pip install -r requirements.txt

Run the application: bash

python aivoicer.py

Configuration

First-time setup requires an OpenAI API key
Go to application settings to configure (right click in Tray Icon):
- OpenAI API key
- Hotkeys

Hotkeys

Transcription Mode: Default Ctrl+Shift+F9
Text Processing Mode: Default Ctrl+Shift+F10

Dependencies

CustomTkinter for UI
OpenAI (GPT-4o-mini, can be changed) for text processing
Whisper for transcription
SoundDevice for audio recording

Troubleshooting

Ensure you have a valid OpenAI API key
Check that your system meets the audio recording requirements
For GPU acceleration, ensure CUDA is properly installed

License

This project is licensed under the Apache License 2.0. You may use, distribute, and modify the code under the terms of this license. See the LICENSE file for full details.

Contributing

Contributions are welcome! Please submit pull requests or open issues.

Disclaimer

This tool requires an active internet connection and an OpenAI API key for full functionality.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
aivoicer.py		aivoicer.py
aivoicer_console.bat		aivoicer_console.bat
icon-rec.ico		icon-rec.ico
icon.ico		icon.ico
requirements.txt		requirements.txt
start_aivoicer.bat		start_aivoicer.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Features

Prerequisites

Installation and Running

Automatic Setup (Windows)

Manual Setup

Configuration

Hotkeys

Dependencies

Troubleshooting

License

Contributing

Disclaimer

About

Releases

Packages

Languages

License

NeverBeLazyG/AiVoicer

Folders and files

Latest commit

History

Repository files navigation

Overview

Features

Prerequisites

Installation and Running

Automatic Setup (Windows)

Manual Setup

Configuration

Hotkeys

Dependencies

Troubleshooting

License

Contributing

Disclaimer

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages