AiVoicer is a powerful desktop application that provides seamless voice transcription and AI-powered text processing. With customizable hotkeys, you can quickly transcribe audio or process text using OpenAI's advanced language models. Even the UI is German it works in any Language. English UI will come in next release.
- Voice recording with global hotkey (can be changed in Settings)
- AI-powered transcription using Whisper (local Modell with CPU or GPU) in every Language
- Text processing capabilities:
- Error correction
- Rephrasing
- Translation (to English)
- Summarization
- Systray integration
- Customizable hotkeys
- Python 3.8+
- pip (Python package manager)
- CUDA-compatible GPU recommended (for faster processing)
- Double-click start_aivoicer.bat
- This script will:
- Create a virtual environment
- Install dependencies
- Launch the application
- This script will:
- Clone the repository:
git clone https://github.com/NeverBeLazyG/AiVoicer.git
cd AiVoicer
- Create a virtual environment:
python -m venv .venv
.venv\Scripts\activate # On Windows
- Install dependencies: bash
pip install -r requirements.txt
- Run the application: bash
python aivoicer.py
- First-time setup requires an OpenAI API key
- Go to application settings to configure (right click in Tray Icon):
- OpenAI API key
- Hotkeys
- Transcription Mode: Default Ctrl+Shift+F9
- Text Processing Mode: Default Ctrl+Shift+F10
- CustomTkinter for UI
- OpenAI (GPT-4o-mini, can be changed) for text processing
- Whisper for transcription
- SoundDevice for audio recording
- Ensure you have a valid OpenAI API key
- Check that your system meets the audio recording requirements
- For GPU acceleration, ensure CUDA is properly installed
This project is licensed under the Apache License 2.0. You may use, distribute, and modify the code under the terms of this license. See the LICENSE file for full details.
Contributions are welcome! Please submit pull requests or open issues.
This tool requires an active internet connection and an OpenAI API key for full functionality.