This repository contains the files needed to generate the voice packages used in EdgeTX.
The currently supported languages are:
- Czech
- German
- English
- Spanish
- French
- Italian
- Portuguese
- Russian
- Chinese Mandarin
- Chilean Spanish
The following languages are not yet supported but are under development:
- Hungarian
- Dutch
- Swedish
- Slovak
This folder has the audio files already processed and separated by language.
To use them, the language folder (for example, en
) must be under the SOUNDS
folder of your SDCARD
. With the folder added, go to the EdgeTX settings menu and select the language of the audio language that will be used (eg English).
To use any audio on your switches, first copy the file you want to use to your language folder, then you can use this file in your Global Functions
or Special Functions
by selecting a switch for the function and choosing the Play track
option.
All of the voices used in the EdgeTX voice packs have been picked from the neural voices offered by Microsft Azure text to speech service, in order to get as close as possible to humanlike voices. If you want to see what voices are available, and try different phrases, check out the online demo generator. Using some recording software, you could even save your own phrases and use them in the voice packs.
In order to generate the voice packages and do the release processing, you will need a Linux environment to run in. Ubuntu 18.04 is recommended as it is a LTS release. Newer versions and other flavours of Linux will most likely work also, but are not supported.
You will also need to have ffmpeg
, spx
and ffmpeg-normalize
packages installed.
ffmpeg
is used to clip any silence from the audio files. ffmpeg-normalise
is used to normalise the audio files.
spx
is the tool that generates the audio files using Microsoft Azure Text to Speech processing.
Installing SPX can be a little tricky, but can be installed as follows:
wget https://packages.microsoft.com/config/ubuntu/20.04/packages-microsoft-prod.deb -O packages-microsoft-prod.deb
sudo dpkg -i packages-microsoft-prod.deb
sudo apt-get update; \
sudo apt-get install -y apt-transport-https && \
sudo apt-get update && \
sudo apt-get install -y dotnet-sdk-3.1
dotnet tool install --global Microsoft.CognitiveServices.Speech.CLI
After you have installed SPX, you will also need to create a Microsoft Azure account if you don't have one already. There are both free and paid options, but the free one is sufficient for this purpose - it is just rate limited. After you have done that, follow the quick start guide to configure the required region and subscription keys.
- Mike has created a python script that can be used to generate the audio using Googles Text to Speech service - https://github.com/xsnoopy/edgetx-sdcard-sounds
- The OpenTX Speaker voice generator (Windows only) uses the built in text to speech engine of Microsoft Windows, andcan be used to generate new audio also. https://www.open-tx.org/2014/03/15/opentx-speaker
See CONTRIBUTING.md