sd_piper: add module for piper speech synthesis #998

samoverton · 2025-01-24T17:08:31Z

I have been working on a dedicated piper module (for #866). I just pulled from master and noticed that the latest commit (bec5519) alludes to work in progress on a cxxpiper module, so I wanted to share my work asap to ensure that I wasn't stepping on any toes and see if you would like to collaborate.

The module makes use of the user's installation of the piper binary instead of linking in the piper codebase directly. This means that each speak request forks a child process with the appropriate arguments. Communication with the child is done over pipes. Server audio is used so speechd handles output to the configured audio device.

The module supports usual speechd configuration and runtime parameters:

AddVoice directives for configuring the voice models to use
rate- setting of voice speed (between 0.5x and 2x)
TEXT, CHAR, SPELL message types
STOP and PAUSE events

As well as some piper specific configuration:

Voice sample rate is read from model manifest json file
sentence_silence - seconds of silence after each sentence
noise_scale - generator noise
noise_w - phoneme width noise
audio buffer size (ms)

This changeset is based on master, but I originally did this work on the 0.11 branch, so it can be easily backported if required.

samoverton · 2025-01-24T21:09:15Z

I just came across #996 and I see that it takes a different approach, but I will leave this work here in case it is useful to anyone, or in the event that bringing piper code into the speech-dispatcher codebase turns out not to be viable.

azakharchenko-msol · 2025-01-25T15:24:13Z

@samoverton Great work 👍 works great and solves #999
Not exactly understand the logic of const char* piper_get_voice(SPDMsgSettings* p_settings)
the spd-say works only if I pass -t female1 otherwise it prints "no voice found" in log.
Is the default voice should be set somewhere?

samoverton · 2025-01-25T20:00:28Z

the spd-say works only if I pass -t female1 otherwise it prints "no voice found" in log. Is the default voice should be set somewhere?

You can set the default voice for the module in the piper.conf:

DefaultVoice "en_GB-alba-medium.onnx"

Or you can set the default language and voice-type in speechd.conf:

DefaultVoiceType "female1"
DefaultLanguage "en-GB"

You're right that there is some weird behavior here that's worth looking at though. Did you have any of these defaults set in speechd.conf?

azakharchenko-msol · 2025-01-26T06:13:22Z

@samoverton Thank you, I missed DefaultVoiceType "female1" in my speechd.conf

samoverton added 2 commits January 24, 2025 16:22

sd_piper: add module for piper speech synthesis

3f3a952

sd_piper: add license header to piper.c

a47dd28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sd_piper: add module for piper speech synthesis #998

sd_piper: add module for piper speech synthesis #998

samoverton commented Jan 24, 2025

samoverton commented Jan 24, 2025

azakharchenko-msol commented Jan 25, 2025

samoverton commented Jan 25, 2025

azakharchenko-msol commented Jan 26, 2025

sd_piper: add module for piper speech synthesis #998

Are you sure you want to change the base?

sd_piper: add module for piper speech synthesis #998

Conversation

samoverton commented Jan 24, 2025

samoverton commented Jan 24, 2025

azakharchenko-msol commented Jan 25, 2025

samoverton commented Jan 25, 2025

azakharchenko-msol commented Jan 26, 2025