-
-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documentation enhancement #55
Comments
This is an enhancement, not a fix. What was documented was a known reliable mechanism, until someone like yourself pointed out alternatives ;) And in point of fact, since the documentation is about how to generate a release, well, actually, yes, you do need to be on Linux/WSL2, as that is the only supported OS for the scripts as written. However, they are were changed to python as part of an effort to make things platform agnostic (it was all shell scripts before). re: |
I didn't say this implementation is not good or doesn't follow best practices. It is very well written. |
I honestly don't know the difference... I would expect it is mostly just a wrapper to make things more object/python like. You might also like the stuff that doesn't need special functions - like the model names, and switch audio based simply on a file being in the right folder... Can make life much easier if you don't change the switch functions around on the field, since you don't even need to add SF lines for the switch audio then. https://doc.open-tx.org/manual-for-opentx-2-2/advanced-features/audio#model-folder-sound-files |
Please add a paragraph for custom phrases.
For custom phrases there is no need for
ubuntu
or different installations. Phrases can be generated withcurl
orhttp
client likepostman
.After building a text to speech resource in
Azure
you can use it byREST
calls (http
requests).The request url is:
https://<YOUR_RESOURCE_REGION>.tts.speech.microsoft.com/cognitiveservices/v1
You should add the following headers to your request: (EdgeTX supposed to support up to 32khz
.wav
file but in that range 8khz is the highest value supported by the conversion service. It's possible to select higher quality likeriff-48khz-16bit-mono-pcm
and convert to 32khz afterwards with another tool likeffmpeg
).And in the request body (raw) place your
ssml
(Change the voice name according to your preference, the full list is:tts.speech.microsoft.com/cognitiveservices/voices/list
):Generally speaking, the implementation in
voice-gen.py
can be a series ofhttp
requests without the proxy objects. I don't see any benefit if they require ton of installations, but I might be wrong here. I'll be happy to hear if they contribute somehow.It seems that it can be an
os
independent implementation.The text was updated successfully, but these errors were encountered: