[Feature request] document multi-speaker models #4026

surak · 2024-10-15T21:32:23Z

🚀 Feature Description

This is a request for improving the documentation. On the readme, you have a

List the available speakers and choose a <speaker_id> among them:
$ tts --model_name "<language>/<dataset>/<model_name>" --list_speaker_idxs

But you don't mention a model whatsoever, leaving the user to download all of the almost hundred models to figure which one actually does that.

Solution

Well, one example would be great - it's maybe obvious for people from the field; but it isn't for others.

Alternative Solutions

A partial download which would be enough to query every single model without downloading the whole set of weights, just metadata.

The text was updated successfully, but these errors were encountered:

Kreevoz · 2024-10-18T20:11:38Z

There aren't even that many models for a given language. The ones that have multispeaker capabilities would be using a multi-speaker dataset like vctk, which is listed when you query the available models. The ones based on ljspeech are all single-speaker models, since that is a single female speaker dataset.

eginhard · 2024-10-21T11:58:12Z

In theory such information could be added to the .models.json file, so that it can be accessed without downloading a model. I would consider a PR adding that information and exposing it in the API.

But I agree. Most languages only have very few models and even fewer datasets, so that currently it doesn't take a lot of effort to find out manually.

stale · 2024-12-08T09:24:22Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

surak added the feature request feature requests for making TTS better. label Oct 15, 2024

stale bot added the wontfix This will not be worked on but feel free to help. label Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature request] document multi-speaker models #4026

[Feature request] document multi-speaker models #4026

surak commented Oct 15, 2024

Kreevoz commented Oct 18, 2024

eginhard commented Oct 21, 2024

stale bot commented Dec 8, 2024

[Feature request] document multi-speaker models #4026

[Feature request] document multi-speaker models #4026

Comments

surak commented Oct 15, 2024

Kreevoz commented Oct 18, 2024

eginhard commented Oct 21, 2024

stale bot commented Dec 8, 2024