Skip to content

A step back to reconsider downloading 1.86GB model. Is this huge download necessary? #218

Answered by KoljaB
rajibando asked this question in Q&A
Discussion options

You must be logged in to vote

Thank you for sharing the details! The 1.86GB download is for the main Coqui TTS model (XTTS v2.0.2). For a modern TTS model that is supposed to fully infer locally, this is actually considered small. Many AI models require significant storage space to deliver high-quality results. For example, OpenAI's Whisper large-v2 model for ASR also involves downloading several gigabytes.

These large sizes are due to the neural network weights and configurations needed for the model to function effectively. High-quality TTS models rely on this data to produce natural, accurate speech synthesis. While 1.86GB may seem substantial, it is standard for AI models in this field.

To give you additional cont…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@rajibando
Comment options

Answer selected by rajibando
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants