Text-to-Speech Processing

This repository contains a set of Python scripts for processing text-to-speech (TTS) tasks, including splitting sentences, generating silence, and creating subtitles.

It is meant for long-form text, for example converting an e-book into a realistic-sounding audiobook with matching subtitles.
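To make the three tasks above concrete, here is a minimal sketch of sentence splitting and subtitle timing. It is not the code used in this repository: the function names (`split_sentences`, `srt_timestamp`, `build_srt`), the regex-based splitting rule, and the fixed `pause` between sentences are all assumptions chosen for illustration.

```python
import re


def split_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace.

    A naive heuristic (assumption): abbreviations such as "Dr." will
    also trigger a split.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(sentences, durations, pause=0.5):
    """Build SRT text from sentences and their audio durations in seconds.

    `pause` is the hypothetical length of silence inserted between
    sentences; each subtitle starts after the previous one plus the pause.
    """
    entries = []
    start = 0.0
    for index, (sentence, duration) in enumerate(zip(sentences, durations), start=1):
        end = start + duration
        entries.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{sentence}\n"
        )
        start = end + pause
    return "\n".join(entries)
```

In a real pipeline the durations would come from the length of each synthesized audio clip; here they are plain numbers passed in by the caller.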

Installation

Clone the repository:

git clone https://github.com/dahvo/GPT-TTS.git
cd GPT-TTS

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Synthesize Speech

Create a new file named main.py in the root directory of the repository:

```python
from do_tts import text_to_tts

# Read the text to convert. Replace text_example.txt with your own file.
with open("text_example.txt", "r", encoding="utf-8") as file:
    input_text = file.read()

# Choose a speaker: one of the bundled examples, or your own speaker
# embedding created with clone_a_speaker.py.
speaker = "audiobook_lady"

# Base name of the output audio and .srt files.
output_name = "test_output"

text_to_tts(input_text, speaker, output_name)
```

The conversion is set up to handle text of any length, so even an entire book can be converted.
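If a book is stored as one text file per chapter, the same call can be looped over a folder. The sketch below is an illustration only: `convert_folder` is a hypothetical helper, not part of this repository, and it assumes the callable passed as `tts` accepts the same three arguments as `text_to_tts(input_text, speaker, output_name)` shown above.

```python
from pathlib import Path


def convert_folder(folder, speaker, tts):
    """Call tts(text, speaker, output_name) for every .txt file in folder.

    Files are processed in sorted order and each output is named after
    its source file (without the extension). Returns the output names.
    """
    outputs = []
    for path in sorted(Path(folder).glob("*.txt")):
        text = path.read_text(encoding="utf-8")
        tts(text, speaker, path.stem)
        outputs.append(path.stem)
    return outputs


# Example use (assumes a "chapters" folder exists next to main.py):
# from do_tts import text_to_tts
# convert_folder("chapters", "audiobook_lady", text_to_tts)
```

Passing the TTS function as an argument keeps the helper independent of the repository's internals, so it can be tried out with a stand-in function first.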
