EST_lang_corpora_preprocessing

Introduction

This is the repository for the Bachelor's degree project of preprocessing Estonian speech corpora. In specific, the work focuses on removing leading and trailing silences and trimming of long pauses in the middle. The effect of the silence removal was later graded on speech synthesis models.

2 methods were created for this purpose, which can be found in the Jupyter Notebook SilenceRemovalMethods.ipynb. One of the created methods uses RMSE and ZCR based features to spot and remove silences. The other method uses an acoustic model based text alligner - Autosegmenteerija 2.0 - the work of Tanel Alumäe, Ottokar Tilk and Asadullah. In order to use the Autosegmenteerija to create the needed TextGrids for silence removal, the Jupyter Notebook AutosegmentedTextGridCreator.ipynb was created. The notebook does not come with Autosegmenteerija itself, so having it be running somewhere is a prerequisite.

Both the results for the preprocessing methods and for the models can be seen in Grading.xlsx

Requirements for running the Notebooks

SilenceRemovalMethods

Python >= 3.7.9
Librosa >= 0.8.0
Jupyter >= 1.0.0
Pandas >= 1.1.4
Numpy >= 1.19.2
Scipy >= 1.6.0
Matplotlib >= 3.3.2
EstNLTK >= 1.6.7beta (for transcript preprocessing, along with the preprocessor itself)
textgrid

AutosegmentedTextGridCreator

Python >= 3.6
Pandas >= 1.1.3
Jupyter >= 1.0.0

Models

3 models were created for the purpose of this work using the Deep Voice 3 adaptation for Estonian. For instructions on how to use the models, please refer to the previous link.
The models can be downloaded from here:

Example soundfiles

Sadly Github does not allow embedded audio in a readme, so to listen to the audio files, please download the zip of the audio files
Alternatively, each audio file can be downloaded seperately from the audio example folder
The zip/folder contains both preprocessed and synthesized audio examples, along with the original (not preprocessed) audio files

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
ExampleAudioFiles		ExampleAudioFiles
AutosegmentedTextGridCreator.ipynb		AutosegmentedTextGridCreator.ipynb
ExampleAudioFiles.zip		ExampleAudioFiles.zip
Grading.xlsx		Grading.xlsx
README.md		README.md
SilenceRemovalMethods.ipynb		SilenceRemovalMethods.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EST_lang_corpora_preprocessing

Introduction

Requirements for running the Notebooks

SilenceRemovalMethods

AutosegmentedTextGridCreator

Models

Example soundfiles

About

Releases

Packages

Languages

AndreasTeder/EST_lang_corpora_preprocessing

Folders and files

Latest commit

History

Repository files navigation

EST_lang_corpora_preprocessing

Introduction

Requirements for running the Notebooks

SilenceRemovalMethods

AutosegmentedTextGridCreator

Models

Example soundfiles

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages