speaker_recognition_GMM_UBM

A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schizophrenia.

Installing dependencies

To install all the dependencies for this project, run the following command,

pip3 install -r requirements.txt

Extracing MFCC from audio

To extract MFCC coefficients from audio samples, put all the audio files in a seperate folder and run the following command,

python3 src/speaker_recognition/extract_mfcc_coefficients.py
--audio_folder <path to the folder which contains audio>
--csv_file_name <name of the csv file that will be created>
--opt combined

Creating Universal Background Model

To run UBM training run the following code,

python3 src/speaker_recognition/speaker_recognition.py 
--csv_file <path to MFCC coefficients file> 
--operation ubm

Map adaptation using the created GMM-UBM model

To run MAP adaptation,

python3 src/speaker_recognition/speaker_recognition.py 
--csv_file <path to MFCC coefficients file> 
--operation map 
--ubm_file <path to the ubm file created after GMM-UBM model creation>

For testing the map adapted model,

python3 src/speaker_recognition/testing_model.py
--map_file_name <path to map adapted .npy file>
--ubm_file_name <path to ubm .npy file>
--test_csv_file <path to the csv file of test speaker>
--N 1500

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
numpy_files		numpy_files
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
results_for_speaker_recognition.ipynb		results_for_speaker_recognition.ipynb
testing_hypothesis.ipynb		testing_hypothesis.ipynb
visualizing_speech_features.ipynb		visualizing_speech_features.ipynb
voice_activity_detection.ipynb		voice_activity_detection.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speaker_recognition_GMM_UBM

Installing dependencies

Extracing MFCC from audio

Creating Universal Background Model

Map adaptation using the created GMM-UBM model

About

Releases

Packages

Languages

scelesticsiva/speaker_recognition_GMM_UBM

Folders and files

Latest commit

History

Repository files navigation

speaker_recognition_GMM_UBM

Installing dependencies

Extracing MFCC from audio

Creating Universal Background Model

Map adaptation using the created GMM-UBM model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages