Generalized End-To-End Loss For Speaker Verification

This is my attempt at implementing the paper "Generalized End-To-End Loss For Speaker Verification".

I'm still getting started with ML/Python/Docker, but I figured a good way to get up to speed would be to start implementing papers that seemed interesting to me.

To get started locally with Docker, first create a volume to store data and models in:

docker volume create gen-e2e-sv

To build the image, cd to the root of this repository, and run:

docker build . -t d18n/gen-e2e-sv

Then run the following command, which will spin up the container and start a Jupyter server on port 8888:

docker run -it --gpus all -p 8888:8888 -v gen-e2e-sv:/workspace/ d18n/gen-e2e-sv

Disclaimer: Much of this code was adapted from reading the paper and then using https://github.com/CorentinJ/Real-Time-Voice-Cloning as a reference. I can't take much credit for this implementation, but I hope to iterate on it at least a little bit.
