Project: DNN Speech Recognizer

In this notebook, you will build a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline!

We begin by investigating the LibriSpeech dataset that will be used to train and evaluate your models. Your algorithm will first convert any raw audio to feature representations that are commonly used for ASR. You will then move on to building neural networks that can map these audio features to transcribed text. After learning about the basic types of layers that are often used for deep learning-based approaches to ASR, you will engage in your own investigations by creating and testing your own state-of-the-art models. Throughout the notebook, we provide recommended research papers for additional reading and links to GitHub repositories with interesting implementations.

Tasks

The tasks for this project are outlined in the vui_notebook.ipynb in three steps. Follow all the instructions, which include implementing code in sample_models.py, answering questions, and providing results. The following list is a summary of the required tasks.

Step 1 - Feature Extraction

Execute all code cells to extract features from raw audio

Step 2 - Acoustic Model

Implement the code for Models 1, 2, 3, and 4 in sample_models.py
Train Models 0, 1, 2, 3, 4 in the notebook
Execute the comparison code in the notebook
Answer Question 1 in the notebook regarding the comparison
Implement the code for the Final Model in sample_models.py
Train the Final Model in the notebook
Answer Question 2 in the notebook regarding your final model

Step 3 - Decoder

Execute the prediction code in the notebook

Written with StackEdit.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
__pycache__		__pycache__
images		images
results		results
DNN Speech Recognizer.md		DNN Speech Recognizer.md
LICENSE		LICENSE
README.md		README.md
char_map.py		char_map.py
data_generator.py		data_generator.py
sample_models.py		sample_models.py
train_corpus.json		train_corpus.json
train_utils.py		train_utils.py
utils.py		utils.py
valid_corpus.json		valid_corpus.json
vui_notebook-zh.ipynb		vui_notebook-zh.ipynb
vui_notebook.ipynb		vui_notebook.ipynb
workspace-utils.py		workspace-utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: DNN Speech Recognizer

Tasks

Step 1 - Feature Extraction

Step 2 - Acoustic Model

Step 3 - Decoder

About

Releases

Packages

Languages

License

AbdullahMu/DNN_Speech_Recognizer

Folders and files

Latest commit

History

Repository files navigation

Project: DNN Speech Recognizer

Tasks

Step 1 - Feature Extraction

Step 2 - Acoustic Model

Step 3 - Decoder

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages