FingerprintDNN

Fast pitch tracking for musical instruments using a Keras MLP trained on the NSynth Dataset (Engel et. al)

NSynth Dataset: https://magenta.tensorflow.org/datasets/nsynth

Audio fingerprinting is an algorithm typically used for quick database matching of audio files, and is the primary method used by Shazam for song recognition. This implementation does not make use of the hashing aspect of fingerprinting, and instead uses the raw binarized spectrogram data to train the network. This project experiments with these forms of simplified spectrograms in order to explore the validity of using this highly effective and ubiquitous processing method for deep learning.

Fingerprint extraction based on DejaVu audio fingerprinting in python: https://github.com/worldveil/dejavu

Jesse Engel, Cinjon Resnick, Adam Roberts, Sander Dieleman, Douglas Eck, Karen Simonyan, and Mohammad Norouzi. "Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders." 2017.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
__pycache__		__pycache__
sample		sample
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
extract_fingerprint.py		extract_fingerprint.py
fingerprint.py		fingerprint.py
generate_fingerprints.py		generate_fingerprints.py
nsynth_batch_loader.py		nsynth_batch_loader.py
nsynth_preprocessing.py		nsynth_preprocessing.py
prepare_nsynth_fingerprints.py		prepare_nsynth_fingerprints.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FingerprintDNN

About

Releases

Packages

Languages

carlmoore256/FingerprintDNN

Folders and files

Latest commit

History

Repository files navigation

FingerprintDNN

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages