CNN for sound classification

Note: Uncomment the MFCC extraction block to work with your own sounds. Otherwise, use the provided sample dataset, dataset.npy.

About dataset.npy

Cepstral coefficients of shape (178, 44, 13), where 178 is the number of audio clips, 44 is the number of frames (time steps) per clip, and 13 is the number of coefficients per frame.
Labels of shape (178,), one per audio clip.
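As a rough illustration of how arrays with these shapes are typically prepared for a CNN, the sketch below adds a trailing channel axis. The internal layout of dataset.npy is not described here, so the sketch uses zero-filled placeholder arrays instead of loading the file; the names features, labels and cnn_input, and the label dtype, are assumptions made for illustration.

```python
import numpy as np

# Placeholder arrays with the shapes described above.
# Assumption: the real values would come from dataset.npy.
features = np.zeros((178, 44, 13), dtype=np.float32)
labels = np.zeros((178,), dtype=np.int64)

# 2D convolution layers usually expect a trailing channel axis,
# so each 44x13 MFCC "image" becomes 44x13x1.
cnn_input = features[..., np.newaxis]

print(cnn_input.shape)
print(labels.shape)
```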

Using Librosa for Feature Extraction

Generates a 44x13 2D image (the MFCC matrix) for each sound signal, along with a target column (the label).

REVIEW THE GRAPHS BELOW

Quick guide

For each type of sound, create a folder inside the audio/ directory.
For an example of the expected layout, explore the audio folder in this repository, which contains one sample audio file.
Once the folders are in place, put your audio files in them
and place this script in the same location it occupies in this repository.
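The folder-per-class layout above can be turned into (file, label) pairs by using each folder name as the label. The following is only a sketch under that assumption; list_labelled_files is a hypothetical helper, not a function from this repository.

```python
from pathlib import Path


def list_labelled_files(audio_root):
    """Return (file_path, label) pairs, using each sub-folder name as the label."""
    pairs = []
    for class_dir in sorted(Path(audio_root).iterdir()):
        if not class_dir.is_dir():
            continue  # skip loose files at the top level
        for audio_file in sorted(class_dir.iterdir()):
            if audio_file.is_file():
                pairs.append((audio_file, class_dir.name))
    return pairs
```

Calling list_labelled_files("audio") would then yield one pair per audio file, ready to be passed to the feature extraction step.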

The signal goes through the following sequence of transformations before the MFCCs are generated:

Waveform

Fourier Transform

Power Spectrum

Spectrogram

Log Spectrogram

MFCCs (Mel Frequency Cepstral Coefficients)
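The stages up to the log spectrogram can be sketched with NumPy alone. This is an illustrative approximation, not the code used in the notebook: the symmetric Hann window, the reflect padding, the small constant inside the logarithm and the function name log_power_spectrogram are all assumptions. The final MFCC stage additionally needs a mel filter bank and a discrete cosine transform, which librosa.feature.mfcc takes care of.

```python
import numpy as np

N_FFT = 2048      # FFT window size
HOP_LENGTH = 512  # step between consecutive frames


def log_power_spectrogram(signal, n_fft=N_FFT, hop_length=HOP_LENGTH):
    """Waveform -> framed FFT -> power spectrum -> log spectrogram (dB)."""
    # Pad half a window on each side so frames are centred on their hop positions.
    padded = np.pad(signal, n_fft // 2, mode="reflect")
    n_frames = 1 + (len(padded) - n_fft) // hop_length
    window = np.hanning(n_fft)

    # Slice the waveform into overlapping, windowed frames.
    frames = np.stack([
        padded[i * hop_length : i * hop_length + n_fft] * window
        for i in range(n_frames)
    ])
    spectrum = np.fft.rfft(frames, axis=1)       # Fourier transform of each frame
    power = np.abs(spectrum) ** 2                # power spectrum of each frame
    log_power = 10.0 * np.log10(power + 1e-10)   # log spectrogram in decibels
    # Transpose so rows are frequency bins and columns are frames.
    return power.T, log_power.T


# Synthetic one-second, 440 Hz tone used only as a stand-in waveform.
sr = 22050
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 440.0 * t)
power, log_power = log_power_spectrogram(tone)
print(power.shape, log_power.shape)
```

Stacking the per-frame power spectra over time is what gives the spectrogram; taking the logarithm compresses its dynamic range before the cepstral step.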

Key points

This implementation rejects audio signals with a sample rate lower than 22050 Hz.
The number of MFCCs extracted per frame is 13.
The hop length across the signal is 512 samples.
The FFT window size (number of samples per fast Fourier transform) is 2048.
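A minimal sketch of how these settings could be passed to Librosa is shown below. It is not the notebook's actual code: extract_mfcc and expected_frames are hypothetical helpers, and keeping the file's native sample rate (sr=None) is an assumption. The frame-count helper shows that, with centred framing, 22050 samples (one second at 22050 Hz) give 44 frames, which is consistent with the 44x13 shape mentioned earlier.

```python
MIN_SAMPLE_RATE = 22050  # signals below this rate are rejected
N_MFCC = 13              # coefficients per frame
HOP_LENGTH = 512         # samples between consecutive frames
N_FFT = 2048             # samples per FFT window


def expected_frames(n_samples, hop_length=HOP_LENGTH):
    """Frame count for centred framing: one frame per full hop, plus one."""
    return 1 + n_samples // hop_length


def extract_mfcc(path):
    """Return an (n_frames, 13) MFCC array, or None if the sample rate is too low."""
    import librosa  # imported lazily so expected_frames works without librosa

    # sr=None keeps the file's native sample rate (an assumption of this sketch).
    signal, sample_rate = librosa.load(path, sr=None)
    if sample_rate < MIN_SAMPLE_RATE:
        return None  # reject low-sample-rate audio
    mfcc = librosa.feature.mfcc(
        y=signal, sr=sample_rate, n_mfcc=N_MFCC, n_fft=N_FFT, hop_length=HOP_LENGTH
    )
    # librosa returns (n_mfcc, n_frames); transpose to (n_frames, n_mfcc).
    return mfcc.T


print(expected_frames(22050))
```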
