# Audio-classification-with-transfer-learning

This project classifies audio recordings into two speaking modes, 'Normal' and 'Reading'. The goal was to build a prediction tool that labels a given recording as either normal or reading speech. Inspired by prior work on similar sound-classification tasks, we built a system that extracts spectral features such as pitch, fundamental frequency, and first- and second-order frequency statistics, and trains a neural network on top of a pre-trained VGG19 model (transfer learning).
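The model code is not reproduced here, but the transfer-learning idea is standard: feed spectrogram-style features into a frozen VGG19 backbone and train a small classification head on top. Below is a minimal Keras sketch, assuming 224x224x3 spectrogram images and a binary normal/reading label; the dataset objects, layer sizes, and hyperparameters are placeholders, not this repository's actual configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG19

# Load VGG19 pre-trained on ImageNet, without its classification head.
# Spectrogram "images" of shape (224, 224, 3) are an assumption here.
base = VGG19(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False  # freeze the convolutional backbone for transfer learning

# Small classification head for the two classes: normal vs. reading.
model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.3),
    layers.Dense(1, activation="sigmoid"),  # binary output
])

model.compile(optimizer="adam",
              loss="binary_crossentropy",
              metrics=["accuracy"])

# train_ds / val_ds are assumed tf.data.Dataset objects yielding
# (spectrogram_image, label) pairs prepared elsewhere in the pipeline.
# model.fit(train_ds, validation_data=val_ds, epochs=10)
```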

# MFCC Features

MFCCs are widely used in singing voice detection (SVD) and were first introduced by Davis and Mermelstein in 1980. They have proven to be a powerful tool in music and voice recognition, and in sound recognition more generally (see https://encyclopedia.pub/entry/18717).

The MFCCs are calculated as follows (see the sketch after this list):

  1. Divide the speech signal into short frames, usually by applying a windowing function at fixed intervals;
  2. Compute the discrete Fourier transform of each windowed frame to convert the signal from the time domain to the frequency domain;
  3. Map the power spectrum onto the mel scale with a bank of overlapping triangular filters, which smooths the spectrum and emphasizes perceptually meaningful frequencies;
  4. Take the logarithm of the mel filter-bank energies;
  5. Take the discrete cosine transform (DCT) of the list of mel log powers;
  6. Keep the resulting DCT coefficients, which form the mel-frequency cepstrum; the MFCCs are the amplitudes of this cepstrum.
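In practice these steps are usually handled by an audio library rather than implemented by hand. A minimal sketch using librosa, assuming a mono audio file resampled to 16 kHz and 13 coefficients per frame (both arbitrary choices, not taken from this repository):

```python
import librosa
import numpy as np

# Hypothetical input file; any mono audio file readable by librosa works.
audio_path = "sample.wav"

# Load the waveform; sr=16000 resamples to 16 kHz (an assumed rate).
y, sr = librosa.load(audio_path, sr=16000)

# Steps 1-6 above (framing, windowing, DFT, mel filter bank, log, DCT)
# are all performed internally by librosa.feature.mfcc.
mfccs = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13,
                             n_fft=2048, hop_length=512)

print(mfccs.shape)  # (13, number_of_frames)

# A common way to get a fixed-length feature vector per clip is to
# average each coefficient over time.
mfcc_mean = np.mean(mfccs, axis=1)
```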
