audio-alphabetizer

An automated tool that rearranges the spoken words in an audio file into alphabetical order, inspired by weird videos such as this one, which I find strangely funny for no particular reason.

This project uses the following libraries:

CMUSphinx to obtain the timing of the words
PyDub to splice and join the audio accordingly

That said, PyDub is only a temporary choice: I would eventually like to have all of the source code in Java so I can package everything as a single executable JAR file.
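
As a rough illustration of the idea (not the actual contents of spliceAudio.py), the sketch below sorts a list of word timings alphabetically and joins the matching slices. The `WordTiming` type and the `alphabetize` function are hypothetical names made up for this example. It assumes the timings are in milliseconds and that `audio` can be sliced by millisecond and joined with `+`, which is how a PyDub `AudioSegment` behaves.

```python
from typing import List, NamedTuple


class WordTiming(NamedTuple):
    """One aligned word (hypothetical type for this sketch)."""
    word: str
    start_ms: int
    end_ms: int


def alphabetize(audio, timings: List[WordTiming]):
    """Return the word slices of `audio` joined in alphabetical order.

    `audio` is assumed to support slicing by millisecond and joining
    with `+`, as a pydub.AudioSegment does.
    """
    # Sort case-insensitively; ties keep their original spoken order.
    ordered = sorted(timings, key=lambda t: (t.word.lower(), t.start_ms))

    result = audio[:0]  # start from an empty slice of the same type
    for timing in ordered:
        result = result + audio[timing.start_ms:timing.end_ms]
    return result
```

With PyDub installed, `audio` could be loaded with `AudioSegment.from_wav(path)` and the result written out with its `export` method. The timings themselves would have to come from the CMUSphinx alignment step, which this sketch does not cover.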

Please note that CMUSphinx is very picky about the format of the audio it aligns. The audio must match the following format exactly, as per the CMUSphinx website:

RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 16000 Hz
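
To check a file against this format before handing it to the aligner, something like the following sketch could be used. It relies only on Python's standard `wave` module; the function name `is_sphinx_compatible` is a hypothetical helper and not part of this project.

```python
import wave


def is_sphinx_compatible(path: str) -> bool:
    """Return True if `path` is a 16-bit, mono, 16000 Hz PCM WAVE file."""
    try:
        with wave.open(path, "rb") as wav:
            return (
                wav.getnchannels() == 1          # mono
                and wav.getsampwidth() == 2      # 2 bytes per sample = 16 bit
                and wav.getframerate() == 16000  # 16000 Hz
                and wav.getcomptype() == "NONE"  # uncompressed PCM
            )
    except wave.Error:
        # Raised when the file is not a WAVE file the module can read.
        return False
```

If a file fails the check, it needs to be converted first. PyDub's `set_frame_rate`, `set_channels`, and `set_sample_width` methods are one way to do that before exporting to WAV.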

Also, for CMUSphinx to pick up every word, there must be little to no background noise in the recording. Here is what I've tried:

Source	Result
Road work ahead?	Works flawlessly
Scott Bradford	Works well until the end
I can't believe you've done this	Works flawlessly (although I did cut out the last ~2 seconds)
No Vegetal	A few words are missed (particularly the ones with the overboosted audio)
Icced	Barely anything is picked up
Steamed Hams	Almost everything is picked up, except for the part with the theme song, where almost nothing is
