A Computational Creativity project: music video generation from melody and user interaction
- MTG Jamendo Mood Classifier: Trained on the MTG-Jamendo dataset on top of Discogs-EffNet embeddings, this model by the Music Technology Group (MTG) at the Universitat Pompeu Fabra (UPF) predicts 56 mood and theme tags for audio files (see the inference sketch after this list).
- MusicGen: Trained on 20,000 hours of licensed music, this model by Meta’s FAIR team generates music from a text prompt, optionally conditioned on an input melody (see the generation sketch after this list).
- AnimateDiff: Trained on WebVid-10M, a dataset of stock videos, this model by researchers from the Shanghai AI Laboratory, CUHK, and Stanford generates videos from a text prompt, using epiCRealism as its text-to-image base model (see the sketch after this list).
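The mood classifier can be run with Essentia's TensorFlow wrappers. Below is a minimal inference sketch that follows the usage example published alongside the Essentia models; the input filename is a placeholder.

```python
from essentia.standard import (
    MonoLoader,
    TensorflowPredict2D,
    TensorflowPredictEffnetDiscogs,
)

# The models expect 16 kHz mono audio; "song.mp3" is a placeholder.
audio = MonoLoader(filename="song.mp3", sampleRate=16000, resampleQuality=4)()

# The Discogs-EffNet backbone computes the embeddings...
embedding_model = TensorflowPredictEffnetDiscogs(
    graphFilename="tf_graph_files/discogs-effnet-bs64-1.pb",
    output="PartitionedCall:1",
)
embeddings = embedding_model(audio)

# ...and the classification head maps them to 56 mood/theme activations,
# one row per analysis patch.
classifier = TensorflowPredict2D(
    graphFilename="tf_graph_files/mtg_jamendo_moodtheme-discogs-effnet-1.pb"
)
predictions = classifier(embeddings)
```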
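For MusicGen, melody-conditioned generation is available through Meta's audiocraft library. The sketch below assumes the facebook/musicgen-melody checkpoint; the prompt, duration, and melody file are illustrative, not this project's exact settings.

```python
import torchaudio
from audiocraft.data.audio import audio_write
from audiocraft.models import MusicGen

model = MusicGen.get_pretrained("facebook/musicgen-melody")
model.set_generation_params(duration=8)  # seconds of audio to generate

# Load the melody to condition on; "melody.wav" is a placeholder.
melody, sr = torchaudio.load("melody.wav")

# generate_with_chroma expects melody_wavs of shape (batch, channels, samples).
wav = model.generate_with_chroma(
    descriptions=["upbeat electronic track"],
    melody_wavs=melody[None],
    melody_sample_rate=sr,
)
audio_write("generated", wav[0].cpu(), model.sample_rate, strategy="loudness")
```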
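For AnimateDiff, one way to pair the motion adapter with an epiCRealism base model is the Hugging Face diffusers pipeline, sketched below; the model IDs, scheduler settings, and prompt are assumptions, not necessarily what this project uses.

```python
import torch
from diffusers import AnimateDiffPipeline, DDIMScheduler, MotionAdapter
from diffusers.utils import export_to_gif

# Assumed checkpoints: the AnimateDiff v1.5-2 motion adapter and an
# epiCRealism mirror on the Hugging Face Hub.
adapter = MotionAdapter.from_pretrained(
    "guoyww/animatediff-motion-adapter-v1-5-2", torch_dtype=torch.float16
)
pipe = AnimateDiffPipeline.from_pretrained(
    "emilianJR/epiCRealism", motion_adapter=adapter, torch_dtype=torch.float16
)
pipe.scheduler = DDIMScheduler.from_pretrained(
    "emilianJR/epiCRealism",
    subfolder="scheduler",
    clip_sample=False,
    timestep_spacing="linspace",
    beta_schedule="linear",
    steps_offset=1,
)
pipe.enable_model_cpu_offload()  # keeps VRAM usage manageable

output = pipe(
    prompt="a calm sunrise over the ocean, cinematic",  # placeholder prompt
    num_frames=16,
    guidance_scale=7.5,
    num_inference_steps=25,
)
export_to_gif(output.frames[0], "clip.gif")
```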
- Install dependencies with poetry:

  ```sh
  poetry install
  ```
- Download the Essentia models and save them to the `tf_graph_files/` folder:

  ```sh
  wget https://essentia.upf.edu/models/music-style-classification/discogs-effnet/discogs-effnet-bs64-1.pb -P tf_graph_files
  wget https://essentia.upf.edu/models/classification-heads/mtg_jamendo_moodtheme/mtg_jamendo_moodtheme-discogs-effnet-1.pb -P tf_graph_files
  ```
- Run the Gradio app:

  ```sh
  cd src && poetry run python app.py
  ```
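For orientation, a stripped-down version of the app's interface could look like the following Gradio sketch; `melody_to_video` is a hypothetical stand-in for the real pipeline (mood classification, MusicGen, AnimateDiff), and the interface details are assumptions.

```python
import gradio as gr

def melody_to_video(audio_path: str) -> str:
    """Hypothetical pipeline: classify moods, generate music with MusicGen,
    render visuals with AnimateDiff, and return the path to the final video."""
    raise NotImplementedError

demo = gr.Interface(
    fn=melody_to_video,
    inputs=gr.Audio(sources=["microphone", "upload"], type="filepath"),
    outputs=gr.Video(),
    title="Hum Me a Melody",  # assumed title, after the notebook's name
)

if __name__ == "__main__":
    demo.launch()
```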
- `hum_me_a_melody_gradio_final.ipynb`: a Google Colab compatible notebook