Interact with the full pipeline on HF Spaces!
- Install dependencies using Poetry:
poetry install
- Download the dataset MIR-1K
poetry run src/source_sep.py
Trained models on Hugging Face:
To run training pipeline:
poetry run src/train_pitch_transcription.py
Training scripts:
convtasnet.ipynb
: Evaluate pre-trained Conv-TasNet model for vocal seprationtest_crepe.ipynb
: Train CREPE model for pitch transcriptioncrepe_model.py
: Define CREPE model
Pipeline:
combined_pipeline.ipynb
: combines both Conv-TasNet and trained CREPE models into a single pipelinecrepe_model.py
Artifacts:
best_crepe_xx.pkl
: Stores best trained CREPE model for each model sizeleon_7_jmzen_5.wav
: mixed track used for case studygbqq_lwq_mixed.wav
: mixed track by me (lol) to test in-sync vocals
Others:
analyze_pitch_labels
: initial analysis on distribution of pitches in dataset
- Setup Poetry
- Move from notebook to script
- Test individual scripts in
src/
- Get intermediate audio samples for writeup post
- Create e2e pipeline
- Host model(s) on HF
- Add gradio on HF Spaces hehe