BSCRC

Bootleg Score Composer Recognition Challenge 🏆

Datasets

To regenerate the datasets, navigate to dataset_creation.ipynb and change the data_path variable to wherever you would like to save the dataset files. Run the notebook all the way through to get your own 9_way_dataset.pkland 100_way_dataset.pkl. Note that regenerating the datasets requires you to have the predictions from the filler classifier, like ensemble_imslp.tsv found in the filler folder.

The 9-way dataset has 9 hand-selected composers for being generally well known, whereas the 100-way composers has the top 100 composers with the most registered bootleg score events. You can see the lists for both sets of composers in config.

When unpickled, these files store a tuple in the format of (x_train, y_train, meta_train, x_valid, y_valid, meta_valid, x_test, y_test, meta_test).

"x" represents the input features formatted as a list of numpy arrays.

"y" represents the labels formated as a list of integers. The integer values is assigned by the index of the composer after being sorting all of them alphabetically.

"meta" represents the metadata for the fragment, formatted as a tuple of (ID, start_offset). The ID represents the unique ID assigned to the PDF the fragment was grabbed from on the IMSLP archive. The start offset is the starting index of the fragment in the overall bootleg score of the PDF.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
config		config
continuous_models		continuous_models
filler_classifier		filler_classifier
utils		utils
.gitignore		.gitignore
01_dataset_creation.ipynb		01_dataset_creation.ipynb
02_finetuning_data_preprocessing.ipynb		02_finetuning_data_preprocessing.ipynb
03_LM_pretraining_data_preprocessing.ipynb		03_LM_pretraining_data_preprocessing.ipynb
LICENSE		LICENSE
README.md		README.md
evaluation.py		evaluation.py
example.ipynb		example.ipynb
model_training_pipeline.ipynb		model_training_pipeline.ipynb
temp.py		temp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BSCRC

Datasets

About

Releases

Packages

Contributors 2

Languages

License

HMC-MIR/BSCRC

Folders and files

Latest commit

History

Repository files navigation

BSCRC

Datasets

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages