Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] OrchideaSOL #174

Closed
wants to merge 22 commits into from
Closed

[WIP] OrchideaSOL #174

wants to merge 22 commits into from

Conversation

lostanlen
Copy link
Collaborator

OrchideaSOL is a dataset of 13265 samples, each containing a single musical note from one of 14 different instruments:

    Bass Tuba
    French Horn
    Trombone
    Trumpet in C
    Accordion
    Contrabass
    Violin
    Viola
    Violoncello
    Bassoon
    Clarinet in B-flat
    Flute
    Oboe
    Alto Saxophone

These sounds were originally recorded at Ircam in Paris (France) between 1996 and 1999, as part of a larger project named Studio On Line (SOL). One asset of OrchideaSOL is that it contains many combinations of mutes and extended playing techniques.

The OrchideaSOL audio data can be used for creative purposes insofar at the use complies with the Ircam Forum License. Please visit: https://forum.ircam.fr/legal/contrat-de-licence-forum-ircam/

The OrchideaSOL metadata can be used for creative purposes insofar at the use complies with the Creative Commons Attribution 4.0 International license (see below).

OrchideaSOL can be used for education and research purposes. In particular, it can be employed as a dataset for training and/or evaluating music information retrieval (MIR) systems, for tasks such as instrument recognition, playing technique recognition, or fundamental frequency estimation. For this purpose, we provide an official 5-fold split of OrchideaSOL. This split has been carefully balanced in terms of instrumentation, pitch range, and dynamics. For the sake of research reproducibility, we encourage users of OrchideaSOL to adopt this split and report their results in terms of average performance across folds.

(this PR is part of an under-review paper to ICMC)
(built on top of TinySOL: #165)

Vincent Lostanlen added 21 commits February 7, 2020 20:50
create tinysol_index.json

start writing tinysol.py

write cite for TinySOL

write tinysol docstring

update tinysol parser

update tinysol pretty-print

update tinysol metadata_path

write tinysol.download

add Zenodo urls and checksums to tinysol

add tinysol to mirdata __init__

set tinysol to v3.0

syntax error in tinysol

update make_tinysol_index

re-generate TinySOL index for v3.0

bugfix ANNOTATION_REMOTE in tinysol

run black -S on TinySOL PR

upload TinySOL testing resources

TinySOL test_cite

TinySOL test_validate

TinySOL test_download

TinySOL test_load

TinySOL test_track_ids

TinySOL test_to_jams

update TinySOL_metadata

update TinySOL metadata indexing system

remove WAV extension from TinySOL index

bugfix tinysol module

path descriptions and string-int conversion

update make_tinysol_index with splitext keys

run black style checker on TinySOL

update tinysol test

convert fold to int

convert Resampled field to bool

test tinysol with a string instrument for higher coverage

remove trailing comma in repr_string for tinysol (wind instrument)

write more TinySOL tests

bugfix tinysol bool conversion

bugfix tinysol tests

add Cb-ord-A2-mf-3cR to TinySOL metadata in tests

run black style checker on TinySOL
@lostanlen lostanlen added the new loader request to add a specific dataset loader label Feb 24, 2020
@rabitt
Copy link
Collaborator

rabitt commented Mar 5, 2020

I'm going to wait until #165 is merged before reviewing this so that the diff is clear!

@rabitt
Copy link
Collaborator

rabitt commented Mar 10, 2020

@lostanlen can you rebase this against master now that #165 is merged?

@lostanlen lostanlen added the please-do-not-edit Use this tag if you do not want the maintainers to make commits on your pull request. label Apr 5, 2020
@lostanlen
Copy link
Collaborator Author

This is too out of sync at this point. I am going to start over in the prospect of v0.3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new loader request to add a specific dataset loader please-do-not-edit Use this tag if you do not want the maintainers to make commits on your pull request.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants