Simple quality measurement tool #122

avm · 2018-03-12T13:57:28Z

In order to make Audiveris better, we need some way to measure how good it already is. This is an attempt to actually measure the recognition quality with the simplest possible metric: number of notes and rests correctly placed in the recognition result.

This requires Python 3 with the music21 library installed. One sample test case is included (actually taken from issue #78), along with its "ideal" recognition result.

The proposed workflow is this:

before starting work on a recognition issue, add a minimal example of it to test/cases/ (source.png or source.pdf and target.xml);
measure recognition quality with test$ ./diffscore.py -c cases;
fix the issue in the code;
measure quality again, checking that it went up. If it went down, it means that we have either broken something else or not fixed what we intended.

The quality score calculation is quite simplistic right now, just checking pitch and duration for the notes and rests. Later on, within the same framework and using existing test cases, we can start taking into account more stuff (keys, time signatures, measure durations, dynamics, etc).

This compares the notes in two MusicXML files and calculates a simple score, correlated with recognition quality. Python and music21 are required.

hbitteur · 2018-03-13T08:53:52Z

Thanks Alexander. We are more and more aware of the need for such regression management, but failed to provide one for lack of high-level measurement tool.
Your proposal sounds very simple but may well be very useful, especially within the context of continuous integration, which is still to be set up.
@maximumspatium is the right person to work with you on this topic, so I took the liberty to assign him this issue. He may not be immediately available, but please don't give up.

maximumspatium · 2018-03-13T09:01:13Z

@maximumspatium is the right person to work with you on this topic, so I took the liberty to assign him this issue.

Thanks :)

He may not be immediately available, but please don't give up.

That's true. As a hobbyist programmer, I'm doing all this work in my spare time.
@avm Please be patient - I'll look into your proposal in the next days and give you a feedback.

guillaumerose · 2022-02-02T19:59:13Z

This regression test looks good to me. Since this project is in Java, it will be more natural to JUnit. Happy to do it if you are interested.

hbitteur · 2022-02-02T20:34:18Z

Of course this would be interesting for the project.
But the initial message was posted several years ago.
See what you can do to re-activate this.

@avm

As provided in PR Audiveris#122 by Alexander Myltsev (@avm).

PeterGreth · 2022-04-03T17:36:10Z

Hello everybody!

Thanks @avm for this nice contribution! I think tracking recognition quality over time/commits would indeed be very valuable 😃

I had a few spare hours and felt like I could contribute to this awesome project by providing the equivalent implementation in Java/Junit. @guillaumerose I hope you hadn't started yet - at least I could not find a branch or commits on your Audiveris fork.

For clarity, I created a separate PR: #563

Cheers
Peter

avm added 3 commits March 12, 2018 15:47

A simple tool for quality measurement.

7e4a148

This compares the notes in two MusicXML files and calculates a simple score, correlated with recognition quality. Python and music21 are required.

diffscore.py: support processing the cases/ directory.

7bb218a

Fix pyflakes warnings.

2d6796c

hbitteur assigned maximumspatium Mar 13, 2018

PeterGreth added a commit to PeterGreth/audiveris that referenced this pull request Apr 3, 2022

First regression test case "01-klavier"

b2b2938

As provided in PR Audiveris#122 by Alexander Myltsev (@avm).

PeterGreth mentioned this pull request Apr 3, 2022

Simple quality measurement tool - Java adaptation #563

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple quality measurement tool #122

Simple quality measurement tool #122

avm commented Mar 12, 2018 •

edited by maximumspatium

Loading

hbitteur commented Mar 13, 2018

maximumspatium commented Mar 13, 2018

guillaumerose commented Feb 2, 2022

hbitteur commented Feb 2, 2022

PeterGreth commented Apr 3, 2022

Simple quality measurement tool #122

Are you sure you want to change the base?

Simple quality measurement tool #122

Conversation

avm commented Mar 12, 2018 • edited by maximumspatium Loading

hbitteur commented Mar 13, 2018

maximumspatium commented Mar 13, 2018

guillaumerose commented Feb 2, 2022

hbitteur commented Feb 2, 2022

PeterGreth commented Apr 3, 2022

avm commented Mar 12, 2018 •

edited by maximumspatium

Loading