Benchmark downloader script should take the latest score, not the smallest #8707
Thanks to the observations of @JaroslavTulach, I have noticed that on our benchmark results pages (for both engine and stdlib), we have a lot of nonsensical data.
Problem description
Every benchmark run generates a bench-results.xml file with a format like this:
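The original description embedded the actual XML here; as a stand-in, below is a hypothetical sketch of such a report after a single clean run. The element names and the example label and value are assumptions, not taken from a real file:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Hypothetical sketch of bench-results.xml after one clean run;
     element names and values are illustrative assumptions. -->
<cases>
  <case>
    <label>org.enso.example.SomeBenchmark</label>
    <scores>
      <score>0.523</score>
    </scores>
  </case>
</cases>
```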
If the runner is not clean, the bench-results.xml file is not rewritten; instead, a new score value is appended to the existing one, like so:
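Continuing the same hypothetical sketch, after a second run on an unclean runner the stale score stays in place and the fresh one is appended after it:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- The same hypothetical file after a second run on an unclean runner:
     the stale score remains and the new one is appended after it. -->
<cases>
  <case>
    <label>org.enso.example.SomeBenchmark</label>
    <scores>
      <score>0.523</score> <!-- stale score from the previous run -->
      <score>0.497</score> <!-- score from the latest run -->
    </scores>
  </case>
</cases>
```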
The procedure in the bench_download.py script used to choose the lowest of these numbers:

https://github.com/enso-org/enso/blob/develop/tools/performance/engine-benchmarks/bench_download.py#L280-L284
This is obviously wrong. This PR fixes it by choosing the latest score, i.e. the last value in the list.
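Below is a minimal sketch of the kind of change involved, not the actual code from bench_download.py; the function name and the parsing approach are assumptions for illustration:

```python
import xml.etree.ElementTree as ET

def extract_score(case_element: ET.Element) -> float:
    """Extract the benchmark score from one <case> of bench-results.xml.

    A case may contain multiple <score> elements if the runner was not
    clean and new scores were appended to an old file.
    """
    scores = [float(s.text) for s in case_element.findall("./scores/score")]
    # Old behavior (wrong): min(scores) -- on an unclean runner this could
    # pick up a stale score left over from a previous run.
    # New behavior: take the last entry, which is the most recently
    # appended one, i.e. the score from the latest run.
    return scores[-1]
```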
TL;DR
All our benchmark data from runners that were not clean are wrong! In the picture below, the old, wrong value is on the right and the correct one is on the left:
The data in the picture was fetched with this snippet:
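The snippet itself is not preserved in this copy of the description; the following is a hypothetical reconstruction of the kind of comparison it performed. The file name and the XML layout match the sketches above and are assumptions:

```python
import xml.etree.ElementTree as ET

# Hypothetical reconstruction: for every case in a locally downloaded
# report, compare the old selection (min) with the new one (last score).
tree = ET.parse("bench-results.xml")
for case in tree.getroot().findall("./case"):
    label = case.findtext("label")
    scores = [float(s.text) for s in case.findall("./scores/score")]
    print(f"{label}: old (min) = {min(scores)}, new (latest) = {scores[-1]}")
```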