Randomize Framework order in runs #8830

Open
p8 opened this issue Apr 1, 2024 · 13 comments

@p8 (Contributor) commented Apr 1, 2024

Currently the benchmarks are run in the same order every time.
Sometimes a run fails after a number of frameworks have already been benchmarked, or the run is restarted.
This gives frameworks whose names start with a more test runs than frameworks whose names start with z.
If the order were randomized, the number of runs would be distributed more evenly.
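
For illustration, a minimal sketch of the idea, assuming the runner keeps the framework tests in a list (the function below is hypothetical, not the actual toolset API):

```python
import random

def order_frameworks(frameworks, seed=None):
    """Return the frameworks in random order so no alphabetical
    prefix is favored when a run fails or is restarted."""
    rng = random.Random(seed)
    shuffled = list(frameworks)
    rng.shuffle(shuffled)
    return shuffled

# Example: alphabetical input, randomized output.
print(order_frameworks(["actix", "gemini", "uwebsockets", "zysocket"]))
```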

@fakeshadow (Contributor)

As someone maintaining a benchmark that starts with x, I feel this.

That said, IMO a fair way of handling the order is to prioritize the benchmarks with the most recent changes. As for benchmarks that haven't changed in a while, I'd guess their maintainers care less about the continuous run results.
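
A rough sketch of that ordering, assuming each benchmark lives in its own directory inside a git checkout (the helper names here are hypothetical, not part of the toolset):

```python
import subprocess

def last_commit_epoch(path):
    # Unix timestamp of the newest commit touching this directory,
    # or 0 if it has no history.
    out = subprocess.run(
        ["git", "log", "-1", "--format=%ct", "--", path],
        capture_output=True, text=True, check=True,
    )
    return int(out.stdout.strip() or 0)

def order_by_recency(framework_dirs):
    # Most recently changed benchmarks run first.
    return sorted(framework_dirs, key=last_commit_epoch, reverse=True)
```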

@joanhey (Contributor) commented Apr 1, 2024

I think it's enough to start one run with a and the next with z.
And perhaps we'll still see some differences in the results.

Between this run and the next, the servers, databases, ... change, so those changes apply to all frameworks.
They don't depend on changes within the frameworks themselves.
A mature framework needs fewer changes than a young one, and we can still benchmark locally to test small changes.
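
A sketch of the flip, assuming a marker file persisted between runs (the file name is made up for illustration):

```python
from pathlib import Path

FLAG = Path("run-order-reversed.flag")  # hypothetical marker file

def order_for_this_run(frameworks):
    ordered = sorted(frameworks)
    if FLAG.exists():
        FLAG.unlink()          # next run goes a -> z again
        return ordered[::-1]   # this run goes z -> a
    FLAG.touch()               # remember to reverse next time
    return ordered
```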

@itrofimow (Contributor) commented Apr 2, 2024

What I maintain starts with u, so I'm heavily biased here, but I would also appreciate this change being implemented.

My concern is not about failures or restarts, as they usually don't happen that often when the environment is stable, but rather about feedback latency:
I mostly use TFB as a measurement tool (and a big shout-out to the TE crew for providing that tool), and given a hypothetical performance drop in the ongoing run, I'm left with approximately a day to squeeze a potential fix into the next measurement; failing to do so leads to a feedback latency of two full weeks (every run takes approximately a week).
Moreover, any dependency bump I do costs at least a week (almost a full run) in feedback latency, and 1.5 weeks on average.

Flipping the order between runs (or FWIW randomizing it) would significantly reduce these latencies for me.

@joanhey (Contributor) commented Apr 2, 2024

The frameworks that sit in the middle have ~3 days to make changes.
For them it's the same whether the bench begins with a or in reverse order.
The problem is for the frameworks that come last in the run.
Please don't randomize; right now we roughly know when the results for our framework will appear.
But we do need to flip the order on every new run!

@p8 (Contributor, Author) commented Apr 2, 2024

Flipping the order each time makes sense to me.

@NateBrady23 (Member)

I like the idea of flipping the order. I'm just getting back from vacation and catching up on a bunch of stuff. Let's get the environment stable, and then I think this is easy to do. Will leave this open until we get it in.

@joanhey (Contributor) commented May 2, 2024

After the last full run finished, the next run did not flip the order.

@volyrique (Contributor)

That's because the tfb-startup.sh script runs tfb-shutdown.sh on startup; the latter is responsible for flipping the order. Is changing the order only after an unsuccessful run by design?

@p8 (Contributor, Author) commented May 3, 2024

I think the following run was reversed: https://tfb-status.techempower.com/results/3c2e9871-9c2a-4ff3-bc31-620f65da4e74. The “last framework” tested is incorrect though.

@NateBrady23 (Member)

> That's because the tfb-startup.sh script runs tfb-shutdown.sh on startup; the latter is responsible for flipping the order. Is changing the order only after an unsuccessful run by design?

No, I forgot that we actually run the shutdown script twice after a successful run because it's being called from the startup script as well. The design was supposed to be the exact opposite. I'll have to move it to the startup script and it will just reverse every time a run starts.
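
A Python sketch of that double toggle, just to illustrate the control flow (the real scripts are shell, and these function bodies are made up):

```python
# Old behavior: the reversal toggle lived in shutdown, but startup
# also called shutdown for cleanup, so each successful run toggled twice.
def shutdown(state):
    state["reversed"] = not state["reversed"]

def startup(state):
    shutdown(state)  # cleanup from the previous run -- toggles again

state = {"reversed": False}
shutdown(state)  # end of run N: reversed -> True
startup(state)   # start of run N+1: reversed -> False again
print(state)     # {'reversed': False} -- the two toggles cancel out
```

Moving the toggle so it happens exactly once, at startup, makes the order alternate as intended.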

@volyrique (Contributor)

@NateBrady23 It looks like now the opposite thing is happening - the order is always reversed, i.e. the implementations starting with Z run first.

@akupiec (Contributor) commented Aug 28, 2024

How about adding an option to run tests in the order of their last execution time, from fastest to slowest? 😈
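
As a sketch, assuming per-framework wall-clock durations from the previous run are available as a mapping (nothing like this exists in the toolset today):

```python
def order_by_duration(durations):
    # durations: framework name -> seconds taken in the previous run.
    # Fastest first, so quick benchmarks report results early.
    return sorted(durations, key=durations.get)
```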

@volyrique (Contributor)

I am pretty sure that approach would end up being effectively the same as running them in alphabetical order (or in random order at best), at the cost of significant implementation complexity.
