Add inline(never) to bench systems #9824

nicopap · 2023-09-16T05:49:33Z

Objective

It is difficult to inspect the generated assembly of benchmark systems using a tool such as cargo-asm

Solution

Mark the related functions as #[inline(never)]. This way, you can pass the module name as argument to cargo-asm to get the generated assembly for the given function.

It may have as side effect to make benchmarks a bit more predictable and useful too. As it prevents inlining where in bevy no inlining could possibly take place.

Measurements

Following the recommendations in https://easyperf.net/blog/2019/08/02/Perf-measurement-environment-on-Linux, I

Put my CPU in "AMD ECO" mode, which surprisingly is the equivalent of disabling turboboost, giving more consistent performances
Disabled all hyperthreading cores using echo 0 > /sys/devices/system/cpu/cpu{11,12…}/online
Set the scaling governor to performance
Manually disabled AMD boost with echo 0 > /sys/devices/system/cpu/cpufreq/boost
Set the nice level of the criterion benchmark using cargo bench … & sudo renice -n -5 -p $! ; fg
Not running any other program than the benchmarks (outside of system daemons and the X11 server)

With this setup, running multiple times the same benchmarks on main gives me a lot of "regression" and "improvement" messages, which is absurd given that no code changed.

On this branch, there is still some spurious performance change detection, but they are much less frequent.

This only accounts for iter_simple and iter_frag benchmarks of course.

Why? Because then it becomes easier to inspect generated ASM using a tool like `cargo-asm`.

atlv24

This makes a lot of sense and is well-justified

james7132

LGTM. Do we need #[no_mangle] to make it easier to find the symbols?

nicopap · 2023-10-02T09:09:39Z

cargo-asm is capable of demangling. so for this specific use-case #[no_mangle] is not needed. Though it might be interesting for usage with other tools.

# Objective It is difficult to inspect the generated assembly of benchmark systems using a tool such as `cargo-asm` ## Solution Mark the related functions as `#[inline(never)]`. This way, you can pass the module name as argument to `cargo-asm` to get the generated assembly for the given function. It may have as side effect to make benchmarks a bit more predictable and useful too. As it prevents inlining where in bevy no inlining could possibly take place. ### Measurements Following the recommendations in <https://easyperf.net/blog/2019/08/02/Perf-measurement-environment-on-Linux>, I 1. Put my CPU in "AMD ECO" mode, which surprisingly is the equivalent of disabling turboboost, giving more consistent performances 2. Disabled all hyperthreading cores using `echo 0 > /sys/devices/system/cpu/cpu{11,12…}/online` 3. Set the scaling governor to `performance` 4. Manually disabled AMD boost with `echo 0 > /sys/devices/system/cpu/cpufreq/boost` 5. Set the nice level of the criterion benchmark using `cargo bench … & sudo renice -n -5 -p $! ; fg` 6. Not running any other program than the benchmarks (outside of system daemons and the X11 server) With this setup, running multiple times the same benchmarks on `main` gives me a lot of "regression" and "improvement" messages, which is absurd given that no code changed. On this branch, there is still some spurious performance change detection, but they are much less frequent. This only accounts for `iter_simple` and `iter_frag` benchmarks of course.

Add inline(never) to bench systems

93215f8

Why? Because then it becomes easier to inspect generated ASM using a tool like `cargo-asm`.

nicopap added the C-Usability A targeted quality-of-life change that makes Bevy easier to use label Sep 16, 2023

nicopap marked this pull request as ready for review September 16, 2023 07:53

atlv24 approved these changes Sep 24, 2023

View reviewed changes

james7132 approved these changes Oct 2, 2023

View reviewed changes

james7132 added the S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it label Oct 2, 2023

alice-i-cecile added this pull request to the merge queue Oct 2, 2023

Merged via the queue into bevyengine:main with commit 47409c8 Oct 2, 2023
24 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add inline(never) to bench systems #9824

Add inline(never) to bench systems #9824

nicopap commented Sep 16, 2023 •

edited

Loading

atlv24 left a comment

james7132 left a comment

nicopap commented Oct 2, 2023

Add inline(never) to bench systems #9824

Add inline(never) to bench systems #9824

Conversation

nicopap commented Sep 16, 2023 • edited Loading

Objective

Solution

Measurements

atlv24 left a comment

Choose a reason for hiding this comment

james7132 left a comment

Choose a reason for hiding this comment

nicopap commented Oct 2, 2023

nicopap commented Sep 16, 2023 •

edited

Loading