At opt-levels <= 1 the arithmetic operation methods do not get inlined, preventing other optimisations #75598

nagisa · 2020-08-16T17:15:47Z

Consider code like this:

#![feature(core_intrinsics)]

pub fn method(a: usize, b: usize) -> usize {
    a.wrapping_sub(b)
}

pub fn intrinsic(a: usize, b: usize) -> usize {
    std::intrinsics::wrapping_sub(a, b)
}

compiler explorer

at -Copt-level=1 and lower, the generated assembly for versions with the method call will generate a function call, rather than direct operation. Once the inlining fails, other optimisations that could be done are inhibited, especially at -Copt-level=1`.

More generally speaking, I wonder if we might want to make these methods have a special annotation that would make the compiler generate the instructions directly much like it does for intrinsics right now. These seem like basic enough that #[inline(always)] might not be good enough (it being just a hint) and also possibly more expensive than necessary (something needs to do the inlining still).

The text was updated successfully, but these errors were encountered:

alecmocatta · 2020-08-16T20:17:31Z

Relatedly, #74362 is an instance where not inlining a function that wraps an intrinsic inhibits DCE.

bugadani · 2020-09-20T12:54:42Z

#[inline(always)] might not be good enough

My naive assumption is that inline(always) would inline shorter functions more eagerly, since the relative cost of a function call is higher and also shorter functions don't contribute as much to binary bloat. Is this not the case?

Lokathor · 2021-03-07T01:35:05Z

Using the compiler explorer link, it seems like simply tagging the wrapping_ operations as #[inline(always)] will make them get inlined even at opt-level=0

AngelicosPhosphoros · 2021-04-04T00:37:20Z

These seem like basic enough that #[inline(always)] might not be good enough (it being just a hint) and also possibly more expensive than necessary (something needs to do the inlining still).

This is not true for opt-level=1, AFAIK. In this mode, LLVM runs always-inline which inlines functions without much thinking.
In higher optimization levels this pass doesn't run and inline(always) becomes hint.

AngelicosPhosphoros · 2021-04-05T16:08:46Z

Example of prevented optimization.
godbolt

…nline-always-arithmetic, r=nagisa Add some #[inline(always)] to arithmetic methods of integers I tried to add it only to methods which return results of intrinsics and don't have any branching. Branching could made performance of debug builds (`-Copt-level=0`) worse. Main goal of changes is allowing wider optimizations in `-Copt-level=1`. Closes: rust-lang#75598 r? `@nagisa`

This is a follow-up change to the fix for rust-lang#75598. It simplifies the implementation of wrapping_neg() for all integer types by just calling 0.wrapping_sub(self) and always inlines it. This leads to much less assembly code being emitted for opt-level≤1.

…m-ou-se Make wrapping_neg() use wrapping_sub(), #[inline(always)] This is a follow-up change to the fix for rust-lang#75598. It simplifies the implementation of wrapping_neg() for all integer types by just calling 0.wrapping_sub(self) and always inlines it. This leads to much less assembly code being emitted for opt-level≤1 and thus much better performance for debug-compiled code. Background is [this discussion on the internals forum](https://internals.rust-lang.org/t/why-does-rust-generate-10x-as-much-unoptimized-assembly-as-gcc/14930).

nagisa added I-slow Issue: Problems and improvements with respect to performance of generated code. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Aug 16, 2020

nagisa mentioned this issue Mar 7, 2021

wrapping_{op} is not inlined in debug mode #82821

Closed

AngelicosPhosphoros mentioned this issue Apr 10, 2021

Add some #[inline(always)] to arithmetic methods of integers #84061

Merged

bors closed this as completed in f8a12c6 Apr 18, 2021

hkratz mentioned this issue Jul 15, 2021

Make wrapping_neg() use wrapping_sub(), #[inline(always)] #87150

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

At opt-levels <= 1 the arithmetic operation methods do not get inlined, preventing other optimisations #75598

At opt-levels <= 1 the arithmetic operation methods do not get inlined, preventing other optimisations #75598

nagisa commented Aug 16, 2020 •

edited

Loading

alecmocatta commented Aug 16, 2020

bugadani commented Sep 20, 2020 •

edited

Loading

Lokathor commented Mar 7, 2021

AngelicosPhosphoros commented Apr 4, 2021

AngelicosPhosphoros commented Apr 5, 2021

At opt-levels <= 1 the arithmetic operation methods do not get inlined, preventing other optimisations #75598

At opt-levels <= 1 the arithmetic operation methods do not get inlined, preventing other optimisations #75598

Comments

nagisa commented Aug 16, 2020 • edited Loading

alecmocatta commented Aug 16, 2020

bugadani commented Sep 20, 2020 • edited Loading

Lokathor commented Mar 7, 2021

AngelicosPhosphoros commented Apr 4, 2021

AngelicosPhosphoros commented Apr 5, 2021

nagisa commented Aug 16, 2020 •

edited

Loading

bugadani commented Sep 20, 2020 •

edited

Loading