[mono][interp] Fix short branches #58806

BrzVlad · 2021-09-08T14:30:20Z

In the interpreter, long branches take a 4-byte signed offset while short branches take a 2-byte signed offset. Ideally we would emit short branches whenever possible, since they perform better. The problem is that we choose between a short and a long branch early in the codegen process, when there is no way to know how far away the target will end up, and nothing later on revisited that choice. Before this commit we arbitrarily decided that all branches in methods with an IL size of less than 25000 are short and, in some cases, completely ignored that long branches actually happen in practice.
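
To make the size limit concrete, here is a minimal sketch of the range check a short branch has to pass. The helper names `offset_fits_short` and `encode_short_offset` are hypothetical and are not taken from the interpreter sources; the sketch only assumes that a short branch stores its displacement in a signed 16-bit operand.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: true when a branch displacement fits in the
 * 2-byte signed operand of a short branch. */
static bool
offset_fits_short (int32_t offset)
{
	return offset >= INT16_MIN && offset <= INT16_MAX;
}

/* Hypothetical helper: narrow a displacement for a short branch.
 * The caller must have checked the range first. */
static int16_t
encode_short_offset (int32_t offset)
{
	assert (offset_fits_short (offset));
	return (int16_t) offset;
}
```

Any displacement outside the range from -32768 to 32767 cannot be encoded this way and needs the long form.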

This commit makes us always emit long opcodes at the beginning of the codegen process. This is also a simplification, since optimizations operating on IR code no longer need to care about both short and long versions of branches. Later on, when emitting the final method code, we convert these long branches to short branches wherever the offset fits. We achieve this by doing a quick preliminary iteration over the code and computing conservative native offsets (assuming all branches are long). Since the final code will be equal in size or smaller, we have the guarantee that if a branch is short between the conservative offsets, it will also be short in the final code. This approach guarantees correctness, and we fail to shorten only a negligible number of branches.
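
The conservative-offset idea can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from this PR: the `Ins` struct, its fields, the instruction sizes, and the functions `compute_offsets` and `shorten_branches` are all hypothetical, and displacements are measured from the start of the branch instruction to the start of its target.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical instruction model, for illustration only. */
typedef struct {
	bool is_branch;
	size_t target;      /* index of the target instruction (branches only) */
	int32_t long_size;  /* encoded size of the long form */
	int32_t short_size; /* encoded size of the short form (same as long_size for non-branches) */
	bool use_short;     /* output: set when the branch can be shortened */
} Ins;

/* Compute the start offset of every instruction. With conservative == true
 * every instruction is assumed to use its long form. */
static void
compute_offsets (const Ins *ins, size_t n, bool conservative, int32_t *offsets)
{
	int32_t off = 0;
	for (size_t i = 0; i < n; i++) {
		bool is_short = !conservative && ins [i].use_short;
		offsets [i] = off;
		off += is_short ? ins [i].short_size : ins [i].long_size;
	}
}

/* Mark a branch as short when its conservative displacement fits in 16 bits.
 * No instruction grows in the final code, so the real displacement can only
 * be equal or smaller in magnitude. Returns the number of shortened branches. */
static size_t
shorten_branches (Ins *ins, size_t n, int32_t *scratch)
{
	size_t shortened = 0;
	compute_offsets (ins, n, true, scratch);
	for (size_t i = 0; i < n; i++) {
		ins [i].use_short = false;
		if (!ins [i].is_branch)
			continue;
		assert (ins [i].target < n);
		int32_t disp = scratch [ins [i].target] - scratch [i];
		if (disp >= INT16_MIN && disp <= INT16_MAX) {
			ins [i].use_short = true;
			shortened++;
		}
	}
	return shortened;
}
```

In this model a branch whose conservative displacement is 32768 stays long even if shortening other branches would have brought it down to 32767. That is the kind of missed shortening the description accepts in exchange for a single cheap pass.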

Since the super instruction pass generates some branching instructions that are supported only in the short version, we need to run a computation of conservative offsets before that pass. These iterations can later be optimized out, by prioritizing general compilation time over marginal improvements in code quality for very rare gigantic methods.

Fixes #57363

ghost · 2021-09-08T14:30:27Z

Tagging subscribers to this area: @BrzVlad
See info in area-owners.md if you want to be subscribed.

Author:	BrzVlad
Assignees:	-
Labels:	`area-Codegen-Interpreter-mono`
Milestone:	-

marek-safar · 2021-09-16T06:36:58Z

@vargaz please review

BrzVlad · 2021-09-16T20:44:00Z

wasm failures are fixed by #59225

vargaz · 2021-10-12T07:17:22Z

Wouldn't it be better to merge all the long branches into one instruction? They are probably pretty rare.

BrzVlad · 2021-10-12T08:15:46Z

@vargaz Yeah, however it requires a few more changes. We should do that at some point.

thaystg · 2022-07-26T19:15:15Z

@BrzVlad Can we backport this PR to .NET 6? We have this related issue: #63581

thaystg · 2022-07-28T16:14:49Z

/backport to release/6.0

BrzVlad requested a review from vargaz as a code owner September 8, 2021 14:30

dotnet-issue-labeler bot added the area-Codegen-Interpreter-mono label Sep 8, 2021

BrzVlad added 2 commits September 20, 2021 11:20

Re-enable test suite

22b2a7d

BrzVlad force-pushed the fix-interp-br branch from 4b9d4da to 22b2a7d Compare September 20, 2021 08:20

vargaz approved these changes Oct 12, 2021

BrzVlad merged commit 383a479 into dotnet:main Oct 12, 2021

ghost locked as resolved and limited conversation to collaborators Nov 11, 2021
