coverage: Use a separate counter type and simplification step during counter creation #133849

Zalathar · 2024-12-04T09:30:51Z

When instrumenting a function's MIR for coverage, there is a point where we need to decide, for each node in the control-flow graph, whether its execution count will be tracked by a physical counter, or by an expression that combines physical counters from other parts of the graph.

Currently the code for doing that is heavily tied to the final form of the LLVM coverage mapping format, and performs some important simplification steps on-the-fly. These factors make the code extremely difficult to modify without breaking or massively worsening the resulting coverage-instrumentation metadata.

This PR aims to improve that situation somewhat by adding an extra intermediate representation between the code that chooses how each node will be counted, and the code that converts those decisions into actual tables of physical counters and trees of counter expressions.

As part of doing that, some of the simplifications that are currently performed during the main counter creation step have been pulled out into a separate step.

In most cases the resulting coverage metadata is equivalent, slightly better, or slightly worse. The biggest outlier is counters.rs, where the coverage metadata ends up about 10% larger. This seems to be the result of the new approach having less subexpression sharing (because it relies on flatten-sort-cancel), and therefore being less effective at taking advantage of MIR optimizations to replace counters for unused control-flow with zeroes. I think the modest downside is acceptable in light of the future possibilities opened up by this decoupling.

A "site" is a node or edge in the coverage graph.

This is more convenient for subsequent patches.

These simplifications are now handled by the transcribe step.

rustbot · 2024-12-04T09:31:02Z

r? @petrochenkov

rustbot has assigned @petrochenkov.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

rustbot · 2024-12-04T09:31:04Z

Some changes occurred to MIR optimizations

cc @rust-lang/wg-mir-opt

Zalathar · 2024-12-04T09:32:27Z

This churns all of the coverage tests, so it will conflict with any other PR (e.g. #133089) that re-blesses the .cov-map snapshots.

oli-obk · 2024-12-04T10:27:18Z

compiler/rustc_mir_transform/src/coverage/counters.rs

+        }
+    }
+
+    fn transcribe_counters(mut self) -> CoverageCounters {


So this is basically a normalization if there are no old counters and a system for keeping the diff small if there are old counters?

This is kind of tricky to explain, but it's also the whole crux of this PR, so I should try.

We can think of this code as going through a series of refactoring stages:

Graph traversal → CoverageCounters (status quo)

Graph traversal → CoverageCounters → Transcriber → simplified CoverageCounters

Graph traversal → FxHashMap<Site, SiteCounter> → Transcriber → simplified CoverageCounters

The main goal of introducing Transcriber as a middle layer is so that the part before Transcriber can be changed to not be tied to CoverageCounters. To make that feasible, we need to go through the intermediate step of having two different CoverageCounters (old and new), so that we can then replace the first one with something else.

The fact that final CoverageCounters is simpler than the original one starts off as being a bonus extra, but it also lets the earlier steps not care so much about producing “optimal” results in a single pass. I expect that to be a big help in future changes to how counter creation works.

That makes sense, thanks!

compiler/rustc_mir_transform/src/coverage/counters.rs

oli-obk · 2024-12-04T10:30:33Z

r? @oli-obk

oli-obk · 2024-12-04T12:14:39Z

@bors r+

bors · 2024-12-04T12:14:43Z

📌 Commit ba08056 has been approved by oli-obk

It is now in the queue for this repository.

…iaskrgr Rollup of 8 pull requests Successful merges: - rust-lang#133737 (Include LLDB and GDB visualizers in MSVC distribution) - rust-lang#133774 (Make CoercePointee errors translatable) - rust-lang#133831 (Don't try and handle unfed `type_of` on anon consts) - rust-lang#133847 (Remove `-Zshow-span`.) - rust-lang#133849 (coverage: Use a separate counter type and simplification step during counter creation) - rust-lang#133850 (Avoid `opaque type not constrained` errors in the presence of other errors) - rust-lang#133851 (Stop git from merging generated files) - rust-lang#133856 (Update sysinfo version to 0.33.0) r? `@ghost` `@rustbot` modify labels: rollup

Rollup merge of rust-lang#133849 - Zalathar:replay, r=oli-obk coverage: Use a separate counter type and simplification step during counter creation When instrumenting a function's MIR for coverage, there is a point where we need to decide, for each node in the control-flow graph, whether its execution count will be tracked by a physical counter, or by an expression that combines physical counters from other parts of the graph. Currently the code for doing that is heavily tied to the final form of the LLVM coverage mapping format, and performs some important simplification steps on-the-fly. These factors make the code extremely difficult to modify without breaking or massively worsening the resulting coverage-instrumentation metadata. --- This PR aims to improve that situation somewhat by adding an extra intermediate representation between the code that chooses how each node will be counted, and the code that converts those decisions into actual tables of physical counters and trees of counter expressions. As part of doing that, some of the simplifications that are currently performed during the main counter creation step have been pulled out into a separate step. In most cases the resulting coverage metadata is equivalent, slightly better, or slightly worse. The biggest outlier is `counters.rs`, where the coverage metadata ends up about 10% larger. This seems to be the result of the new approach having less subexpression sharing (because it relies on flatten-sort-cancel), and therefore being less effective at taking advantage of MIR optimizations to replace counters for unused control-flow with zeroes. I think the modest downside is acceptable in light of the future possibilities opened up by this decoupling.

Zalathar added 6 commits December 4, 2024 17:00

coverage: Extract subtracted_sum in counter creation

2a3b4a0

coverage: Rename CounterIncrementSite to just Site

7ecc677

A "site" is a node or edge in the coverage graph.

coverage: Use a single make_phys_counter method

aca6dba

This is more convenient for subsequent patches.

coverage: Add an extra "transcribe" step after counter creation

44e4e45

coverage: Use a separate counter type during counter creation

d7090f3

coverage: Remove the expression simplifier from CoverageCounters

ba08056

These simplifications are now handled by the transcribe step.

Zalathar added the A-code-coverage Area: Source-based code coverage (-Cinstrument-coverage) label Dec 4, 2024

rustbot assigned petrochenkov Dec 4, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Dec 4, 2024

oli-obk reviewed Dec 4, 2024

View reviewed changes

compiler/rustc_mir_transform/src/coverage/counters.rs Show resolved Hide resolved

rustbot assigned oli-obk and unassigned petrochenkov Dec 4, 2024

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Dec 4, 2024

Zalathar mentioned this pull request Dec 4, 2024

Stabilize noop_waker #133089

Open

Jeff9049 approved these changes Dec 4, 2024

View reviewed changes

matthiaskrgr mentioned this pull request Dec 4, 2024

Rollup of 8 pull requests #133865

Merged

bors merged commit 553db5f into rust-lang:master Dec 4, 2024
6 checks passed

rustbot added this to the 1.85.0 milestone Dec 4, 2024

Zalathar deleted the replay branch December 4, 2024 21:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

coverage: Use a separate counter type and simplification step during counter creation #133849

coverage: Use a separate counter type and simplification step during counter creation #133849

Zalathar commented Dec 4, 2024

rustbot commented Dec 4, 2024

rustbot commented Dec 4, 2024

Zalathar commented Dec 4, 2024 •

edited

Loading

oli-obk Dec 4, 2024

Zalathar Dec 4, 2024

Zalathar Dec 4, 2024

oli-obk Dec 4, 2024

oli-obk commented Dec 4, 2024

oli-obk commented Dec 4, 2024

bors commented Dec 4, 2024

coverage: Use a separate counter type and simplification step during counter creation #133849

coverage: Use a separate counter type and simplification step during counter creation #133849

Conversation

Zalathar commented Dec 4, 2024

rustbot commented Dec 4, 2024

rustbot commented Dec 4, 2024

Zalathar commented Dec 4, 2024 • edited Loading

oli-obk Dec 4, 2024

Choose a reason for hiding this comment

Zalathar Dec 4, 2024

Choose a reason for hiding this comment

Zalathar Dec 4, 2024

Choose a reason for hiding this comment

oli-obk Dec 4, 2024

Choose a reason for hiding this comment

oli-obk commented Dec 4, 2024

oli-obk commented Dec 4, 2024

bors commented Dec 4, 2024

Zalathar commented Dec 4, 2024 •

edited

Loading