Use llvm::computeLTOCacheKey to determine post-ThinLTO CGU reuse #76859

Aaron1011 · 2020-09-18T02:06:04Z

During incremental ThinLTO compilation, we attempt to re-use the
optimized (post-ThinLTO) bitcode file for a module if it is 'safe' to do
so.

Up until now, 'safe' has meant that the set of modules that the current
module imports from/exports to is unchanged from the previous
compilation session. See PR #67020 and PR #71131 for more details.

However, this turns out to be insufficient to guarantee that it's safe
to reuse the post-LTO module (i.e. that optimizing the pre-LTO module
would produce the same result). When LLVM optimizes a module during
ThinLTO, it may look at other information from the 'module index', such
as whether a (non-imported!) global variable is used. If this
information changes between compilation runs, we may end up re-using an
optimized module that (for example) had dead-code elimination run on a
function that is now used by another module.

Fortunately, LLVM implements its own ThinLTO module cache, which is used
when ThinLTO is performed by a linker plugin (e.g. when clang is used to
compile a C project). Using this cache directly would require extensive
refactoring of our code, but LLVM provides a function that does exactly
what we need.

The function llvm::computeLTOCacheKey is used to compute a SHA-1 hash
from all data that might influence the result of ThinLTO on a module.
In addition to the module imports/exports that we manually track, it
also hashes information about global variables (e.g. their liveness)
which might be used during optimization. By using this function, we
shouldn't have to worry about new LLVM passes breaking our module re-use
behavior.

In LLVM, the output of this function forms part of the filename used to
store the post-ThinLTO module. To keep our current filename structure
intact, this PR just writes out the mapping 'CGU name -> Hash' to a
file. To determine if a post-LTO module should be reused, we compare its
current hash against the hash recorded in the previous session.
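
As a rough illustration of the 'CGU name -> Hash' comparison described above, here is a minimal sketch. It is not the code from this PR: the `ThinLtoKeys` type, its methods, and the one-pair-per-line file format are all hypothetical, and the hash strings stand in for whatever `llvm::computeLTOCacheKey` returns.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the per-session "CGU name -> hash" mapping.
/// The names and the on-disk format here are assumptions for illustration.
#[derive(Debug, Default, Clone, PartialEq)]
struct ThinLtoKeys {
    keys: HashMap<String, String>,
}

impl ThinLtoKeys {
    /// Serialize as one "name hash" pair per line, sorted for stable output.
    fn serialize(&self) -> String {
        let mut entries: Vec<_> = self.keys.iter().collect();
        entries.sort();
        let mut out = String::new();
        for (name, key) in entries {
            out.push_str(name);
            out.push(' ');
            out.push_str(key);
            out.push('\n');
        }
        out
    }

    /// Parse the format written by `serialize`; returns None on a malformed line.
    fn deserialize(text: &str) -> Option<Self> {
        let mut keys = HashMap::new();
        for line in text.lines() {
            let (name, key) = line.split_once(' ')?;
            keys.insert(name.to_string(), key.to_string());
        }
        Some(ThinLtoKeys { keys })
    }

    /// A post-LTO module is reusable only if the CGU has a key in both
    /// sessions and the two keys are identical.
    fn can_reuse(&self, prev: &ThinLtoKeys, cgu: &str) -> bool {
        match (prev.keys.get(cgu), self.keys.get(cgu)) {
            (Some(p), Some(c)) => p == c,
            _ => false,
        }
    }
}
```

In this sketch a CGU that is missing from either session is never reused, so a new or renamed CGU always goes through ThinLTO optimization again.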

This should unblock PR #75199 - by sheer chance, it seems to have hit
this issue due to the particular CGU partitioning and optimization
decisions that end up getting made.


rust-highfive · 2020-09-18T02:06:08Z

r? @petrochenkov

(rust_highfive has picked a reviewer for you, use r? to override)

Aaron1011 · 2020-09-18T02:08:10Z

@bors try @rust-timer queue

rust-timer · 2020-09-18T02:08:11Z

Awaiting bors try build completion

bors · 2020-09-18T02:08:21Z

⌛ Trying commit cfe07cd with merge f35705fb01020929514f970af7f0c5878c90cb37...

bors · 2020-09-18T02:52:53Z

☀️ Try build successful - checks-actions, checks-azure
Build commit: f35705fb01020929514f970af7f0c5878c90cb37

rust-timer · 2020-09-18T02:52:55Z

Queued f35705fb01020929514f970af7f0c5878c90cb37 with parent f3c923a, future comparison URL.

rust-timer · 2020-09-18T04:29:22Z

Finished benchmarking try commit (f35705fb01020929514f970af7f0c5878c90cb37): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never

petrochenkov · 2020-09-18T08:04:10Z

r? @pnkfelix

Mark-Simulacrum · 2020-10-07T14:13:36Z

cc @rust-lang/wg-incr-comp -- could we find a reviewer for this PR? It's blocking re-enabling debug assertions in CI on some builders (#75199).

davidtwco

I'm not too familiar with ThinLTO, but this all seems reasonable to me.

r=me unless you want another review

Aaron1011 · 2020-10-08T15:34:37Z

@nikic Does this look reasonable to you?

nikic · 2020-10-11T20:27:37Z

Looks very reasonable, and is hopefully the end of the road for this class of problems...

@bors r=davidtwco,nikic

bors · 2020-10-11T20:27:38Z

📌 Commit cfe07cd has been approved by davidtwco,nikic

bors · 2020-10-11T20:50:07Z

⌛ Testing commit cfe07cd with merge c71248b...

bors · 2020-10-11T22:41:36Z

☀️ Test successful - checks-actions, checks-azure
Approved by: davidtwco,nikic
Pushing c71248b to master...

Mark-Simulacrum · 2020-10-22T01:08:04Z

Presumably due to finer-grained tracking in some cases, this was a major improvement on some incremental tests. Of course, the correctness is much more important here :)

…youxu Call module_name_to_str instead of just unwrapping This makes the ICE message in rust-lang#130678 more clear. It looks like not calling this function was just an oversight in rust-lang#76859, but clearly not a major one because it's taken us 4 years to notice.


rust-highfive assigned petrochenkov Sep 18, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Sep 18, 2020

rust-highfive assigned pnkfelix and unassigned petrochenkov Sep 18, 2020

davidtwco approved these changes Oct 8, 2020

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 11, 2020

bors added the merged-by-bors This PR was explicitly merged by bors. label Oct 11, 2020

bors merged commit c71248b into rust-lang:master Oct 11, 2020

rustbot added this to the 1.49.0 milestone Oct 11, 2020

Aaron1011 mentioned this pull request Oct 12, 2020

Re-enable debug and LLVM assertions #75199

Merged

Aaron1011 mentioned this pull request Dec 13, 2020

Constantly receiving Undefined symbols for architecture x86_64 #79946

Open

saethlin mentioned this pull request Sep 21, 2024

Call module_name_to_str instead of just unwrapping #130680

Merged
