Add comments to explain memory usage optimization #78887

camelid · 2020-11-08T21:53:11Z

Add explanatory comments so that people understand that it's just an optimization and doesn't affect behavior.

rust-highfive · 2020-11-08T21:53:14Z

(rust_highfive has picked a reviewer for you, use r? to override)

camelid · 2020-11-08T21:57:44Z

compiler/rustc_mir/src/dataflow/framework/engine.rs

-            state.clone_from(&entry_sets[bb]);
+            let mut state = entry_sets[bb].clone();


IIUC state.clone_from(...) and state = (...).clone() should be equivalent in functionality, so this shouldn't change the behavior.

If we have a domain that can reuse its existing allocation when clone_from is called, this could matter, but I don't really know whether that can be the case.

Doesn't BitSet already do that?

Should we just do a perf run to check?

I'd rather not remove this optimization, even if it doesn't affect perf today

Hmm. I found this code confusing when I read through it since it makes it look like state is shared between each loop iteration, when in fact it's reinitialized for each iteration.
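To make the allocation question above concrete, here is a minimal sketch using a plain `Vec<u64>` as a stand-in for a dataflow domain (an assumption for illustration; the real domains in `rustc_mir` are types such as `BitSet`). The helper `clone_from_reuses_buffer` is hypothetical and only checks whether the destination keeps its heap buffer across `clone_from`.

```rust
/// Hypothetical helper: returns true if `dst.clone_from(src)` kept
/// `dst`'s existing heap buffer instead of allocating a new one.
fn clone_from_reuses_buffer(dst: &mut Vec<u64>, src: &Vec<u64>) -> bool {
    let before = dst.as_ptr();
    dst.clone_from(src);
    before == dst.as_ptr()
}

fn main() {
    let src: Vec<u64> = vec![1, 2, 3, 4];

    // The destination already owns a buffer large enough for `src`.
    let mut state: Vec<u64> = Vec::with_capacity(16);

    let reused = clone_from_reuses_buffer(&mut state, &src);
    assert_eq!(state, src);
    println!("buffer reused by clone_from: {}", reused);

    // By contrast, `state = src.clone()` builds a fresh Vec (a new
    // allocation) and then drops the old buffer held by `state`.
    state = src.clone();
    assert_eq!(state, src);
}
```

Both forms leave `state` equal to the source, which matches the point that behavior is unchanged; the difference is only in whether an existing buffer can be reused.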

lcnr · 2020-11-08T22:19:30Z

r? @jonas-schievink

jonas-schievink · 2020-11-08T23:40:04Z

@bors try @rust-timer queue

rust-timer · 2020-11-08T23:40:06Z

Awaiting bors try build completion

bors · 2020-11-08T23:40:18Z

⌛ Trying commit 172b60de3067c9b66bd111298fc898e739ebb8e6 with merge f79c7551c0ed89caa2987f63188f7a66b7eea242...

bors · 2020-11-09T00:24:17Z

☀️ Try build successful - checks-actions
Build commit: f79c7551c0ed89caa2987f63188f7a66b7eea242 (f79c7551c0ed89caa2987f63188f7a66b7eea242)

rust-timer · 2020-11-09T00:24:19Z

Queued f79c7551c0ed89caa2987f63188f7a66b7eea242 with parent 1773f60, future comparison URL.

rust-timer · 2020-11-09T02:01:27Z

Finished benchmarking try commit (f79c7551c0ed89caa2987f63188f7a66b7eea242): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot modify labels: +S-waiting-on-review -S-waiting-on-perf

camelid · 2020-11-09T02:49:10Z

The results seem pretty neutral. However, I've never analyzed perf results before, so take that with a grain of salt :)

jonas-schievink · 2020-11-09T11:43:44Z

There are small but clear regressions in a lot of benchmarks though?

camelid · 2020-11-09T18:13:02Z

Looking at wall-clock time, it seems that there were some improvements and some regressions. Btw, are there docs on how to interpret perf results?

jonas-schievink · 2020-11-09T18:33:05Z

Anything based on real-world time is going to be too noisy to be useful for a low-impact change like this. I don't think we have any docs on this at the moment (but many parts are not Rust-specific, they apply to interpreting profiles in general).

Regarding this PR, I don't think we want to land this as-is, but I'd be happy to r+ a patch that adds a short comment explaining why clone_from is used.

Mark-Simulacrum · 2020-11-09T18:43:10Z

I agree that this is a small, but fairly clear, regression in instruction counts on multiple benchmarks. Wall times are indeed too noisy for looking at changes like this.

We do not have significant documentation on performance triaging. It probably makes sense to write something, but I don't know if I can find the time to do so myself.

camelid · 2020-11-09T21:11:11Z

Okay, I'll change this so that it's just adding an explanatory comment :)

camelid · 2020-11-09T21:38:24Z

compiler/rustc_mir/src/dataflow/framework/engine.rs

@@ -208,12 +208,19 @@ where
            }
        }

+        // `state` is not actually used between iterations;
+        // this is just an optimization to avoid reallocating
+        // every iteration.
        let mut state = analysis.bottom_value(body);


Here's an idea for a further optimization: change this to

Suggested change

-        let mut state = analysis.bottom_value(body);
+        let mut state = MaybeUninit::uninit();

since it's always initialized with clone_from below. Might not be worth the unsafety though 🤷
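For readers unfamiliar with the pattern being commented on, the following is a toy sketch, not the actual engine code. `process_blocks`, the `Vec<bool>` entry sets, and the "transfer function" are all hypothetical stand-ins. It shows a `state` value declared outside the loop purely so that `clone_from` can reuse its buffer, while every iteration still starts from a fresh copy of that block's entry set. Note that `clone_from` takes `&mut self`, so the destination has to be an initialized value; the placeholder plays the role that `analysis.bottom_value(body)` plays in the diff.

```rust
/// Toy model of the loop: for each block, copy its entry set into a
/// scratch `state`, apply a trivial "transfer function", and record
/// how many bits are set afterwards.
fn process_blocks(entry_sets: &[Vec<bool>]) -> Vec<usize> {
    let mut counts = Vec::with_capacity(entry_sets.len());

    // `state` is not read across iterations; it lives outside the loop
    // only so its allocation can be reused. An empty Vec stands in for
    // the bottom value here (an assumption for this sketch).
    let mut state: Vec<bool> = Vec::new();

    for entry in entry_sets {
        // Overwrites everything left over from the previous iteration.
        state.clone_from(entry);

        // Hypothetical transfer function: set the first bit, if any.
        if let Some(first) = state.first_mut() {
            *first = true;
        }

        counts.push(state.iter().filter(|bit| **bit).count());
    }

    counts
}

fn main() {
    let entry_sets = vec![
        vec![false, false],
        vec![true, true, false],
        vec![],
    ];
    let counts = process_blocks(&entry_sets);
    assert_eq!(counts, vec![1, 2, 0]);
    println!("{:?}", counts);
}
```

The last block has an empty entry set and yields a count of 0, which shows that nothing from the previous iteration leaks through the shared `state` variable.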

jonas-schievink · 2020-11-09T22:06:02Z

@bors r+ rollup

bors · 2020-11-09T22:06:04Z

📌 Commit 0242f96 has been approved by jonas-schievink

…as-schievink

Rollup of 14 pull requests

Successful merges:

- rust-lang#76765 (Make it more clear what an about async fn's returns when referring to what it returns)
- rust-lang#78574 (Use check-pass instead of build-pass in regions ui test suite)
- rust-lang#78669 (Use check-pass instead of build-pass in some consts ui test suits)
- rust-lang#78847 (Assert that a return place is not used for indexing during integration)
- rust-lang#78854 (Workaround for "could not fully normalize" ICE )
- rust-lang#78875 (rustc_target: Further cleanup use of target options)
- rust-lang#78887 (Add comments to explain memory usage optimization)
- rust-lang#78890 (comment attribution fix)
- rust-lang#78896 (Clarified description of write! macro)
- rust-lang#78897 (Add missing newline to error message of the default OOM hook)
- rust-lang#78898 (add regression test for rust-lang#78892)
- rust-lang#78908 ((rustdoc) [src] link for types defined by macros shows invocation, not defintion)
- rust-lang#78910 (Fix links to stabilized versions of some intrinsics)
- rust-lang#78912 (Add macro test for min-const-generics)

Failed merges:

r? `@ghost`
`@rustbot` modify labels: rollup

camelid added A-mir-opt Area: MIR optimizations C-cleanup Category: PRs that clean code up or issues documenting cleanup. labels Nov 8, 2020

rust-highfive assigned lcnr Nov 8, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Nov 8, 2020

rust-highfive assigned jonas-schievink and unassigned lcnr Nov 8, 2020

camelid added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Nov 9, 2020

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Nov 9, 2020

Add comments to explain memory usage optimization

0242f96

camelid force-pushed the dataflow-state-decl branch from 172b60d to 0242f96 Compare November 9, 2020 21:35

camelid changed the title ~~Move state declaration into loop to make code clearer~~ Add comments to explain memory usage optimization Nov 9, 2020

camelid added A-docs Area: documentation for any part of the project, including the compiler, standard library, and tools and removed C-cleanup Category: PRs that clean code up or issues documenting cleanup. labels Nov 9, 2020

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 9, 2020

jonas-schievink mentioned this pull request Nov 10, 2020

Rollup of 14 pull requests #78920

Merged

bors merged commit a08e7af into rust-lang:master Nov 11, 2020

rustbot added this to the 1.49.0 milestone Nov 11, 2020

camelid deleted the dataflow-state-decl branch November 11, 2020 04:12
