Get piece unchecked in `write` #83302

camsteffen · 2021-03-19T14:34:04Z

We already use specialized zip, but it seems like we can do a little better by not checking pieces length at all.

Arguments constructors are now unsafe. So the format_args! expansion now includes an unsafe block.

Local Bench Diff

 name                        before ns/iter  after ns/iter  diff ns/iter   diff %  speedup 
 fmt::write_str_macro1       22,967          19,718               -3,249  -14.15%   x 1.16 
 fmt::write_str_macro2       35,527          32,654               -2,873   -8.09%   x 1.09 
 fmt::write_str_macro_debug  571,953         575,973               4,020    0.70%   x 0.99 
 fmt::write_str_ref          9,579           9,459                  -120   -1.25%   x 1.01 
 fmt::write_str_value        9,573           9,572                    -1   -0.01%   x 1.00 
 fmt::write_u128_max         176             173                      -3   -1.70%   x 1.02 
 fmt::write_u128_min         138             134                      -4   -2.90%   x 1.03 
 fmt::write_u64_max          139             136                      -3   -2.16%   x 1.02 
 fmt::write_u64_min          129             135                       6    4.65%   x 0.96 
 fmt::write_vec_macro1       24,401          22,273               -2,128   -8.72%   x 1.10 
 fmt::write_vec_macro2       37,096          35,602               -1,494   -4.03%   x 1.04 
 fmt::write_vec_macro_debug  588,291         589,575               1,284    0.22%   x 1.00 
 fmt::write_vec_ref          9,568           9,732                   164    1.71%   x 0.98 
 fmt::write_vec_value        9,516           9,625                   109    1.15%   x 0.99
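For readers checking the table, the last two columns appear to follow from the first two. This is a small sketch of that arithmetic; the function names are illustrative, and the formulas are an assumption that happens to reproduce the rows above.

```rust
/// Percentage change from `before` to `after` (negative means faster).
/// Assumed formula: (after - before) / before * 100.
pub fn diff_percent(before: f64, after: f64) -> f64 {
    (after - before) / before * 100.0
}

/// Speedup factor, assumed to be `before / after` (above 1.0 means faster).
pub fn speedup(before: f64, after: f64) -> f64 {
    before / after
}
```

For the `fmt::write_str_macro1` row, `diff_percent(22967.0, 19718.0)` is roughly -14.15 and `speedup(22967.0, 19718.0)` is roughly 1.16.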

rust-highfive · 2021-03-19T14:34:07Z

r? @sfackler

(rust-highfive has picked a reviewer for you, use r? to override)

joshtriplett · 2021-03-19T18:41:16Z

This seems a little surprising. Iterators like args.pieces.iter() should not need to do bounds checks, in general; they know their own bounds.

camsteffen · 2021-03-19T18:45:16Z

This does not remove a bounds check within the loop. Only a length check of pieces when initializing ZipImpl.
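A simplified sketch of the two loop shapes being compared may help here. It uses plain string slices as stand-ins for the real `pieces` and `args`, so the function names and types are assumptions and not the actual `core::fmt` internals.

```rust
/// Zip-based loop: the iterator looks at both slice lengths when it is set up,
/// then walks the pairs without further index checks.
pub fn render_zip(pieces: &[&str], args: &[&str]) -> String {
    let mut out = String::new();
    for (piece, arg) in pieces.iter().zip(args.iter()) {
        out.push_str(piece);
        out.push_str(arg);
    }
    // Optional trailing piece after the last argument.
    if let Some(piece) = pieces.get(args.len()) {
        out.push_str(piece);
    }
    out
}

/// Unchecked variant: only `args` drives the loop and `pieces` is never
/// measured inside it.
///
/// # Safety
/// The caller must guarantee `pieces.len() >= args.len()`.
pub unsafe fn render_unchecked(pieces: &[&str], args: &[&str]) -> String {
    let mut out = String::new();
    for (i, arg) in args.iter().enumerate() {
        // SAFETY: the caller promised `i < pieces.len()` for every `i < args.len()`.
        let piece = unsafe { pieces.get_unchecked(i) };
        out.push_str(piece);
        out.push_str(arg);
    }
    if let Some(piece) = pieces.get(args.len()) {
        out.push_str(piece);
    }
    out
}
```

Both functions produce the same output for well-formed input; the second one moves the length requirement from the loop setup to the caller.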

sfackler · 2021-03-20T17:15:07Z

Arguments can be safely (if unstably) constructed, so this is unsound as-is: https://doc.rust-lang.org/src/core/fmt/mod.rs.html#317-332

camsteffen · 2021-03-20T21:20:26Z

I added assertions. I think that shouldn't be an issue, unless someone is using fmt_internals erroneously. Another local bench run was all the same or better.
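As an illustration of the assertion approach, here is a minimal sketch with a hypothetical `Args` type: a safe constructor checks the length invariant once, so later unchecked indexing can rely on it. The exact condition and panic message are assumptions, not a copy of the PR's code.

```rust
pub struct Args<'a> {
    // Private fields: the invariant checked in `new` cannot be broken later.
    pieces: &'a [&'a str],
    args: &'a [&'a str],
}

impl<'a> Args<'a> {
    /// Safe constructor that panics on mismatched lengths.
    pub fn new(pieces: &'a [&'a str], args: &'a [&'a str]) -> Self {
        assert!(
            pieces.len() >= args.len() && pieces.len() <= args.len() + 1,
            "invalid pieces/args lengths"
        );
        Args { pieces, args }
    }

    pub fn render(&self) -> String {
        let mut out = String::new();
        for (i, arg) in self.args.iter().enumerate() {
            // SAFETY: `new` asserted `pieces.len() >= args.len()`.
            let piece = unsafe { self.pieces.get_unchecked(i) };
            out.push_str(piece);
            out.push_str(arg);
        }
        if let Some(piece) = self.pieces.get(self.args.len()) {
            out.push_str(piece);
        }
        out
    }
}
```

The check runs once per construction, not once per piece, which is consistent with the report that benchmarks stayed the same or improved.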

bors · 2021-03-27T22:45:23Z

☔ The latest upstream changes (presumably #83580) made this pull request unmergeable. Please resolve the merge conflicts.

camsteffen · 2021-04-06T16:21:01Z

Now any unsafe block inside the macro input, like format_args!("..", unsafe { .. }), is flagged as redundant, which is causing test failures. To fix this, I think I would have to change format_args! to expand to, for example:

// format_args!("hello {}", world)
match match (&world,) {
    (arg0,) => [::core::fmt::ArgumentV1::new(
        arg0,
        ::core::fmt::Display::fmt,
    )],
} {
    ref args => unsafe { ::core::fmt::Arguments::new_v1(&["hello "], args) },
};

Thoughts?
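The nesting problem can be reproduced with ordinary `macro_rules!` macros. The sketch below uses hypothetical stand-ins (`build`, `user_value`, `wide!`, `narrow!`) and is not the real `format_args!` expansion.

```rust
/// Stand-in for an unsafe constructor such as the one discussed above.
unsafe fn build(len: usize) -> usize {
    len
}

/// Stand-in for an unsafe operation written by the macro's caller.
unsafe fn user_value() -> usize {
    3
}

// Wide expansion: the unsafe block also encloses the caller's expression.
macro_rules! wide {
    ($e:expr) => {
        unsafe { build($e) }
    };
}

// Narrow expansion: the caller's expression is evaluated first, and only the
// call to `build` sits inside the unsafe block.
macro_rules! narrow {
    ($e:expr) => {
        match $e {
            arg => unsafe { build(arg) },
        }
    };
}

pub fn via_wide() -> usize {
    // Compiles with no `unsafe` at the call site, because the macro's block
    // covers the input. Writing `wide!(unsafe { user_value() })` instead would
    // make the caller's block nested and trigger the `unused_unsafe` lint.
    wide!(user_value())
}

pub fn via_narrow() -> usize {
    // The caller's own unsafe block is required here and is not redundant.
    narrow!(unsafe { user_value() })
}
```

The `narrow!` shape mirrors the idea of the proposed expansion: an extra `match` binds the evaluated input before the unsafe block begins.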

crlf0710 · 2021-04-23T13:53:34Z

@camsteffen Ping from triage, CI is still red here. Maybe you can use any change necessary to get CI green, so we can pass this on to the reviewer?

JohnCSimon · 2021-05-17T03:07:33Z

camsteffen requested a review from sfackler 16 days ago

@rustbot label: +S-waiting-on-review -S-waiting-on-author

rylev · 2021-09-01T08:23:54Z

This caused a large regression in instruction counts when testing the compiler. This is largely a change in std lib code, and we don't really have a good process for dealing with how std lib changes affect performance.

This seems to be primarily affecting debug and check builds, but there doesn't seem to be a query that is clearly to blame here. Given the motivation of this PR is primarily performance, I think it deserves a closer look.

As part of the performance triage process, I'm marking this as a performance regression. If you believe this performance regression to be justifiable or once you have an issue or PR that addresses this regression, please mark this PR with the label perf-regression-triaged.

@rustbot label +perf-regression

camsteffen · 2021-09-03T15:39:59Z

The regression seems to be caused by the change in the format_args! expansion which is now more complex in order to have an unsafe block that surrounds Arguments::new_v1 but does not surround the format_args! macro input. I reverted that particular change as an experiment in #88571 and that seemed to fix the perf regression. I don't know how to get a closer look at what is going on with style-servo-debug and would be interested in any pointers.

Mark-Simulacrum · 2021-09-20T14:42:37Z

Hm. So #88571 does look to fix the majority of the regression here, but I suppose we cannot land it as-is due to the unsafe nesting causing some problems with spurious lints.

I would lean towards reverting this PR; the small wins in runtime performance do not seem worth it given that fmt isn't optimized for performance anyway (indirect calls, can't get inlined, etc.). The compile time regressions are not insignificant.

camsteffen · 2021-09-20T19:21:24Z

My only reservation with reverting is that the internal fmt API was already unsafe before this PR added the unsafe modifiers.

Mark-Simulacrum · 2021-09-21T02:08:10Z

I.. sort of doubt that the get_unchecked there matters that much either, so I'd at least be fine with dropping it too (in favor of safety) as part of a pseudo-revert.

Mark-Simulacrum · 2021-09-21T02:11:00Z

Looks like the justification is noted in the commit (so nice!) -- d80f127. I think that makes sense, but we could likely get away with something "not wrong" as a result without invoking the full panic machinery. e.g., get().unwrap_or("") may work OK in some cases.
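A minimal sketch of the `get().unwrap_or("")` idea, using plain string slices as stand-ins for the real types (the function name is hypothetical): a missing piece becomes an empty string, so a malformed input yields odd output but neither panics nor reads out of bounds.

```rust
pub fn render_with_fallback(pieces: &[&str], args: &[&str]) -> String {
    let mut out = String::new();
    for (i, arg) in args.iter().enumerate() {
        // Checked lookup; an out-of-range index falls back to "".
        let piece = pieces.get(i).copied().unwrap_or("");
        out.push_str(piece);
        out.push_str(arg);
    }
    // Optional trailing piece after the last argument.
    if let Some(piece) = pieces.get(args.len()) {
        out.push_str(piece);
    }
    out
}
```

With one piece and two arguments, `render_with_fallback(&["a="], &["1", "2"])` returns `"a=12"`: "not wrong" in the memory-safety sense, with no panic path.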

We could also move the unsafety as an assertion into a ZST that the caller must construct via an unsafe method, which would get compiled out but also allow us to put the unsafe block in a more local place, rather than wrapping the whole function call.
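One way to picture the ZST suggestion is sketched below with hypothetical names (`UnsafeToken`, `Args`, `hello`). The token has no size, so passing it costs nothing at runtime, and the only `unsafe` block the caller writes is the one that creates it.

```rust
/// Zero-sized proof that the caller has promised the length invariant.
#[derive(Clone, Copy)]
pub struct UnsafeToken {
    _private: (),
}

impl UnsafeToken {
    /// # Safety
    /// The `pieces` and `args` later passed alongside this token must satisfy
    /// `pieces.len() >= args.len()`.
    pub unsafe fn new() -> Self {
        UnsafeToken { _private: () }
    }
}

pub struct Args<'a> {
    pieces: &'a [&'a str],
    args: &'a [&'a str],
}

impl<'a> Args<'a> {
    /// Safe to call, but impossible to call without a token.
    pub fn new(pieces: &'a [&'a str], args: &'a [&'a str], _token: UnsafeToken) -> Self {
        Args { pieces, args }
    }

    pub fn render(&self) -> String {
        let mut out = String::new();
        for (i, arg) in self.args.iter().enumerate() {
            // SAFETY: guaranteed by whoever created the `UnsafeToken`.
            let piece = unsafe { self.pieces.get_unchecked(i) };
            out.push_str(piece);
            out.push_str(arg);
        }
        if let Some(piece) = self.pieces.get(self.args.len()) {
            out.push_str(piece);
        }
        out
    }
}

/// What a macro expansion could look like: the unsafe block wraps only the
/// token, not the caller's argument expressions.
pub fn hello(name: &str) -> String {
    let pieces = ["hello ", "!"];
    let args = [name];
    // SAFETY: two pieces, one argument.
    let token = unsafe { UnsafeToken::new() };
    Args::new(&pieces, &args, token).render()
}
```

Because the unsafe block is local to the token, an `unsafe { .. }` written inside the caller's arguments would no longer be nested in it.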

Use ZST for fmt unsafety as suggested here - rust-lang#83302 (comment).

camsteffen · 2021-10-04T13:35:09Z

@Mark-Simulacrum I'm thinking the next step here for perf triage is to do a perf run reverting this and the follow-up PRs?

Mark-Simulacrum · 2021-10-04T13:53:29Z

I think #89139 definitely helped with some of the regressions caused by this PR. I think a trial revert would still be good, though, to estimate how much of a performance hit this is with the mitigations in place (like ZST-based unsafe).

camsteffen · 2021-10-04T18:50:06Z

The revert perf run looks good! #89521 (comment)

Mark-Simulacrum · 2021-10-04T19:10:57Z

Hm, the revert results feel a little unexpected. Unless there's been unrelated performance optimization that mitigates much of the regression seen in this PR, we would expect the revert to look equivalent, right? But instead we see barely any improvement.

rust-highfive assigned sfackler Mar 19, 2021

rust-highfive added the S-waiting-on-review label Mar 19, 2021

sfackler reviewed Mar 22, 2021

library/core/src/fmt/mod.rs (outdated, resolved)

camsteffen force-pushed the write-piece-unchecked branch from 1e04a07 to dfb16ef Compare March 23, 2021 02:08

camsteffen force-pushed the write-piece-unchecked branch from dfb16ef to a24ced6 Compare March 28, 2021 00:40

sfackler reviewed Mar 29, 2021

library/core/src/fmt/mod.rs (outdated, resolved)

camsteffen force-pushed the write-piece-unchecked branch 3 times, most recently from 5eab7b6 to e8ec0bb Compare April 6, 2021 14:42

camsteffen force-pushed the write-piece-unchecked branch from e8ec0bb to 18d508c Compare April 6, 2021 15:16

crlf0710 added the S-waiting-on-author label and removed the S-waiting-on-review label Apr 23, 2021

camsteffen force-pushed the write-piece-unchecked branch from 18d508c to 6a1fa08 Compare April 29, 2021 17:39

camsteffen requested a review from sfackler May 1, 2021 02:29

bors added the merged-by-bors label Aug 24, 2021

bors merged commit de42550 into rust-lang:master Aug 24, 2021

rustbot added this to the 1.56.0 milestone Aug 24, 2021

camsteffen deleted the write-piece-unchecked branch August 24, 2021 01:44

bors mentioned this pull request Aug 24, 2021

Make format_args!("literal") const. #87005

Closed

bjorn3 mentioned this pull request Aug 25, 2021

Standard macros reported as missing-unsafe rust-lang/rust-analyzer#10022

Closed

rustbot added the perf-regression label Sep 1, 2021

camsteffen mentioned this pull request Sep 1, 2021

Perf experiment for "Get piece unchecked in write" #88571

Closed

camsteffen mentioned this pull request Sep 21, 2021

Use ZST for fmt unsafety #89139

Merged

bors added a commit to rust-lang-ci/rust that referenced this pull request Sep 23, 2021

Auto merge of rust-lang#89139 - camsteffen:write-perf, r=Mark-Simulacrum

67365d6

Use ZST for fmt unsafety as suggested here - rust-lang#83302 (comment).

flip1995 pushed a commit to flip1995/rust-clippy that referenced this pull request Sep 28, 2021

Auto merge of #89139 - camsteffen:write-perf, r=Mark-Simulacrum

edaeacf

Use ZST for fmt unsafety as suggested here - rust-lang/rust#83302 (comment).

camsteffen mentioned this pull request Oct 4, 2021

Perf experiment for "Get piece unchecked in write" #89521

Closed

Mark-Simulacrum added the perf-regression-triaged label Oct 26, 2021

davidlattimore mentioned this pull request Jul 8, 2022

Remove unnecessary unsafe from format_args expansion rust-lang/rust-analyzer#12719

Merged
