Verification time increased by 7X on an s2n-quic harness #1693

zhassan-aws · 2022-09-17T06:54:38Z

Noticed today that s2n-quic's vectored_copy_fuzz_test harness takes 30 minutes to prove, even though a few weeks ago, it only took 4 minutes. The culprit commit turned out to be ab0f2ad from Aug 18. With this commit, it takes 30 minutes, but with the previous commit (8b1d18c), it takes 4 minutes.

To reproduce, do:

git checkout 8b1d18c436353a4428f58be268d1a2df04015faf
cargo clean
cargo build --workspace
cd tests/perf/s2n-quic/quic/s2n-quic-core
cargo kani --harness vectored_copy_fuzz_test
This produces:

SUMMARY:
 ** 0 of 664 failed (1 unreachable)

VERIFICATION:- SUCCESSFUL

Verification Time: 214.41763s

Do git checkout ab0f2add72b9fd49a8410b7de981702cc3e784d6
Repeat steps 2 - 5
Output:


SUMMARY:
 ** 0 of 664 failed (1 unreachable)

VERIFICATION:- SUCCESSFUL

Verification Time: 1803.3752s

The text was updated successfully, but these errors were encountered:

zhassan-aws · 2022-09-19T17:42:20Z

@tedinski, any ideas what may have caused the performance slowdown in ab0f2ad?

tedinski · 2022-09-19T22:10:18Z

That PR turned assert into assert-assume. Normally we'd think of it as pruning branches, and so I'd assume it'd improve performance if anything.

But perhaps it's complicating the state with new assumptions along the paths it does follow.

zhassan-aws · 2022-11-10T17:19:36Z

Summarizing the current status of the investigation:

There are at least a couple of commits that contributed to the increase in verification time:

ab0f2ad (4 minutes to 30 minutes): in particular the change from assert to assert-assume for the assume intrinsic:
ab0f2ad#diff-77bb4928612d2e4c4cd8a6ff43124d05f89d423aaa128656e5a9635dd01e2648R443
and codegen_offset:
ab0f2ad#diff-77bb4928612d2e4c4cd8a6ff43124d05f89d423aaa128656e5a9635dd01e2648R1060

Rewinding back to this commit and changing those two codegen_assert_assume to codegen_assert bring the time down to 4 minutes. However, making those two changes off of more recent commits (e.g. 1f40c65), only brings down the time from 60 minutes to 30 minutes.

We tried introducing a temporary variable for the condition that is then used in the assert and assume, but that didn't make a difference.

07092de (30 minutes to 60 minutes): The rust toolchain update included a change in niche optimization, which is likely the cause of the performance regression.

zhassan-aws · 2022-12-21T05:03:18Z

With kissat, this harness completes in ~1 minute:

$ cargo kani --tests --harness vectored_copy_fuzz_test --no-assertion-reach-checks --enable-unstable --cbmc-args --external-sat-solver kissat
...
SUMMARY:
 ** 0 of 723 failed

VERIFICATION:- SUCCESSFUL
Verification Time: 59.52028s

I had to disable reachability checks though (--no-assertion-reach-checks) because of #1962. However, even if I disable them but use the default SAT solver, it takes close to an hour:

$ cargo kani --tests --harness vectored_copy_fuzz_test --no-assertion-reach-checks
...
SUMMARY:
 ** 0 of 723 failed

VERIFICATION:- SUCCESSFUL
Verification Time: 3266.809s

zhassan-aws · 2023-02-15T23:16:51Z

@tautschnig These are the current results with 9439a54 and with CBMC diffblue/cbmc@42d5ce2 built with MiniSAT 2 and CaDiCaL.

With MiniSAT:

cargo kani --tests --harness vectored_copy_fuzz_test
Verification Time: 2485.4717s

With CaDiCaL:

$ cargo kani --tests --harness vectored_copy_fuzz_test --enable-unstable --cbmc-args --sat-solver cadical
Verification Time: 102.70173s

With Kissat:

$ cargo kani --tests --harness vectored_copy_fuzz_test --enable-unstable --cbmc-args --external-sat-solver kissat
Verification Time: 121.14342s

So CaDiCaL is indeed slightly faster.

All results can be reproduced by doing cd tests/perf/s2n-quic/quic/s2n-quic-core and running the specified command.

tautschnig · 2023-08-18T07:59:25Z

Resolving with following rationale:

We understand that it is the assert-followed-by-assume that increased the runtime (ab0f2ad), but we do still want to keep that change.
The increased runtime is only observed with MiniSat, which no longer is the default with Kani (as of Change default solver to CaDiCal #2654).
The s2n-quic harness has long ago been enabled by first choosing Kissat (tests(s2n-quic-core): update kani to 0.21 aws/s2n-quic#1628) and later CaDiCaL (Use cadical solver for some Kani harnesses aws/s2n-quic#1771). It's now part of our performance benchmarking that we do in CI.

zhassan-aws added [C] Bug This is a bug. Something isn't working. [E] Performance Track performance improvement (Time / Memory / CPU) labels Sep 17, 2022

zhassan-aws self-assigned this Oct 24, 2022

zhassan-aws added this to Kani 0.14 Oct 24, 2022

rahulku added this to the Maintenance milestone Oct 25, 2022

rahulku moved this to In Progress in Kani 0.14 Oct 25, 2022

adpaco-aws added the T-High Priority Tag issues that have high priority label Nov 2, 2022

zhassan-aws added this to Kani 0.15 Nov 3, 2022

zhassan-aws added the T-User Tag user issues / requests label Nov 15, 2022

zhassan-aws mentioned this issue Dec 6, 2022

Provide an option to disable automatically-generated assumptions #1966

Open

zhassan-aws assigned tautschnig Dec 13, 2022

tautschnig closed this as completed Aug 18, 2023

github-project-automation bot moved this from In Progress to Done in Kani 0.14 Aug 18, 2023

github-project-automation bot moved this to Done in Kani 0.15 Aug 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verification time increased by 7X on an s2n-quic harness #1693

Verification time increased by 7X on an s2n-quic harness #1693

zhassan-aws commented Sep 17, 2022 •

edited

Loading

zhassan-aws commented Sep 19, 2022

tedinski commented Sep 19, 2022

zhassan-aws commented Nov 10, 2022

zhassan-aws commented Dec 21, 2022

zhassan-aws commented Feb 15, 2023

tautschnig commented Aug 18, 2023

Verification time increased by 7X on an s2n-quic harness #1693

Verification time increased by 7X on an s2n-quic harness #1693

Comments

zhassan-aws commented Sep 17, 2022 • edited Loading

zhassan-aws commented Sep 19, 2022

tedinski commented Sep 19, 2022

zhassan-aws commented Nov 10, 2022

zhassan-aws commented Dec 21, 2022

zhassan-aws commented Feb 15, 2023

tautschnig commented Aug 18, 2023

zhassan-aws commented Sep 17, 2022 •

edited

Loading