SignaturePruning: Properly handle public types #6630

kripken · 2024-05-29T17:51:00Z

The SignaturePruning pass optimizes away parameters that it proves are safe to
remove. It turns out that that does not always match the definition of private
types, which is more restrictive. Specifically, if say all the types are in one big
rec group and one of them is used on an exported function then all of them are
considered public (as the rec group is). However, in closed world, it would be ok
to leave that rec group unchanged but to create a pruned version of that type
and use it, in cases where we see it is safe to remove a parameter. (See the
testcase in the PR for a concrete example.)

To put it another way, SignaturePruning already proves that a parameter is
safe to remove in all the ways that matter. Before this PR, however, the testcase
in this PR would error - so this PR is not an optimization but a bugfix, really -
because SignaturePruning would see that a parameter is safe to remove but
then TypeUpdating would see the type is public and so it would leave it alone,
leading to a broken module.

This situation is in fact not that rare, and happens on real-world Java code.
The reason we did not notice it before is that typically there are no remaining
SignaturePruning opportunities late in the process (when other closed world
optimizations have typically led to a single big rec group).

The concrete fix here is to add additionalPrivateTypes to a few more places
in TypeUpdating. We already supported that for cases where a pass knew
better than the general logic what can be modified, and this adds that
ability to the signature-rewriting logic there. Then SignaturePruning can
send in all the types it has proven are safe to modify.

Also necessary here is to only add from additionalPrivateTypes if the type
is not already in our list (or we'd end up with duplicates in the final rec
group).
Also move newSignatures in SignaturePruning out of the top level, which
was confusing (the pass has multiple iterations, and we want each to have
a fresh instance).

tlively · 2024-05-29T18:06:06Z

test/lit/passes/signature-pruning.wast

+   ;; CHECK:      (rec
+   ;; CHECK-NEXT:  (type $none (func))
+   (type $none (func))
+   ;; CHECK:       (type $much (func (param i32)))


It looks like the type name is repeated in the output, which we shouldn't allow.

Yeah, that's a quirk we have atm. It's actually useful in cases where the old and new types remain in use (like here), but we should probably ensure a new unique name. I can look into that separately.

I would expect us to have problems with text round trip fuzzing before this is fixed.

I don't see issues in fuzzing so far, but this is a pre-existing issue, so that isn't surprising. It must take specific fuzzer luck to get it.

Experimenting with a patch that deduplicates the names, the problem is that it will generate a large diff on existing tests (before this PR; unrelated) regardless of whether we deduplicate the old or the new name. We seem to just have enough cases that both remain in use. The diff one way is 66 K and the other is 276 K, so at least one is clearly less annoying, but it will still be a quite large change unfortunately.

src/passes/SignaturePruning.cpp

Co-authored-by: Thomas Lively <[email protected]>

tlively

LGTM besides that one issue that can be fixed separately.

…6642) We need StringLowering to modify even public types, as it must replace every single stringref with externref, even if that modifies the ABI. To achieve that we told it that all string-using types were private, which let TypeUpdater update them, but the problem is that it moves all private types to a new single rec group, which meant public and private types ended up in the same group. As a result, a single public type would make it all public, preventing optimizations and breaking things as in #6630 #6640. Ideally TypeUpdater would modify public types while keeping them in the same rec groups, but this may be a very specific issue for StringLowering, and that might be a lot of work. Instead, just make StringLowering handle public types of functions in a manual way, which is simple and should handle all cases that matter in practice, at least in J2Wasm.

kripken added 10 commits May 28, 2024 14:43

fix

54d8e73

hmm

6487897

fix

9da9900

close

cba4a85

work

1c72e1e

fix

fceb73a

format

5f82586

comment

a4904ed

wat

e3fb49c

fastr

56900e6

kripken requested a review from tlively May 29, 2024 17:51

tlively reviewed May 29, 2024

View reviewed changes

Update src/passes/SignaturePruning.cpp

2213028

Co-authored-by: Thomas Lively <[email protected]>

tlively approved these changes May 29, 2024

View reviewed changes

kripken merged commit b85197c into WebAssembly:main May 29, 2024
13 checks passed

kripken deleted the sig.prune.private branch May 29, 2024 23:48

This was referenced Jun 4, 2024

GTO problem with shared rec groups #6640

Open

[Strings] Keep public and private types separate in StringLowering #6642

Merged

gkdn mentioned this pull request Aug 31, 2024

stringconsts gkdn/binaryen#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SignaturePruning: Properly handle public types #6630

SignaturePruning: Properly handle public types #6630

kripken commented May 29, 2024

tlively May 29, 2024

kripken May 29, 2024

tlively May 29, 2024

kripken May 29, 2024

kripken May 29, 2024

tlively left a comment

SignaturePruning: Properly handle public types #6630

SignaturePruning: Properly handle public types #6630

Conversation

kripken commented May 29, 2024

tlively May 29, 2024

Choose a reason for hiding this comment

kripken May 29, 2024

Choose a reason for hiding this comment

tlively May 29, 2024

Choose a reason for hiding this comment

kripken May 29, 2024

Choose a reason for hiding this comment

kripken May 29, 2024

Choose a reason for hiding this comment

tlively left a comment

Choose a reason for hiding this comment