Relaxing closed world validation and improving open world optimization #6965

tlively · 2024-09-23T21:11:53Z

The --closed-world flag lets us assume that we can make arbitrary changes to types as long as those types are not part of the module's contract with the outside world. Since the type system is structural, there is not a single, precise definition of what it means for a type to be "part of the module's contract," but we have chosen it to mean that we will keep the types of exported or imported module elements the same, but all other types are fair game. In particular, we assume we are allowed to modify subtypes of public types that are not themselves public. Otherwise a single anyref in an exported function would prevent us from modifying any struct or array type and a single funcref in an eported function would prevent us from modifying signatures of referenced functions.

However, our current closed-world validation is much stricter than this. It additionally restricts what types are allowed to be public. It allows the types of exported and imported functions to be public, and therefore must also allow all types in the rec groups of those function types to be public, but it does not allow any other defined heap types to be public, even if they are part of the type of an imported or exported function.

I believe the original motivation for these additional restrictions was that we wanted to be able to optimize as many types as possible, so we didn't want to allow users to expose types in a way that would inhibit optimizations. But this is putting the cart before the horse. We should be able to optimize any module we are given according to the assumptions configured via command line options, and there is no user benefit if we simply reject modules that they want to optimize because we cannot optimize it as well as some different module they could have given us. Users (such as Kotlin) are running into these errors when they try to use smaller rec groups in their input.

Here is the state of the world I would like to move to:

All types are allowed to be used in a module's public interface in both open and closed world modes.
The only difference between open and closed world modes is whether subtypes of public types are considered public by default.
All type optimization passes work equally well in both modes, using the classified public and private types as their sources of truth for what types are allowed to be modified.
We have a (@private) type annotation that allows types that would otherwise be considered public to be considered private instead. It is an error for a (@private) type to be used in a module's public interface, so this is only useful for annotating subtypes of public types in open world mode.
We have a (@public) type annotation that allows types that would otherwise be considered private to be considered public instead.
Neither type annotation affects how public visibility propagates to subtypes.
It is an error if a single module annotates the same type as both (@private) and (@public) (even if the annotations are on different definitions of the same type).

Here are the steps necessary to get to that state of the world:

Add a temporary --relaxed-closed-world flag that behaves like --closed-world but allows any type to be public.
Get the fuzzer running cleanly with --relaxed-closed-world instead of --closed-world.
Remove --relaxed-closed-world and allow any type to be public with --closed-world.
Implement propagation of public visibility to subtypes in open world mode.
Design a custom section framework for arbitrary type annotations like we have for code annotations.
Implement (@private) and (@public) annotations.

@kripken, WDYT?

The text was updated successfully, but these errors were encountered:

kripken · 2024-09-23T22:57:30Z

Sounds good!

Might be worth mentioning externref here. I assume an exported/imported externref is handled similarly to anyref?
We want to still preserve the key property in closed world that one can send a reference out but the outside cannot inspect (for an array or struct) or call (for a function) that ref. That is, that the outside can cache the reference and send it back in, but not interact with it. Atm in closed world we achieve that by sending out anyref/externref, and not the specific GC type, but maybe there's a better way, e.g., sending out the specific GC type but annotating it as private. I don't feel strongly here.

tlively · 2024-09-24T00:47:44Z

Might be worth mentioning externref here. I assume an exported/imported externref is handled similarly to anyref?

Yes, good point. Externrefs in the public interface should be treated as though they were also anyrefs and vice versa.

We want to still preserve the key property in closed world that one can send a reference out but the outside cannot inspect (for an array or struct) or call (for a function) that ref. That is, that the outside can cache the reference and send it back in, but not interact with it. Atm in closed world we achieve that by sending out anyref/externref, and not the specific GC type, but maybe there's a better way, e.g., sending out the specific GC type but annotating it as private. I don't feel strongly here.

I think that use case will have to continue using abstract heap types like anyref and externref on the boundary. If we allowed a defined type to be passed out directly, then even if we assume the environment will not access it directly, changing it would still change the type of the function that passes it out. That's fine in a JS embedder, but not in any statically typed embedder. If we want to allow this anyway, we could use the (@private) annotation and not make it an error to use (@private) types in the module interface.

These were added to avoid common problems with closed world mode, but in practice they are causing more harm than good, forcing users to work around them. In the meantime (until #6965), remove this validation to unblock current toolchain makers. Fix GlobalTypeOptimization and AbstractTypeRefining on issues that this uncovers: without this validation, it is possible to run them on more wasm files than before, hence these were not previously detected. They are bundled in this PR because their tests cannot validate before this PR.

This was referenced Oct 17, 2024

wasm-opt error when closed-world option enabled #7015

Open

[GC] Ignore public types in GlobalTypeOptimization #7017

Merged

kripken mentioned this issue Oct 17, 2024

Remove closed world validation checks #7019

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relaxing closed world validation and improving open world optimization #6965

Relaxing closed world validation and improving open world optimization #6965

tlively commented Sep 23, 2024

kripken commented Sep 23, 2024

tlively commented Sep 24, 2024 •

edited

Loading

Relaxing closed world validation and improving open world optimization #6965

Relaxing closed world validation and improving open world optimization #6965

Comments

tlively commented Sep 23, 2024

kripken commented Sep 23, 2024

tlively commented Sep 24, 2024 • edited Loading

tlively commented Sep 24, 2024 •

edited

Loading