-
Notifications
You must be signed in to change notification settings - Fork 12.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal: stabilize const_refs_to_static
#128183
Comments
@rustbot labels +I-lang-nominated +I-types-nominated T-lang T-types Hello @rust-lang/lang and @rust-lang/wg-const-eval (cc @rust-lang/types), I am proposing to stabilize the Procedural note: I am not sure if this is T-lang or T-types or both, so I've opted to nominate and tag for both teams. |
@rustbot labels -T-types On the procedural question, in earlier discussion, @oli-obk said:
...and this sounds right to me. To my eyes, this seems distinctly a lang matter, and that's also how the tracking issue was tagged by @RalfJung and how it has remained tagged. So I'm going to pull off the |
@rfcbot fcp merge This stabilization looks correct and conservative to me. The analysis by @nikomatsakis is thorough and answers all the questions I had. I don't see any doors of any concern that we're closing here. This feature is desirable. It's known to be useful for Rust-for-Linux and will be useful for async-related purposes in the standard library. Thanks to @nikomatsakis for putting together this careful write-up and to @RalfJung, @oli-obk, @BoxyUwU, and others for helping through discussion to shape and refine it. |
Team member @traviscross has proposed to merge this. The next step is review by the rest of the tagged team members: Concerns:
Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up! cc @rust-lang/lang-advisors: FCP proposed for lang, please feel free to register concerns. |
The doesn't seem quite right to me 🤔 It's totally fine for const evaluation to result in values containing use std::cell::UnsafeCell;
const FOO: UnsafeCell<u32> = UnsafeCell::new(1);
fn main() {
unsafe { *FOO.get() = 2 };
unsafe { assert_eq!(*FOO.get(), 1) };
} It's also fine for const eval to access the content of an #![feature(const_refs_to_cell)]
#![feature(const_mut_refs)]
use std::cell::UnsafeCell;
const S: u32 = {
let x = UnsafeCell::new(1);
unsafe { *x.get() }
}; I guess the rules should be something like the following?
|
Specifically, const-eval can never read from (or return a reference to) any mutable static -- that includes And indeed it cannot return a reference to |
as-of #125834 the final examples can be written #![feature(const_refs_to_static)]
#![feature(const_mut_refs)]
extern {
static S_EXT: u32;
}
static mut S_MUT: u32 = 0;
// OK
const EXT: *const u32 = std::ptr::addr_of!(S_EXT);e
// OK, requires feature(const_mut_refs)
const MUT: *mut u32 = std::ptr::addr_of_mut!(S_MUT); i.e. they are not |
🔔 This is now entering its final comment period, as per the review above. 🔔 |
I would phrase this as: Const evaluation is not allowed to
"Safely reference" here refers to recursively traversing the value in the same way safe code could (but ignoring visibility), i.e. recursing through references but not through raw pointers or unions. |
Thanks @RalfJung, I'll update. |
@rustbot labels -I-lang-nominated We discussed this in lang triage today. People felt good about this, and it's now in FCP. |
Sorry, I'd like to review this before stabilization but haven't had time yet. @rfcbot concern tmandry review |
🔔 This is now entering its final comment period, as per the review above. 🔔 |
The final comment period, with a disposition to merge, as per the review above, is now complete. As the automated representative of the governance process, I would like to thank the author for their work and everyone else who contributed. This will be merged soon. |
…efs-to-static, r=petrochenkov Stabilize `const_refs_to_static` Close rust-lang#128183 Tracked by rust-lang#119618 cc `@nikomatsakis` Meanwhile, I am cooking a sub-section in the language reference.
Write-up source: https://hackmd.io/It7MysqPRzeuQZ2RZevz9w?edit
Summary
This document proposes to stabilize refs-to-static-in-constants. This feature permits one to create a constant expression that references a static:
On stable Rust, this stabilization does not introduce any surprising behavior. The resulting constants
C
will be equal to the address of&VT
at runtime -- note that this value is not knowable at compilation time (certainly not early on in the compilation) and so must be represented abstractly (i.e., the compiler thinks of the value ofC
as "the address ofVT
", whereasstd::ptr::null()
is an example of a constant pointer whose value is known to be 0).Given the limited surface area, this stabilization has no interactions with stable const generics. However, it does have some implications for future const generics; those are discussed in the Future interactions section. The conclusion is that supporting refs-to-statics does not introduce new challenges for const generics that were not already present in some other form.
Procedural note
Const-refs-to-statics never had an RFC. This stabilization report could be made into an RFC if we think that makes sense. Niko's general opinion is that incremental extensions to const generics do not all seem worthy of RFCs, and yet there is some value to establish principles like 'statics have significant addresses that ought to be preserved'.
What is being proposed for stabilization
Background: const evaluation and const values
The term const evaluation refers to evaluating a constant expression at compilation time. The result of const evaluation is a const value. Const values can contain abstract pointers (e.g., the result of
&VT
is "the address of the staticVT
") that are not truly known. We cannot always know whether two const values should be considered equal or whether they would compare as equal at runtimer. Const values are used to store the initializer for a named static (static S: T = /* initializer */
), the values of named constants (const C: T = ...
), and the values of associated constants (<T as Something>::SIZE
).What
const_refs_to_static
allowsCurrently, const evaluation does not allow a
const
value to reference astatic
. A program like this one therefore requires a feature gate:The feature gate allows not only reading the value of a static but also taking references (and even dereferencing them):
The "significant address" property
The key distinguishing feature of a
static
versus any other form of variable is that they have a significant address. In short,&S
for some staticS
is often expected to be the same pointer everywhere in the program whenever it occurs (but see the caveat below). This is distinct from a local variable, say, which may have different addresses on each invocation of the function1; it is also distinct from a constant likeconst { &22 }
, which can also refer to different memory locations (though they will always have22
).Const evaluation and constants preserve this property (playground):
In contrast, the pointer values of constants are not guaranteed to be equal, and hence equivalent assertions would not be guaranteed to be true for these declarations (playground):
Caveat (generic statics): Statics are currently forbidden from having generic parameters in large because it is not clear if and how the significant address property could be maintained given monomorphization. Future extensions of statics to support generics may revise the precise guarantee being offered here (e.g., to say that generic statics instantiated in distinct compilation units may sometimes have distinct addresses) and they would have to address how that interacts with constants.
Extern statics
Extern statics are treated conservatively. It is possible to get their address as a raw pointer but it is not possible to read from them (what would the value be) or to include a safe reference to them in your final value (playground):
Freeze requirement
Const evaluation is not allowed to
static mut
).&mut
)."Safely reference" here refers to recursively traversing the value in the same way safe code could (but ignoring visibility), i.e. recursing through references but not through raw pointers or unions.
It is possible to create static values with
UnsafeCell
contents, but they can not typically be used from constants except in very narrow ways. For example, creating a constant whose value includes anUnsafeCell
(or a reference to memory contained in an unsafe cell) triggers an error that "it is undefined behavior to use this value":Similarly attempting to access the contents of an unsafe cell results in "constant accesses mutable global memory":
It is however possible to use statics that have
UnsafeCell
in other ways, e.g. returning a raw pointer to their contents:Static mut
Statics declared as
static mut
generally behave "as if" they were enclosed in an unsafe cell (playground):The same is true of external statics (playground):
Future interactions
Const generics refers to Rust items with generic parameters of kind
const
, such asfn foo<const C: usize>()
. Stable Rust requires that const generic parameters have simple scalar types likeusize
ori32
. This limitation means that there is no real interaction between the stable surface area of const generics andconst_refs_to_static
.So long as we do not extend const generics to permit values of
&
-type, then there are no problems at all (but of course we limit what users can do, and in particular don't support&str
values). If however we wish to extend const generics to permit parameters of&
-type (e.g.,fn foo<const C: &usize>()
), then we will need to extend the current implementation to preserve the "significant address" property. This section dives into detail as to why that property is not currently preserved, the various options to fix that, and some related challenges.Background: Const generics and monomorphization
Given a function
fn foo<const C: SomeType>()
, Rust's type system must be able to decide whetherfoo::<X>
andfoo::<Y>
represent two different instances of the same generic function (or, equivalently, givenstruct Foo<const C: SomeType>
, whetherFoo<X>
andFoo<Y>
are the same type). This requires being able to determine whetherX
andY
are equal (i.e., the same value). This equality comparison cannot be done for all const values since some of them lack a well-defined notion of equality (e.g., two values of typefn()
). Stable Rust sidesteps this issue by only permitting const generics where the type is a scalar value (e.g.,u32
) and the constant expression can be evaluated to a fixed constant (in particular, the expression is not allowed to reference generic types).Introducing valtrees
To support a richer set of values in const generics, nightly Rust makes use of valtrees. A valtree ("value tree") is a simplified form of const value consisting of "branch nodes" and "leaf nodes", which carry simple scalar values. The "value" of a const generic parameter is always a valtree, not an arbitrary const value.
For the simple types supported in const generics today, valtree conversion is infallible -- simply convert the scalar value to a leaf node. The same is true for ADTs composed of those simple types. Converting a
(u32, u32)
tuple like(22, 44)
for example simply means you get a valtree like(I32LeafNode(22_i32), I32LeafNode(44_i32))
.Valtrees do not carry type information. The same valtree
(I32LeafNode(22_i32), I32LeafNode(44_i32))
that represents a tuple would also represent a fixed-length array like[22, 44]
or a value ofstruct Point { x: u32, y: u32 }
. At monomorphization time, generic constants have both a type and an associated valtree suitable for that type, and that type can be used to instantiate the valtree into an actual value.Values of more complex types may not have a well-defined valtree. For example, there is no way to represent a
fn()
value as a valtree. In the nightly version of const generics, whenever a const value is given as the value for a const generic, the compiler internally attempts to convert that const value to a valtree. This process can fail, in which case an error results. But if it succeeds, then the const generic can be compiled. Whenever the const generic argument is referenced, the valtree will be converted into a const value which can in turn be converted into a real value at runtime.Example. Let's walk through an example supported on stable today:
main
, the expression22 + 44
is const evaluated into a const valueConstVal(66)
.ConstVal(66)
is then converted into a valtreeI32LeafNode(66)
.test::<I32LeafNode(66)>
is compiled.let x = C
is compiled,I32LeafNode(66)
is converted back toConstVal(66)
and from there the code is compiled to load a constant. Execution proceeds as expected.Supporting references in valtrees
As currently implemented, references are ignored when creating a valtree, so the valtrees for
22
and&22
and even&&22
are all the same (justI32LeafNode(22)
). This preserves the property that, given two valuesX
andY
, ifvaltree(X) == valtree(Y)
thenx == y
. For refrences, this means that pointer equalty ought not be considered part of identity, since the==
operator for&T
says that two references are equal if their referents are equal (and it doesn't consider the pointer address). Put another way, theEq
trait doesn't respect "significant addresses", and valtrees are currently defined to align withEq
, so they do not either.The current definition of valtrees implies that const generics of type
&usize
(or any reference) will preserve the value of the referent but not its address (as that is not part of the valtree). This can create observable behavior on nightly. Consider this example from #120961:When executed, this example does NOT print anything, even though you might expect that it would. What is happening?
BAR
isConstVal(&FOO)
, which tracks that it is the address of the staticFOO
.BAR
is converted into a valtree, which results in just42
(the value of the static is used to create the valtree).foo::<Leaf(64)>
is compiled, the valtree must be converted into a&usize
. A new temporary value is synthesized. Thestr::ptr::eq
(which observes the physical pointer address) compares the address of this temporary toFOO
and they have different addresses.const BAZ: &'static u32 = &42
would typically be equal toX
, but that is because LLVM deduplicates such constants into a single allocation; such deduplication is also not guaranteed to occur, particularly across codegen units.There is general agreement that this behavior is surprising and not desirable. But note that it requires multiple feature gates --
const_refs_to_static
ANDadt_const_params
(and as of very recently,unsized_const_params
). Stabilizing justconst_refs_to_static
does not really change anything. In other words, the problem with the above example is not due to permitting references to statics in constants, it's due to valtrees encoding references in a surprising way (though if you didn't have references to statics, you couldn't observe it).Options to support references in const generics
So, what are the options for supporting reference types in const generics, while avoiding surprising examples like the one from #120961 above?
Option A: Disallow creating valtrees from references to statics
We could make valtree construction fail if it encounters a reference to a static (but succeed for references to anonymous constants). This would avoid the issues but only be preventing users from doing something they likely want to do. This program would not compile, for example, since it invokes
foo
with the constant&S
:This option is not very appealing, ecause users likely want to create valtrees that reference statics.
Option B: Extend valtree to represent ref-to-static
A more appealing option is to extend valtrees so that "ref-to-static" is something they can directly encode, and thus sacrifice the invariant that
valtree(X) == valtree(Y)
impliesX == Y
. This recognizes the fact that there are additional properties to values that we may wish to preserve beyond what is compared by theEq
trait. Significant addresses are not the only examples of such properties, there are many that arise when const functions use unsafe code, such as the value of padding bits, provenance, and potentially things like which NaN is in use (if we wished to supportf64
). We will have to decide which of them we wish to make observable in const evaluation.The upshot: Stuff to figure out, but refs-to-statics doesn't make it harder
As BoxyUwu put it:
Links
Footnotes
And potentially even within a single function call, if the value is moved or becomes dead -- though arguably that is a separate variable. Precise limitations here still TBD. ↩
The text was updated successfully, but these errors were encountered: