Safer Transmute RFC

Feature Name: safer_transmute
Start Date: 2020-08-31
RFC PR: rust-lang/rfcs#2981
Rust Issue: rust-lang/rust#0000

Summary

We propose traits, namely TransmuteFrom, that are implemented automatically for combinations of types that may be safely transmuted. In other words, this RFC makes safe transmutation as easy as 1..., 2..., repr(C)!

#[derive(Muckable)]
#[repr(C)]
pub struct Foo(pub u8, pub u16);
//                    ^ there's a padding byte here, between these fields

// Transmute fearlessly!
let _ : Foo = transmute!(64u32); // Alchemy Achieved!

let _ : u32 = transmute!(Foo(16, 12)); // Compile Error!

// error[E0277]: the trait bound `u32: TransmuteFrom<Foo, _>` is not satisfied
//   --> src/demo.rs:7:27
//    |
//  7 | let _ : u32 = transmute!(Foo(16, 12)); // Compile Error!
//    |                          ^^^^^^^^^^^ the trait `TransmuteFrom<Foo, _, _>` is not implemented for `u32`
//    |
//   = note: byte 8 of the source type may be uninitialized; byte 8 of the destination type cannot be uninitialized.

Motivation

Byte-reinterpretation conversions (such as those performed by mem::transmute, mem::transmute_copy, pointer casts, and unions) are invaluable in high performance contexts, are unsafe, and easy to get wrong. This RFC provides mechanisms that make many currently-unsafe transmutations entirely safe. For transmutations that are not entirely safe, this RFC's mechanisms make mistakes harder to make.

This RFC's comprehensive approach provides additional benefits beyond the mere act of transmutation; namely:

authoritatively codifies language layout guarantees
allows crate authors to codify their abstractions' layout requirements

Given the expressive foundation provided by this RFC, we also envision a range of future possibilities that will not require additional compiler support, including:

safe slice and Vec casting
a unified, generic Atomic<T> type
a safe, generic alternative to include_bytes!
traits for asserting the size and alignment relationships of types
zerocopy-style traits for safe initialization
bytemuck-style mechanisms for fallible reference casting

Codifying Language Layout Guarantees

Documentation of Rust's layout guarantees for a type are often spread across countless issues, pull requests, RFCs and various official resources. It can be very difficult to get a straight answer. When transmutation is involved, users must reason about the combined layout properties of the source and destination types.

This RFC proposes mechanisms that programmers will use to confidently answer such questions—by checking whether the TransmuteFrom trait is implemented.

Expressing Layout Requirements

Similarly, there is no canonical way for crate authors to declare the layout requirements of generic abstractions over types that have certain layout properties.

For instance, a common bit-packing technique involves using the relationship between allocations and alignment. If a type is aligned to 2ⁿ, then the n least significant bits of pointers to that type will equal 0. These known-zero bits can be packed with data. Since alignment cannot be currently reasoned about at the type-level, it's currently impossible to bound instantiations of a generic parameter based on minimum alignment.

The mechanisms proposed by the RFC enable this, see here.

Guide-level explanation

Terminology & Concepts

📖 Transmutation

Transmutation is the act of reinterpreting the bytes corresponding to a value of one type as if they corresponded to a different type. Concretely, we mean the behavior of this function:

#[inline(always)]
unsafe fn transmute<Src, Dst>(src: Src) -> Dst
{
    #[repr(C)]
    union Transmute<Src, Dst> {
        src: ManuallyDrop<Src>,
        dst: ManuallyDrop<Dst>,
    }

    ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
}

📖 Safer Transmutation

By safer transmutation we mean: what where bound could we add to transmute restricts its type parameters Src and Dst in ways that statically limit the function's misuse? Our answer to this question will ensure that transmutations are, by default, well-defined and safe.

📖 Well-Definedness

A transmutation is well-defined if the mere act of transmuting a value from one type to another is not unspecified or undefined behavior.

📖 Safety

A well-defined transmutation is safe if using the transmuted value cannot violate memory safety.

📖 Stability

A safe transmutation is stable if the authors of the source type and destination types have indicated that the layouts of those types is part of their libraries' stability guarantees.

Concepts in Depth

Disclaimer: While the high-level definitions of transmutation well-definedness and safety is a core component of this RFC, the detailed rules and examples in this section are not. We expect that the initial implementation of TransmuteFrom may initially be considerably less sophisticated than the examples in this section (and thus forbid valid transmutations). Nonetheless, this section explores nuanced cases of transmutation well-definedness and safety to demonstrate that the APIs we propose can grow to handle that nuance.

📖 When is a transmutation well-defined?

A transmutation is well-defined if the mere act of transmuting a value from one type to another is not unspecified or undefined behavior.

Well-Defined Representation

Transmutation is ill-defined if it occurs between types with unspecified representations.

Most of Rust's primitive types have specified representations. That is, the precise layout characteristics of u8, f32 is a documented and guaranteed aspect of those types.

In contrast, most struct and enum types defined without an explicit #[repr(C)] attribute do not have well-specified layout characteristics.

To ensure that types you've define are transmutable, you almost always (with very few exceptions) must mark them with the #[repr(C)] attribute.

Requirements on Owned Values

Transmutations involving owned values must adhere to two rules to be well-defined. They must:

preserve or broaden the bit validity, and
preserve or shrink the size.

Preserve or Broaden Bit Validity

The bits of any valid instance of the source type must be a bit-valid instance of the destination type.

For example, we are permitted to transmute a Bool into a u8:

#[derive(Muckable)]
#[repr(u8)]
enum Bool {
    True = 1,
    False = 0,
}

let _ : u8 = transmute!(Bool::True);
let _ : u8 = transmute!(Bool::False);

...because all possible instances of Bool are also valid instances of u8. However, transmuting a u8 into a Bool is forbidden:

/* ⚠️ This example intentionally does not compile. */
let _ : Bool = transmute!(u8::default()); // Compile Error!

...because not all instances of u8 are valid instances of Bool.

Another example: While laying out certain types, Rust may insert padding bytes between the layouts of fields. In the below example Padded has two padding bytes, while Packed has none:

#[repr(C)]
#[derive(Default, Muckable)]
struct Padded(pub u8, pub u16, pub u8);

#[repr(C)]
#[derive(Default, Muckable)]
struct Packed(pub u16, pub u16, pub u16);

assert_eq!(mem::size_of::<Packed>(), mem::size_of::<Padded>());

We may safely transmute from Packed to Padded:

let _ : Padded = transmute!(Packed::default());

...but not from Padded to Packed:

/* ⚠️ This example intentionally does not compile. */
let _ : Packed = transmute!(Padded::default()); // Compile Error!

...because doing so would expose two uninitialized padding bytes in Padded as if they were initialized bytes in Packed.

Preserve or Shrink Size

It's well-defined to transmute into a type with fewer bytes than the source type; e.g.:

let _ : [u8; 16] = transmute!([u8; 32]::default());

This transmute truncates away the final sixteen bytes of the [u8; 32] value.

A value may not be transmuted into a type of greater size, if doing so would expose uninitialized bytes as initialized:

/* ⚠️ This example intentionally does not compile. */
let _ : [u8; 32] = transmute!([u8; 16]::default()); // Compile Error!

A differing_sizes lint reports warnings for invocations of transmute!() where the source and destination types are different sizes.

Requirements on References

The restrictions above that apply to transmuting owned values also apply to transmuting references. However, references carry a few additional restrictions.

A [well-defined transmutation] of references must:

preserve or shrink size,
preserve or relax alignment,
preserve or shrink lifetimes,
preserve or shrink uniqueness, and
and if the destination type is a mutate-able reference, preserve validity.

Preserve or Shrink Size

You may preserve or decrease the size of the referent type via transmutation:

let _: &[u8; 3] = transmute!(&[0u8; 9]);

However, you may not, under any circumstances, increase the size of the referent type:

/* ⚠️ This example intentionally does not compile. */
let _: &[u8; 9] = transmute!(&[0u8; 3]); // Compile Error!

Preserve or Relax Alignment

Unaligned loads are undefined behavior. You may transmute a reference into reference of more relaxed alignment:

let _: &[u8; 0] = transmute!(&[0u16; 0]);

However, you may not transmute a reference into a reference of more-restrictive alignment:

/* ⚠️ This example intentionally does not compile. */
let _: &[u16; 0] = transmute!(&[0u8; 0]); // Compile Error!

Preserve or Shrink Lifetimes

You may transmute a reference into a reference of lesser lifetime:

fn shrink<'a>() -> &'a u8 {
    static long : &'static u8 = &16;
    transmute!(long)
}

However, you may not transmute a reference into a reference of greater lifetime:

/* ⚠️ This example intentionally does not compile. */
fn extend<'a>(short: &'a u8) -> &'static u8 {
    transmute!(short) // Compile Error!
}

Preserve or Shrink Uniqueness

You may preserve or decrease the uniqueness of a reference through transmutation:

let _: &u8 = transmute!(&42u8);
let _: &u8 = transmute!(&mut 42u8);

However, you may not transmute a shared reference into a unique reference:

/* ⚠️ This example intentionally does not compile. */
let _: &mut u8 = transmute!(&42u8); // Compile Error!

Mutate-able References Must Preserve Validity

A mutate-able reference is:

all unique (i.e., &mut T) references
all shared (i.e., &T) references whose referent type contain any bytes produced by the contents of UnsafeCell.

Unlike transmutations of owned values, the transmutation of a mutate-able reference may also not expand the bit-validity of the referenced type. For instance:

/* ⚠️ This example intentionally does not compile. */
let mut x = NonZeroU8::new(42).unwrap();
{
    let y : &mut u8 = transmute!(&mut x); // Compile Error!
    *y = 0;
}

let z : NonZeroU8 = x;

If this example did not produce a compile error, the value of z would not be a bit-valid instance of its type, NonZeroU8.

📖 When is a transmutation safe and stable?

A well-defined transmutation is safe if using the transmuted value safely cannot violate memory safety. Whereas well-definedness solely concerns the act of transmutation, safety is concerned with what might happen with a value after transmutation occurs. Since transmutation provides a mechanism for arbitrarily reading and modifying the bytes of a type, a well-defined transmutation is not necessarily safe, nor stable.

Well-Definedness Does Not Imply Safety

For instance, consider the type NonEmptySlice, which enforces a validity constraint on its fields via privacy and its constructor from_array:

pub mod crate_a {

    #[repr(C)]
    pub struct NonEmptySlice<'a, T> {
        data: *const T,
        len: usize,
        lifetime: core::marker::PhantomData<&'a ()>,
    }

    impl<'a, T> NonEmptySlice<'a, T> {
        pub fn from_array<const N: usize>(arr: &'a [T; N], len: usize) -> Self {
            assert!(len <= N);
            assert!(len > 0);
            Self {
                data: arr as *const T,
                len,
                lifetime: core::marker::PhantomData,
            }
        }

        pub fn first(&self) -> &'a T {
            unsafe { &*self.data }
        }
    }

}

It is sound for first to be a safe method is because the from_array constructor ensures that data is safe to dereference, and because from_array is the only way to safely initialize NonEmptySlice outside of crate_a (note that NonEmptySlice's fields are not pub).

However, transmutation supplies a mechanism for constructing instances of a type without invoking its implicit constructor, nor any constructors defined by the type's author. In the previous examples, it would be unsafe to transmute [usize; 2] into NonEmptySlice outside crate_a, because subsequent safe use of that value (namely, calling first) would violate memory safety:

/* ⚠️ This example intentionally does not compile. */
// [usize; 2] ⟶ NonEmptySlice
let _: NonEmptySlice<'static, u8> = transmute!([0usize; 2]); // Compile Error: `NonEmptySlice<_, _>` is not safely transmutable from `[usize; 2]`.

Well-Definedness Does Not Imply Stability

Since the well-definedness of a transmutation is affected by the layouts of the source and destination types, internal changes to those types' layouts may cause code which previously compiled to produce errors. In other words, transmutation causes a type's layout to become part of that type's API for the purposes of SemVer stability.

Signaling Safety and Stability with `Muckable`

To signal that your type may be safely and stably constructed via transmutation, implement the Muckable marker trait:

use mem::transmute::Muckable;

#[derive(Muckable)]
#[repr(C)]
pub struct Foo(pub u8, pub u16);

The Muckable marker trait signals that your type's fields may be safely initialized and modified to any value. If you would not be comfortable making your type's fields pub, you probably should not implement Muckable for your type. By implementing Muckable, you promise to treat any observable modification to your type's layout as a breaking change. (Unobservable changes, such as renaming a private field, are fine.)

As a rule, the destination type of a transmutation must be Muckable.

For transmutations where the destination type involves mutate-able references, the Muckability of the source type is also relevant. Consider:

/* ⚠️ This example intentionally does not compile. */
let arr = [0u8, 1u8, 2u8];
let mut x = NonEmptySlice::from_array(&arr, 2);
{
    // &mut NonEmptySlice ⟶ &mut [usize; 2]
    let y : &mut u128 = transmute!(&mut x) // Compile Error! `&mut NonEmptySlice` is not safely transmutable from `&mut u128`.
    *y[0] = 0;
    *y[1] = 0;
}

let z : NonEmptySlice<u8> = x;

If this example did not produce a compile error, the value of z would not be a safe instance of its type, NonEmptySlice, because z.first() would dereference a null pointer.

Mechanisms of Transmutation

The TransmuteFrom trait provides the fundamental mechanism checking the transmutability of types:

// this trait is implemented automagically by the compiler
#[lang = "transmute_from"]
pub unsafe trait TransmuteFrom<Src: ?Sized, Neglect = ()>
where
    Neglect: TransmuteOptions,
{
    #[inline(always)]
    fn transmute_from(src: Src) -> Self
    where
        Src: Sized,
        Self: Sized,
        Neglect: SafeTransmuteOptions,
    {
        unsafe { Self::unsafe_transmute_from(src) }
    }

    #[inline(always)]
    unsafe fn unsafe_transmute_from(src: Src) -> Self
    where
        Src: Sized,
        Self: Sized,
        Neglect: TransmuteOptions,
    {
        use core::mem::ManuallyDrop;

        #[repr(C)]
        union Transmute<Src, Dst> {
            src: ManuallyDrop<Src>,
            dst: ManuallyDrop<Dst>,
        }

        unsafe {
            ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
        }
    }
}

In the above definitions, Src represents the source type of the transmutation, Dst represents the destination type of the transmutation, and Neglect is a parameter that encodes which static checks the compiler ought to neglect when considering if a transmutation is valid. The default value of Neglect is (), which reflects that, by default, the compiler does not neglect any static checks.

The transmute! macro provides a shorthand for safely transmuting a value:

pub macro transmute($expr: expr) {
    core::convert::transmute::TransmuteFrom::<_>::transmute_from($expr)
    //              ┯
    //              ┕ the destination type of the transmute (`_` used to infer the type from context)
}

A differing_sizes lint warns when the source and destination types of a transmutation (conducted via transmute! or transmute_from) have different sizes.

Neglecting Static Checks

The default value of the Neglect parameter, (), statically forbids transmutes that are ill-defined or unsafe. However, you may explicitly opt-out of some static checks; namely:

Transmute Option	Usable With
`NeglectAlignment`	`unsafe_transmute_{from,into}`
`NeglectValidity`	`unsafe_transmute_{from,into}`
`NeglectSafety`	`unsafe_transmute_{from,into}`

The selection of multiple options is encoded by grouping them as a tuple; e.g., (NeglectAlignment, NeglectValidity) is a selection of both the NeglectAlignment and NeglectValidity options.

We introduce two marker traits which serve to group together the options that may be used with safe transmutes, and those which may be used with unsafe transmutes:

pub trait SafeTransmuteOptions: private::Sealed
{}

pub trait TransmuteOptions: SafeTransmuteOptions
{}

impl SafeTransmuteOptions for () {}
impl TransmuteOptions for () {}

`NeglectAlignment`

By default, TransmuteFrom's methods require that, when transmuting references, the minimum alignment of the destination's referent type is no greater than the minimum alignment of the source's referent type. The NeglectAlignment option disables this requirement.

pub struct NeglectAlignment;

impl TransmuteOptions for NeglectAlignment {}

By using the NeglectAlignment option, you are committing to ensure that the transmuted reference satisfies the alignment requirements of the destination's referent type. For instance:

/// Try to convert a `&T` into `&U`.
///
/// This produces `None` if the referent isn't appropriately
/// aligned, as required by the destination type.
pub fn try_cast_ref<'t, 'u, T, U>(src: &'t T) -> Option<&'u U>
where
    &'t T: TransmuteFrom<&'u U, NeglectAlignment>,
{
    if (src as *const T as usize) % align_of::<U>() != 0 {
        None
    } else {
        // Safe because we dynamically enforce the alignment
        // requirement, whose static check we chose to neglect.
        Some(unsafe { TransmuteFrom::unsafe_transmute_from(src) })
    }
}

`NeglectValidity`

By default, TransmuteFrom's methods require that all instantiations of the source type are guaranteed to be valid instantiations of the destination type. This precludes transmutations which might be valid depending on the source value:

#[repr(u8)]
enum Bool {
    True = 1,
    False = 0,
}

/* ⚠️ This example intentionally does not compile. */
let _ : Bool  = transmute!(some_u8_value); // Compile Error!

The NeglectValidity option disables this check.

pub struct NeglectValidity;

impl TransmuteOptions for NeglectValidity {}

By using the NeglectValidity option, you are committing to ensure dynamically source value is a valid instance of the destination type. For instance:

#[repr(u8)]
enum Bool {
    True = 1,
    False = 0,
}

pub trait TryIntoBool
{
    fn try_into_bool(self) -> Option<Bool>;
}

impl<T> TryIntoBool for T
where
    u8: TransmuteFrom<T>,
    Bool: TransmuteFrom<u8, NeglectValidity>
{
    fn try_into_bool(self) -> Option<Bool> {
        let val: u8 = TransmuteFrom::transmute_from(self);

        if val > 1 {
            None
        } else {
            // Safe, because we've first verified that
            // `val` is a bit-valid instance of a boolean.
            Some(unsafe {TransmuteFrom::unsafe_transmute_from(val)})
        }
    }
}

Even with NeglectValidity, the compiler will statically reject transmutations that cannot possibly be valid:

#[repr(C)] enum Foo { A = 24 }

#[repr(C)] enum Bar { Z = 42 }

let _ = <Bar as TransmuteFrom<Foo, NeglectValidity>::unsafe_transmute_from(Foo::N) // Compile error!

`NeglectSafety`

By default, TransmuteFrom's methods require that all instantiations of the source type are Muckable. If the destination type is a mutate-able reference, the source type must also be Muckable. This precludes transmutations that are well-defined but not safe.

Whether the bound Dst: TransmuteFrom<Src, NeglectSafety> is implemented depends solely on the compiler's analysis of the layouts of Src and Dst (see When is a transmutation well-defined?)—and not the opt-in of the authors of Src and Dst. When using this option, the onus is on you to ensure that you are adhering to the documented layout and library validity guarantees of the involved types.

You might use this option if the involved types predate the Muckable trait (e.g., old versions of libc). For instance, checking libc::in6_addr: TransmuteFrom<Src, NeglectSafety> is better than nothing; it statically ensures that the transmutation from Src to libc::in6_addr is well-defined.

You might also use this option to signal that a particular transmutation is stable without implementing Muckable (which would signal that all transmutations are stable):

impl From<Foo> for Bar
where
    Bar: TransmuteFrom<Foo, NeglectSafety>
{
    fn from(src: Foo) -> Self {
        unsafe { Bar::unsafe_transmute_from(src) }
    }
}

Reference-level explanation

Implementation Guidance

Two items in this RFC require special compiler support:

Muckable
TransmuteFrom
differing_sizes lint

Implementing `Muckable`

The Muckable marker trait is similar to Copy in that all fields of a Muckable type must, themselves, be Muckable.

Implementing `TransmuteFrom`

The implementation of TransmuteFrom is completely internal to the compiler (à la Sized and Freeze).

Constructability and Transmutation

A Src is safely transmutable into Dst in a given if:

Dst: Muckable
Dst: TransmuteFrom<Src, Neglect>
NeglectSafety ∉ Neglect

If Src is a mutatable reference, then additionally:

Src: Muckable

Implementing `differing_sizes` lint

The differing_sizes lint reports a compiler warning when the source and destination types of a transmute!(), transmute_from or unsafe_transmute_from invocation differ. This lint shall be warn-by-default.

Minimal Useful Stabilization Surface

Stabilizing only this subset of the Initial Smart Implementation will cover many use-cases:

transmute!()

To define traits that generically abstract over TransmuteFrom, these items must be stabilized:

TransmuteFrom
TransmuteOptions and SafeTransmuteOptions

Complete API Surface

This listing is the canonical specification of this RFC's API surface (playground):

#![feature(untagged_unions,const_fn,const_fn_union)] // for the impl of unsafe_transmute_from
#![feature(decl_macro)] // for `transmute!` and `#[derive(Muckable)]` macros
#![feature(const_generics)] // for stability declarations on `[T; N]`
#![feature(never_type)] // for stability declarations on `!`
#![allow(unused_unsafe, incomplete_features)]

/// Transmutation conversions.
// suggested location: `core::convert`
pub mod transmute {

    use options::*;

    /// Safely transmute $expr
    pub macro transmute($expr: expr) {
        core::convert::transmute::TransmuteFrom::<_>::transmute_from($expr)
    }

    /// `Self: TransmuteFrom<Src, Neglect`, if the compiler accepts
    /// the safety of transmuting `Src` into `Self`, notwithstanding
    /// a given set of static checks to `Neglect`.
    pub unsafe trait TransmuteFrom<Src: ?Sized, Neglect = ()>
    where
        Neglect: TransmuteOptions,
    {
        /// Reinterpret the bits of a value of one type as another type, safely.
        #[inline(always)]
        fn transmute_from(src: Src) -> Self
        where
            Src: Sized,
            Self: Sized,
            Neglect: SafeTransmuteOptions,
        {
            unsafe { Self::unsafe_transmute_from(src) }
        }

        /// Reinterpret the bits of a value of one type as another type, potentially unsafely.
        ///
        /// The onus is on you to ensure that calling this function is safe.
        #[inline(always)]
        unsafe fn unsafe_transmute_from(src: Src) -> Self
        where
            Src: Sized,
            Self: Sized,
            Neglect: TransmuteOptions,
        {
            use core::mem::ManuallyDrop;

            #[repr(C)]
            union Transmute<Src, Dst> {
                src: ManuallyDrop<Src>,
                dst: ManuallyDrop<Dst>,
            }

            unsafe {
                ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
            }
        }
    }

    /// Static checks that may be neglected when determining if a type is `TransmuteFrom` some other type.
    pub mod options {

        /// Options that may be used with safe transmutations.
        pub trait SafeTransmuteOptions: TransmuteOptions {}

        /// `()` denotes that no static checks should be neglected.
        impl SafeTransmuteOptions for () {}

        /// Options that may be used with unsafe transmutations.
        pub trait TransmuteOptions: private::Sealed {}

        /// Neglect the alignment check of `TransmuteFrom`.
        pub struct NeglectAlignment;

        /// Neglect the validity check of `TransmuteFrom`.
        pub struct NeglectValidity;

        /// Neglect the safety check of `TransmuteFrom`.
        pub struct NeglectSafety;

        impl TransmuteOptions for () {}
        impl TransmuteOptions for NeglectAlignment {}
        impl TransmuteOptions for NeglectValidity {}
        impl TransmuteOptions for NeglectSafety {}
        impl TransmuteOptions for (NeglectAlignment, NeglectValidity) {}
        impl TransmuteOptions for (NeglectAlignment, NeglectSafety) {}
        impl TransmuteOptions for (NeglectSafety, NeglectValidity) {}
        impl TransmuteOptions for (NeglectAlignment, NeglectSafety, NeglectValidity) {}

        // prevent third-party implementations of `TransmuteOptions`
        mod private {
            use super::*;

            pub trait Sealed {}

            impl Sealed for () {}
            impl Sealed for NeglectAlignment {}
            impl Sealed for NeglectValidity {}
            impl Sealed for NeglectSafety {}
            impl Sealed for (NeglectAlignment, NeglectValidity) {}
            impl Sealed for (NeglectAlignment, NeglectSafety) {}
            impl Sealed for (NeglectSafety, NeglectValidity) {}
            impl Sealed for (NeglectAlignment, NeglectSafety, NeglectValidity) {}
        }
    }

    /// Traits for declaring the SemVer stability of types.
    pub mod stability {

        /// Denotes that `Self`'s fields may be arbitarily initialized or
        /// modified, regardless of their visibility. Implementing this trait
        /// additionally denotes that you will treat any observable changes to
        /// `Self`'s layout as breaking changes. (Unobservable changes, such as
        /// renaming a private field, are fine.)
        pub trait Muckable {}

        /// `#[derive(Muckable)]`
        pub macro Muckable($expr: expr) {
            /* stub */
        }

        impl Muckable for     ! {}
        impl Muckable for    () {}

        impl Muckable for   f32 {}
        impl Muckable for   f64 {}

        impl Muckable for    i8 {}
        impl Muckable for   i16 {}
        impl Muckable for   i32 {}
        impl Muckable for   i64 {}
        impl Muckable for  i128 {}
        impl Muckable for isize {}

        impl Muckable for    u8 {}
        impl Muckable for   u16 {}
        impl Muckable for   u32 {}
        impl Muckable for   u64 {}
        impl Muckable for  u128 {}
        impl Muckable for usize {}

        impl<T: ?Sized> Muckable for core::marker::PhantomData<T> {}

        impl<T, const N: usize> Muckable for [T; N]
        where
            T: Muckable,
        {}

        impl<T: ?Sized> Muckable for *const T
        where
            T: Muckable, /* discuss this bound */
        {}

        impl<T: ?Sized> Muckable for *mut T
        where
            T: Muckable, /* discuss this bound */
        {}

        impl<'a, T: ?Sized> Muckable for &'a T
        where
            T: Muckable,
        {}

        impl<'a, T: ?Sized> Muckable for &'a mut T
        where
            T: Muckable,
        {}
    }
}

Drawbacks

TODO

Rationale and alternatives

Rationale: `TransmuteFrom`

Why support arbitrary transmutation?

Some prior art, especially in the crate ecosystem, provides an API that only supports transmutations involving particular types (e.g., from/into bytes). As we discuss in the prior art section, we believe that the inflexibility of such approaches make them a poor basis of a language proposal. In particular, these restrictive approaches don't leave room to grow: supporting additional transmutations requires additional traits.

The API advocated by this proposal is unopinionated about what transmutations users might wish to do, and what transmutations the compiler is able to reason about. The implementation of this RFC may be initially very simple (and perhaps support no more than the restrictive approaches allow for), but then subsequently grow in sophistication—without necessitating public API changes.

Alternative: Implementing this RFC in a Crate

This RFC builds on ample prior art in the crate ecosystem, but these efforts strain against the fundamental limitations of crates. Fundamentally, safe transmutation efforts use traits to expose layout information to the type system. The burden of ensuring safety is usually either placed entirely on the end-user, or assumed by complex, incomplete proc-macro derives.

An exception to this rule is the typic crate, which utilizes complex, type-level programming to emulate a compiler-supported, "smart" TransmuteFrom trait (like the one proposed in this RFC). Nonetheless, typic is fundamentally limited: since Rust does not provide a type-level mechanism for reflecting over the structure of arbitrary types, even typic cannot judge the safety of a transmutation without special user-added annotations on type definitions. Although typic succeeds as a proof-of-concept, its maintainability is questionable, and the error messages it produces are lovecraftian.

The development approaches like typic's could, perhaps, be eased by stabilizing frunk-like structural reflection, or (better yet) by stabilizing a compiler plugin API for registering "smart" traits like TransmuteFrom. However, we suspect that such features would be drastically harder to design and stabilize.

Regardless of approach, almost all prior art attempts to reproduce knowledge already possessed by rustc during the compilation process (i.e., the layout qualities of a concrete type). Emulating the process of layout computation to any degree is an error-prone duplication of effort between rustc and the crate, in a domain where correctness is crucial.

Finally, community-led, crate-based approaches are, inescapably, unauthoritative. These approaches are incapable of fulfilling our motivating goal of providing a standard mechanism for programmers to statically ensure that a transmutation is well-defined or safe.

Prior art

Prior Art: Rust

A handful of dimensions of variation characterize the distinctions between prior art in Rust:

conversion complexity
conversion fallibility
source and destination types supported
implementation mechanism
stability hazards

We review each of these dimensions in turn, along with this proposal's location along these dimensions:

Conversion Complexity

Prior work differs in whether it supports complex conversions, or only simple transmutation. Pre-RFC FromBits/IntoBits's proposed traits include conversion methods that are implemented by type authors. Because end-users provide their own definitions of these methods, they can be defined to do more than just transmutation (e.g., slice casting). (This approach is similar to the uncon crate's FromUnchecked and IntoUnchecked traits, which provide unsafe conversions between types. These traits are safe to implement, but their conversion methods are not.)

In contrast, our RFC focuses only on transmutation. Our TransmutableFrom and TransmutableInto traits serve as both a marker and a mechanism: if Dst: TransmuteFrom<Src>, it is safe to transmute from Dst into Src using mem::transmute. However, these traits also provide transmutation methods that are guaranteed to compile into nothing more complex than a memcpy. These methods cannot be overridden by end-users to implement more complex behavior.

The signal and transmutability and mechanism are, in principle, separable. The convute crate's Transmute<T> trait is an unsafe marker trait representing types that can be transmuted into T. This is just a marker trait; the actual conversion mechanisms are provided by a separate suite of traits and functions. Our RFC combines marker with mechanism because we feel that separating these aspects introduces additional complexity with little added value.

Conversion Fallibility

Prior work differs in whether it supports only infallible conversions, or fallible conversions, too. The convute crate's TryTransmute<T> trait provides a method, can_transmute, that returns true a transmutation from Self to T is valid for a particular value of &self. An early version of typic abstracted a similar mechanism into an Invariants trait, with additional facilities for error reporting. Draft-RFC: Compatible/TryCompatible employs a similar mechanism to typic.

Typic removed support for fallible transmutation after reckoning with several challenges:

The causes of uncertain failure could be language-imposed (e.g., alignment or validity requirements), or library imposed (i.e., invariants placed on a structure's private fields).
The points of uncertain failures could be arbitrarily 'deep' into the fields of a type.
Error reporting incurs a runtime cost commensurate with the detail of the reporting, but the detail of reporting required by end-user depends on use-case, not just type. For instance: for some use-cases it may be necessary to know where and why a byte was not a valid bool; in others it may be sufficient to know simply whether an error occurred.

Finally, we observed that the mechanisms of fallible transmutation were basically separable from the mechanisms of infallible transmutation, and thus these challenges could be addressed at a later date. For these reasons, our RFC only addresses infallible transmutation.

While this RFC does not provide a grand, all-encompassing mechanism for fallible transmutation, the fundamental mechanisms of our RFC are useful for constructing safer, purpose-built fallible conversion mechanisms; e.g.:

/// Try to convert a `&T` into `&U`.
///
/// This produces `None` if the referent isn't appropriately
/// aligned, as required by the destination type.
pub fn try_cast_ref<'t, 'u, T, U>(src: &'t T) -> Option<&'u U>
where
    &'t T: TransmuteFrom<&'u U, NeglectAlignment>,
{
    if (src as *const T as usize) % align_of::<U>() != 0 {
        None
    } else {
        // Safe because we dynamically enforce the alignment
        // requirement, whose static check we chose to neglect.
        Some(unsafe { TransmuteFrom::unsafe_transmute_from(src) })
    }
}

In this approach, our RFC is joined by crates such as plain, bytemuck, dataview, safe-transmute, zerocopy, and byterepr, and several pre-RFCs (such as this and this). The ubiquity of these mechanisms makes a strong case for their inclusion in libcore.

Source and Destination Types Supported

Prior work differs in whether its API surface is flexible enough to support transmutation between arbitrary types, or something less.

Arbitrary Types

Approaches supporting transmutations between arbitrary types invariably define traits akin to either or both:

/// Indicates that `Self` may be transmuted into `Dst`.
pub unsafe trait TransmuteInto<Dst>
{ ... }

/// Indicates that `Self` may be transmuted from `Dst`.
pub unsafe trait TransmuteFrom<Src>
{ ... }

This approach, taken by our RFC, is used by at least two crates:

The convute crate's Transmute<T> trait is akin to the above definition of TransmuteInto.
The typic crate's TransmuteInto and TransmuteFrom traits almost exactly mirror the above definitions.

...and several proposals:

Pre-RFC: Safe coercions proposes a Coercible<A, B> trait that is implemented if A is safely transmutable into B.
Pre-RFC: FromBits/IntoBits proposes the traits IntoBits<U> and FromBits<T>.
Draft-RFC: FromBytes proposes the traits IntoBytes<U> and FromBytes<T>.
Draft-RFC: Compatible/TryCompatible proposes the trait Compatible<U>, akin to the above definition of TransmuteInto.

From/Into Bytes Transmutations

Other approaches adopt an API that only supports transmutation of a type into initialized bytes, and from initialized bytes. These approaches invariably define traits akin to:

/// Indicates that a type may be transmuted into an appropriately-sized array of bytes.
pub unsafe trait IntoBytes
{}

/// Indicates that a type may be transmuted from an appropriately-sized array of bytes.
pub unsafe trait FromBytes
{}

This is the approach taken by the zerocopy crate, and the Pre-RFC: Safe Transmute and Pre-RFC: Safe Transmute v2 proposals.

This approach is strictly less flexible than an API supporting transmutation between arbitrary types. It is incapable of representing transmutations of bytes into types with validity constraints, and incapable of representing transmutations of types with padding bytes into bytes.

Supporting additional transmutation source and destination types requires a commensurate addition of conversion traits. For instance, some of zerocopy's users require a trait that reflects types which can be transmuted from a buffer of zeroed bytes. This would require introducing an additional trait, FromZeros.

An advantage of this API is that it gives descriptive names to perhaps the two most common transmutations. However, an API providing transmutation between arbitrary types can encode FromBytes and IntoBytes:

// `Dst` is `FromBytes` if it can be safely transmuted *from* an
// equivalently sized array of `u8`.
unsafe impl<Dst> FromBytes for Dst
where
    Dst: TransmuteFrom<[u8; size_of::<Dst>()]>,
{}

// `Src` is `IntoBytes` if it can be safely transmuted *into* an
// equivalently sized array of `u8`.
unsafe impl<Src> IntoBytes for Src
where
    Src: TransmuteInto<[u8; size_of::<Src>()]>,
{}

For these reasons, we argue that a FromBytes/ToBytes style API is a poor foundation for in-language safe transmutation.

Bytes-to-Bytes Transmutations (aka "Plain Old Data")

Finally, many approaches (especially crates) supply a marker trait that represents "plain old data"; e.g.:

/// Implemented by types that are "plain old data":
pub unsafe trait PlainOldData
{}

This sort of trait is present in crates such as plain, bytemuck, dataview, safe-transmute, and pod, and at least two language proposals (here and here).

The exact definition of what constitutes "plain old data" varies between crates. One simple definition is that a type T is "plain old data" if it can be transmuted both from and into initialized bytes; i.e.:

unsafe impl<T> PlainOldData for T
where
    T: FromBytes + IntoBytes,
{}

This definition precludes useful transmutations. For instance, MaybeUninit<u8> is transmutable from a u8, but not into a u8.

Given this inflexibility, we argue that this approach is a poor foundation for in-language safe transmutation.

Implementation Mechanism

Not only does prior work differ in which traits are used to encode valid transmutations, they differ in the level of user intervention required to take advantage of the traits.

Manual

Fully manual approaches require type authors to implement the transmutation traits manually. The involved traits are unsafe, so it is up to type authors to verify for themselves that their hand-written implementations are sound. This is the approach taken by crates such as plain, bytemuck, safe-transmute, and pod, and at least one language proposal: Pre-RFC: Safe Transmute (which advocates for a "plain old data" API).

In semi-manual approaches, type authors simply derive the applicable traits, using derive macros that produce a compile-error if the implementation is not sound. This approach is realized by crates such as (zerocopy, zeroable and dataview) and advocated by at least two language proposals: Pre-RFC: Safe Transmute v2 (which advocates for a FromBytes/IntoBytes-style API), and Pre-RFC FromBits/IntoBits (which advocates for a general-transmutation API).

We believe that the implementation burden these approaches place on end-users, and their inflexibility, make them a poor foundation for in-language safe transmutation:

These approaches require authors to implement and, potentially, verify a large number of unsafe traits, ranging from O(n) implementations for plain-old-data trait approaches, to potentially many more.
These approaches are generally impractical for APIs that permit truly general transmutation, as type authors can only construct implementations of the transmutation traits for types they have at their disposal.
These approaches conflate transmutation stability with transmutation safety. An end-user wishing to transmute a type for which its author has not manually implemented the applicable traits must resort to the wildly unsafe mem::transmute.

Automatic

Automatic approaches implement the transmutation traits without user intervention, whenever it is sound to do so. This is the approach taken by our RFC. Automatic mechanisms appear in at least four prior language proposals:

Pre-RFC: Safe coercions
Draft-RFC: from_bytes
Pre-RFC: Trait for deserializing untrusted input
Draft-RFC: compatible_trait

The typic crate mocks a fully-automatic approach: its TransmuteFrom trait is usable with any types that are repr(C), or otherwise have a well-defined memory layout. (In practice, since Rust lacks reflection over type definitions, repr(C) annotations much be changed to typic::repr(C).)

Safety Hazards

Fully automatic approaches introduce, at the very least, a safety hazard: they supply a safe constructor for types, without the consent of those types' authors. If a type author hid the internals of their type because they do not wish for its implementation details to become a part of the type's API for SemVer for safety purposes, an automatic transmutation mechanism subverts that intent.

No attempt to avoid this hazard is made by most of the proposals featuring automatic mechanisms; e.g.:

Draft-RFC: from_bytes
Pre-RFC: Trait for deserializing untrusted input
Draft-RFC: compatible_trait

Hazard-Avoidant

The automatic mechanism proposed by Pre-RFC: Safe coercions exploits field visibility, requiring that all fields that have different types in Src and Dst are visible at the location where the coercion is made. This approach falls short in three respects:

Confining the visibility requirement only to fields of different types is insufficient; two different types with identical field types may subject those fields to different invariants.
The 'location' where the coercion is made is ill-defined; the presence of the proposed Coercible trait may be far-removed from the location of the actual conversion (if any conversion occurs at all).
Field visibility stabilizes the structure of a type, but not its layout (e.e., its size).

Our RFC, typic, and Haskell exploit the related concept of constructability. Typic uses a simplified, scope-unaware formulation of constructability that suffers from a soundness hole induced by the pub-in-priv trick.

Prior Art: Haskell

Haskell's Coercible typeclass is implemented for all types A and B when the compiler can infer that they have the same representation. As with our proposal's TransmuteFrom trait, instances of this typeclass are created "on-the-fly" by the compiler. Coercible primarily provides a safe means to convert to-and-from newtypes, and does not seek to answer, for instance, if two u8s are interchangeable with a u16.

Haskell takes an algebraic approach to this problem, reasoning at the level of type definitions, not type layouts. However, not all type parameters have an impact on a type's layout; for instance:

#[repr(C)]
struct Bar<U>(PhantomData<U>);

#[repr(transparent)]
struct Foo<T, U>(T, Bar<U>);

Foo's layout is impacted solely by T, not U, but this isn't necessarily clear by looking at the definition of Foo. To reason about these scenarios, Haskell introduces the concept of type parameter roles—labels that denote the relationship of a type parameter to coercibility.

Our RFC does not need the concept of roles, because it does not attempt to abstractly reason about type definitions. Rather, it reasons about type layouts. This example, for instance, does not pose a challenge to our proposal:

trait SomeTrait { type AssociatedType; }

#[repr(C)]
struct MyStruct<T: SomeTrait>(pub T, pub T::AssociatedType);

For a particular T, MyStruct<T> will have a particular layout. Our proposed TransmuteFrom trait reasons about the layouts of types (which are fully concrete), not the definitions (which may be somewhat abstract).

Unresolved questions

Questions To Be Resolved Before RFC Acceptance

The following unresolved questions should be resolved during the RFC process:

Unhandled Use-Cases?

We endeavored to design an API surface with (nearly) zero compromises. However, if you have a use-case that you believe is neither satisfied outright by our proposal, nor aided by our proposal, we would urgently like to hear of it.

Extensions for Inclusion?

In Future Possibilities, we propose a number of additional abstractions that are aided by this RFC. Some of these abstractions are commonplace in prior art and should perhaps be included with this RFC. Some of our proposed extensions could begin their crates that work on stable Rust; others, such as generic atomics, require nightly-only intrinsics.

Questions To Be Resolved Before Feature Stabilization

The following unresolved questions should be resolved before feature stabilization:

When should `Muckable` be automatically implemented?

There is considerable overlap between the effect of Muckable and making fields pub. A type that is implicitly constructible already permits the arbitrary initialization and modification of its fields. While there may be use-cases for implementing Muckable on a type with private fields, it is an odd thing to do, as it sends a confusing, mixed-message about visibility. Downstream, forgetting to implement Muckable for an implicitly constructible type forces users to needlessly resort to unsafe transmutation.

Muckable may be automatically derived for types that are publicly implicitly constructible, without posing a stability or safety hazard. The type Foo is effectively Muckable here:

#[repr(C)]
pub struct Foo(pub u8, pub u16);

...and here:

#[repr(C)]
pub struct Foo(pub Bar, pub u16);

#[repr(C)]
pub struct Bar;

...and here:

#[repr(C)]
pub struct Foo<T: Muckable, U: Muckable>(pub T, pub U);

A type is not effectively Muckable if its fields are not all pub, or if it is marked with #[non_exhaustive], or if the fields themselves are not effectively Muckable.

Questions Out of Scope

We consider the following unresolved questions to be out-of-scope of this RFC process:

Future possibilities

Safe Union Access

Given TransmuteFrom, the compiler can determine whether an access of a union variant of type V from a union U is safe by checking V: TransmuteFrom<U>. In accesses where that bound is satisfied, the compiler can omit the requirement that the access occur in an unsafe block.

Limited Stability Guarantees

Implementing Muckable for a type allows for safe and stable transmutations without requiring the type's author to enumerate all useful transmutations (à la From), but at the cost of requiring full layout stability. For some use-cases, the reverse might be preferable: explicitly enumerate the set of stable transmutations without promising full layout stability.

To accommodate this use-case, we could permit users to write implementations of TransmuteFrom in the form:

unsafe impl TransmuteFrom<Foo> for Bar
where
    Bar: TransmuteFrom<Foo, NeglectSafety>
{}

Such implementations would conform to the usual orphan rules and would not permit users to override TransmuteFrom's methods.

Extension: Layout Property Traits

Given TransmuteFrom, crates can define traits that are implemented only when size and alignment invariants are satisfied, such as SizeEq or AlignLtEq. For additional details, see here.

Extension: Byte Transmutation Traits and Safe Initialization

Given TransmuteFrom, crates can define zerocopy-style traits. For additional details, see here.

Extension: Slice and `Vec` Casting

Given TransmuteFrom, crates can define traits for "transmuting" slices and Vecs. For additional details, see here.

Extension: `include_data!`

Given TransmuteFrom, crates can define a more useful alternative to include_bytes!. For additional details, see here.

Extension: Generic Atomics

Given TransmuteFrom, crates can define a generic Atomic<T> alternative to the various Atomic* types. For additional details, see here.

Files

0000-safer-transmute.md

Latest commit

History

0000-safer-transmute.md

File metadata and controls

Safer Transmute RFC

Summary

Motivation

Codifying Language Layout Guarantees

Expressing Layout Requirements

Guide-level explanation

Terminology & Concepts

📖 Transmutation

📖 Safer Transmutation

📖 Well-Definedness

📖 Safety

📖 Stability

Concepts in Depth

📖 When is a transmutation well-defined?

Well-Defined Representation

Requirements on Owned Values

Preserve or Broaden Bit Validity

Preserve or Shrink Size

Requirements on References

Preserve or Shrink Size

Preserve or Relax Alignment

Preserve or Shrink Lifetimes

Preserve or Shrink Uniqueness

Mutate-able References Must Preserve Validity

📖 When is a transmutation safe and stable?

Well-Definedness Does Not Imply Safety

Well-Definedness Does Not Imply Stability

Signaling Safety and Stability with Muckable

Mechanisms of Transmutation

Neglecting Static Checks

NeglectAlignment

NeglectValidity

NeglectSafety

Reference-level explanation

Implementation Guidance

Implementing Muckable

Implementing TransmuteFrom

Constructability and Transmutation

Implementing differing_sizes lint

Minimal Useful Stabilization Surface

Complete API Surface

Drawbacks

Rationale and alternatives

Rationale: TransmuteFrom

Why support arbitrary transmutation?

Alternative: Implementing this RFC in a Crate

Prior art

Prior Art: Rust

Conversion Complexity

Conversion Fallibility

Source and Destination Types Supported

Arbitrary Types

From/Into Bytes Transmutations

Bytes-to-Bytes Transmutations (aka "Plain Old Data")

Implementation Mechanism

Manual

Automatic

Safety Hazards

Hazard-Avoidant

Prior Art: Haskell

Unresolved questions

Questions To Be Resolved Before RFC Acceptance

Unhandled Use-Cases?

Extensions for Inclusion?

Questions To Be Resolved Before Feature Stabilization

When should Muckable be automatically implemented?

Questions Out of Scope

Future possibilities

Safe Union Access

Limited Stability Guarantees

Extension: Layout Property Traits

Extension: Byte Transmutation Traits and Safe Initialization

Extension: Slice and Vec Casting

Extension: include_data!

Signaling Safety and Stability with `Muckable`

`NeglectAlignment`

`NeglectValidity`

`NeglectSafety`

Implementing `Muckable`

Implementing `TransmuteFrom`

Implementing `differing_sizes` lint

Rationale: `TransmuteFrom`

When should `Muckable` be automatically implemented?

Extension: Slice and `Vec` Casting

Extension: `include_data!`