Fix relocation #28

Woyten · 2018-03-23T19:42:00Z

In order to make global variables and dynamic dispatch work, we need to compile binaries conforming to the R_ARM_SBREL32 relocation model.

As far as I understand we need to perform two steps:

Migrate the code of tock/userland/libtock/crt0.c to Rust

Pass compiler flags to the LLVM/LLD toolchain equivalent to:

-msingle-pic-base
-mpic-register=r9
-mno-pic-data-is-text-relative
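
To make the first step more concrete, here is a minimal sketch of the kind of fix-up loop a Rust port of crt0 would need to run at startup. It is not the actual crt0 code. The function name, the parameter layout, and the convention that a set top bit (0x80000000) marks a flash-relative word are assumptions made for illustration.

```rust
/// Tag bit that, in this sketch, marks a word as flash-relative.
/// Assumption: the linker script places flash at 0x8000_0000 so that
/// flash addresses can be told apart from RAM addresses.
const FLASH_TAG: u32 = 0x8000_0000;

/// Rewrites every word of `data` named in `reloc_offsets` (byte offsets into
/// the data image) so that it holds an absolute address.
///
/// Words with the tag bit set are rebased onto `flash_base`; all other words
/// are rebased onto `ram_base`. Panics if an offset lies outside `data`.
fn apply_relocations(data: &mut [u32], reloc_offsets: &[u32], flash_base: u32, ram_base: u32) {
    for &offset in reloc_offsets {
        let index = (offset / 4) as usize;
        let word = data[index];
        data[index] = if word & FLASH_TAG != 0 {
            // Strip the tag, then add the address where the app's flash image really is.
            (word ^ FLASH_TAG).wrapping_add(flash_base)
        } else {
            // Plain offsets are relative to the start of the app's RAM region.
            word.wrapping_add(ram_base)
        };
    }
}
```

Words that are not listed in the relocation table are left untouched, so ordinary constants in the data image survive the fix-up.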


Woyten · 2018-08-22T21:04:32Z

@alevy I played around with the relocation problem the whole weekend but I am completely lost now.

My findings:

Relocations are not emitted by default. They can be emitted via -C link-args=--emit-relocs.

I am not sure whether -C relocation-model=ropi-rwpi represents the correct relocation model due to the following problems I encountered:

vtables point to addresses above 0x80000000. Accessing them, obviously, crashes the program. I tested the vtable value using the following code:

let my_int = 5usize;
let vtable_location = mem::transmute::<_, (usize, usize)>(&my_int as &MyTrait).1; // 0x00020b64 (ACCESSIBLE)
let first_vtable_entry = ptr::read_volatile(vtable_location as *const usize);     // 0x80000737 (NOT ACCESSIBLE)

static muts crash during link time. The following example results in an unrecognized reloc error:

static mut STATIC_MUT: usize = 0;
debug::print_as_hex(STATIC_MUT);
STATIC_MUT = 1;
debug::print_as_hex(STATIC_MUT);

There are relocations of different types for the trait objects but none of them is of type R_ARM_SBREL32. I queried the relocations using:
```
readelf --relocs -W cortex-m4.elf|rg MyTrait
```
The printed relocation types are R_ARM_THM_MOVW_PREL_NC, R_ARM_THM_MOVT_PREL and R_ARM_ABS32.
The data segment is empty.
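
For anyone who wants to repeat the vtable check on a host machine first, the following is a self-contained sketch of the same inspection. `MyTrait` and `split_trait_object` are placeholder names. The sketch assumes that a trait-object reference is two machine words with the data pointer first, which rustc does in practice but does not guarantee.

```rust
use core::mem;

trait MyTrait {
    fn value(&self) -> usize;
}

impl MyTrait for usize {
    fn value(&self) -> usize {
        *self
    }
}

/// Splits a trait-object reference into `[data pointer, vtable pointer]`.
///
/// Assumption: the fat pointer consists of two machine words, data first.
/// This is how rustc lays it out today, but it is not a stable guarantee.
fn split_trait_object(obj: &dyn MyTrait) -> [usize; 2] {
    unsafe { mem::transmute::<&dyn MyTrait, [usize; 2]>(obj) }
}
```

On a correctly relocated binary, the second word should be an address inside the app's own flash or RAM range rather than something above 0x80000000.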

If, on the other hand, I build the code using -C relocation-model=pic, I observe the following:

vtables still don't work. In fact, they crash a little earlier:

let my_int = 5usize;
let vtable_location = mem::transmute::<_, (usize, usize)>(&my_int as &MyTrait).1; // 0x8002002c (NOT ACCESSIBLE)

static muts can be linked but they point to garbage:

static mut STATIC_MUT: usize = 99;
let dereferenced = &STATIC_MUT as *const _ as usize; // 0x8002002c (NOT ACCESSIBLE)

Relocations are of type R_ARM_ABS32 and R_ARM_REL32. This seems closer to what we want but it's still not R_ARM_SBREL32.
There is a data section but I cannot tell whether the content makes sense.

In any case, no matter which relocation model I choose:

The GOT is empty. Do we expect elements in it? I guess not as we compile a static binary from scratch.
The value of r9 has no effect. I would expect that some relocatable references depend on r9 according to llvm-mirror/lld@29241e3.
The reldata part of the _start header is located at 0x80000000 which, again, is not accessible.

Do you think I am on the right track?

Woyten · 2018-10-06T16:41:51Z

@torfmaster and I were finally able to prove that trait objects can work in Tock OS. See #56 for more details.

Unfortunately, I cannot recommend applying the strategy mentioned in the PR. It has too many drawbacks (like no real position independence) and relies on hacks or details that might cease to be valid in a newer version of rustc.

In order to get libtock-rs binaries running properly we need to fix some external tools. The following strategy should enable the remaining Rust features:

Fix trait objects
We think that rustc contains a bug in the ropi relocation model leading to corrupt vtable lookups. Our compiled binaries try to find vtable functions at absolute addresses. This, however, conflicts with the idea of position independent code execution. The most probable reason for the bug is that an offset based on the program counter has been forgotten.
Fix static muts
static muts can be compiled but not linked. According to llvm-mirror/lld@29241e3, LLD supports R9 based relocation. In practice, it refuses to process the emitted relocation types. This could be a problem with the LLD version used by rustc.
Improve string literal ergonomics
If we want to print the string literal to the console, we need to manually copy it from flash to RAM first (e.g. by using String::from). Otherwise, the allow operation of the kernel will crash because we are not allowed to allow memory on the flash. My proposed possible solutions to the problem:
1. Elegant: Add a new tock syscall (e.g. allow_ro) with read-only access to the flash. There are other reasons why an allow_ro syscall is a good idea like better borrow checker support.
2. Difficult: Relocate the string literals from flash to RAM during startup. The copy step is easy. The difficult part is to adapt rustc so that string literals are no longer accessed in flash but in RAM. I think that's what libtock-c is doing. It also requires that the linker problem mentioned above is solved.
3. Poor: Ignore the problem and enforce owned Strings (needs allocation, we want to opt-out for it) and/or write! (slow).
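
Until one of these options is available, the manual copy does not strictly need an allocation. The sketch below copies a literal into a caller-provided buffer (for example, one on the stack) before it would be handed to the kernel. `copy_to_ram` is a hypothetical helper, not a libtock-rs API.

```rust
/// Copies `text` into `buf`, which is assumed to live in RAM, and returns the
/// filled prefix. Returns `None` if `buf` is too small to hold `text`.
fn copy_to_ram<'a>(text: &str, buf: &'a mut [u8]) -> Option<&'a [u8]> {
    let bytes = text.as_bytes();
    // Borrow exactly as many bytes as the literal needs, or bail out.
    let dest = buf.get_mut(..bytes.len())?;
    dest.copy_from_slice(bytes);
    Some(&*dest)
}
```

The returned slice points into `buf`, so it can be shared with the kernel without touching flash. The cost is that the caller has to pick a buffer size up front.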

@alevy What do you think about those problems? We would be happy if someone else could help fixing the rustc and LLD problems. The rustc problem might be interesting for @japaric and the embedded working group as well.

jrvanwhy · 2019-01-19T00:02:23Z

There are at least two problems getting in the way of rustc support for ROPI-RWPI:

LLVM's ROPI-RWPI implementation does not move .rodata values that are relocated into .data, which prevents the relocations from being implemented on microcontrollers (.rodata is truly RO on flash).
For some reason, inter-crate references that should be using ROPI relocation use RWPI relocation. This issue is rustc specific; I was unable to reproduce it using clang.

In the meantime, static linking of Rust apps appears to be possible (avoiding relocation entirely). I'm putting together a PR to implement static linking. Fortunately, static linking doesn't require any code changes that are incompatible with ROPI-RWPI, although it requires linker script changes.

jrvanwhy · 2019-02-08T22:40:42Z

Static linking works as of #64 ; making ROPI-RWPI relocation work correctly is a larger problem that'll take longer to solve.


luojia65 · 2021-05-09T08:07:40Z

Will this be achievable on platforms other than ARM? We may wish to execute embassy on more architectures.

hudson-ayers · 2021-05-10T14:55:49Z

From the perspective of libtock-rs, I think the hope is for this to be achieved on both ARM and RISC-V eventually. Unfortunately we are blocked on upstream support in LLVM for PIC in both cases. Thus, currently it does not work on either architecture -- though libtock-c does support relocatable apps on ARM only, thanks to using gcc rather than LLVM.

I think the latest status of RISC-V ROPI/RWPI can be followed here: riscv-non-isa/riscv-elf-psabi-doc#128

And the latest status for the issues with ARM thumb targets can be followed here: rust-lang/rust#54431

dcz-self · 2022-03-05T14:21:50Z

I'm also rather interested in working relocations. Having read the Rust-lang thread, LLVM exchange, and the rust-embedded IRC log, I noted two statements that stand out.

From the LLVM emails, regarding initializers: "I don't think such transformation belongs into clang." From the IRC discussion: "for apps you could roll your own in-kernel dynamic linker".

Was it considered to ignore the ROPI/RWPI approach, and instead rely on relocations and fix them up using a linker? That step could even take place in tockloader, while flashing (assuming that applications once flashed aren't going to be moved again).

If there are still problems with relocations not being emitted enough, the actual step of linking object files could be moved to flash-time, with the relevant offsets (or linker files) calculated based on where the app is going to land.
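
As a rough illustration of the flash-time fix-up idea, the sketch below rebases absolute 32-bit words in an image from the address it was linked for to the address it will be flashed at. `AbsReloc` and `rebase_image` are hypothetical names, and the sketch assumes a single base address and little-endian words. A real tool would also have to handle RAM addresses and other relocation types.

```rust
/// One absolute 32-bit relocation: the byte offset of a little-endian word in
/// the image that holds an address computed for the link-time base.
struct AbsReloc {
    offset: usize,
}

/// Rebases every relocated word from `link_base` to `load_base`.
/// Returns `Err(offset)` if a relocation does not fit inside the image.
fn rebase_image(
    image: &mut [u8],
    relocs: &[AbsReloc],
    link_base: u32,
    load_base: u32,
) -> Result<(), usize> {
    let delta = load_base.wrapping_sub(link_base);
    for reloc in relocs {
        let end = reloc.offset.checked_add(4).ok_or(reloc.offset)?;
        let slot = image.get_mut(reloc.offset..end).ok_or(reloc.offset)?;
        let word = u32::from_le_bytes([slot[0], slot[1], slot[2], slot[3]]);
        // Shift the stored address by the distance the image has moved.
        slot.copy_from_slice(&word.wrapping_add(delta).to_le_bytes());
    }
    Ok(())
}
```

This only works if the toolchain emits a relocation for every absolute address in the image, which is the open question raised above.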

If any of those approaches is not totally crazy, I'm willing to try implementing it - loadable applications are a must for me.

hudson-ayers · 2022-03-05T17:22:12Z

lowRISC is working on an ePIC implementation for RISC-V and hopes to upstream it to LLVM: https://github.com/lowRISC/epic-c-example / https://github.com/lowRISC/llvm-project/commits/epic. Our current hope is that this work will at least make loadable applications possible for RISC-V, though support for ARM may take longer as I do not believe lowRISC currently plans to port this work to other architectures.

jrvanwhy · 2022-03-05T18:20:03Z

> Was it considered to ignore the ROPI/RWPI approach, and instead rely on relocations and fix them up using a linker? That step could even take place in tockloader, while flashing (assuming that applications once flashed aren't going to be moved again).
>
> If there are still problems with relocations not being emitted enough, the actual step of linking object files could be moved to flash-time, with the relevant offsets (or linker files) calculated based on where the app is going to land.

I'm pretty sure that your idea is workable. It is not a solution that works for every user of libtock-rs, which is why lowRISC is working on ePIC (but as Hudson mentioned, they're primarily focused on RISC-V).

One other solution that the Tock project has looked at (which libtock-c uses) is to compile each process multiple times for different locations, and have tockloader choose which TBF file to deploy on a system based on the addresses it is compiled for.
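
The selection step in that approach could look roughly like the sketch below. It assumes each prebuilt variant is identified by the flash and RAM start addresses it was linked for. `BuildVariant` and `pick_variant` are made-up names and do not reflect tockloader's actual implementation.

```rust
/// Addresses that one statically linked build of an app was compiled for.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct BuildVariant {
    flash_start: u32,
    ram_start: u32,
}

/// Returns the first variant linked for exactly the given flash and RAM start
/// addresses, or `None` if no prebuilt variant fits that slot.
fn pick_variant(variants: &[BuildVariant], flash_start: u32, ram_start: u32) -> Option<BuildVariant> {
    variants
        .iter()
        .copied()
        .find(|v| v.flash_start == flash_start && v.ram_start == ram_start)
}
```

If no variant matches, the app cannot be installed in that slot without recompiling it.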

dcz-self · 2022-03-06T08:16:29Z

Didn't libtock-c actually use position-independent binaries? That's what I gathered from the discussion about libtock-rs.

hudson-ayers · 2022-03-06T18:18:48Z

libtock-c uses actually position-independent binaries for ARM targets, but gcc does not support position-independent binaries for RISC-V.

dcz-self · 2022-03-06T19:25:37Z

Thanks. I just realized that static linking also makes the RAM address fixed, which is rather suboptimal when applications are meant to be able to be loaded in any order. Perhaps some form of PIC with rwdata section relocations at runtime could solve that - if such relocations are supported by the compiler.

jrvanwhy · 2022-03-07T02:40:14Z

> Thanks. I just realized that static linking also makes the RAM address fixed, which is rather suboptimal when applications are meant to be able to be loaded in any order.

Yes -- when I said "compile each process multiple times for different locations", each "location" is a combination of a flash address range and a RAM address range.

> Perhaps some form of PIC with rwdata section relocations at runtime could solve that - if such relocations are supported by the compiler.

I do not think that is possible with any relocation mode that LLVM supports, unfortunately.

alevy · 2022-03-18T15:52:27Z

@dcz-self this (rather old) blog post explains a little bit of the complexity with PIC: https://www.tockos.org/blog/2016/dynamic-loading/

libtock-c works because GCC supports the particular kinds of variants of PIC we need, while LLVM doesn't (actually there was a reasonably complete patch from somebody at ARM, I believe, back in the day but it wasn't accepted).

> Perhaps some form of PIC with rwdata section relocations at runtime could solve that - if such relocations are supported by the compiler.

Proposals are very welcome! The main constraints are: (1) code lives in flash, not RAM, and we probably don't want to be rewriting flash on every process reboot (because of write degradation and performance) and (2) the binary size should be reasonably small---all the extra information retained for dynamic loading in, e.g., Linux ELFs results in executables that are typically way too big for the target platforms. But neither of these means there isn't some sweet spot design that is possible.

potto216 · 2022-12-20T22:57:39Z

With Rust merged into GCC 13, will that eventually make it possible to resolve this as the implementation matures?

jrvanwhy · 2022-12-20T23:07:18Z

> With Rust merged into GCC 13, will that eventually make it possible to resolve this as the implementation matures?

rustc_codegen_gcc is on track to be usable well before GCC's Rust frontend, so I don't think that changes anything. Either way, GCC only supports the necessary relocation mode on ARM, not RISC-V, so it's not a complete solution.

If ePIC ends up being RISC-V only, we may end up implementing relocation using rustc_codegen_gcc on ARM and ePIC on RISC-V.

nathaniel-brough · 2024-06-29T03:07:52Z

Are there any updates on this? I'm aware that this seems to be blocked by either rustc_codegen_gcc being usable or an upstream fix in rustc and/or LLVM, but I'm not well versed enough in PIC to understand what the problem is or what the current status is of work towards those upstream fixes. It does seem like rustc_codegen_gcc has seen a lot of development in the last two years, though.

alevy · 2024-06-29T04:58:47Z

@silvergasp the answer right now is that waiting for LLVM and RISC-V (and any other non-cortex-m architecture) to support something that would work as we hope is not a good plan.

There are basically four ways forward as far as I understand, two of which have a good chance of happening soon:

1. The status quo, which is that you have to compile a process "on-demand" for a specific location. This is not relocation at all, but it works if you're able/willing to recompile each time you install an app (e.g. if you always ship a bunch of apps as a bundle).
2. Simpler relocation that requires a fixed distance between text and data. This is particularly reasonable on platforms with no executable flash, where code needs to be loaded into memory anyway to run. It's also probably reasonable otherwise, it just induces more memory pressure. This is almost certainly going to be supported.
3. Simpler relocation that requires some patches before execution. Perhaps it would be reasonable, for example, to patch addresses when installing an app onto a board (e.g. in tockloader). This is a bit more complex but seems doable.
4. Work on contributing to LLVM and then Rust to support these relocations. It's probably already close with ropi-rwpi. Contributing to LLVM and Rust is both very important generally and a full-time job, so this is a bit harder for folks to peel away time for.
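
To illustrate the second option, the sketch below shows why a fixed text-to-data distance is enough for position independence: data addresses are derived from wherever the text ended up, never from an absolute link-time address. The constant and the function name are hypothetical and chosen only for the example.

```rust
/// Distance, fixed at link time, between the start of text and the start of data.
/// Hypothetical value for illustration only.
const TEXT_TO_DATA: u32 = 0x1_0000;

/// Address of a data item as PC-relative code would compute it: relative to
/// where the text actually landed, plus the fixed text-to-data distance.
fn data_address(text_base: u32, data_offset: u32) -> u32 {
    text_base.wrapping_add(TEXT_TO_DATA).wrapping_add(data_offset)
}
```

Moving the whole image moves text and data together, so the same code finds its data at any load address. The price is that text and data can no longer be placed independently, which is where the extra memory pressure comes from.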

