Switch to `InsnKind` in the program table #533

naure · 2024-11-02T02:23:24Z

Idea extracted from discussions with @matthiasgoergens (thread).

All circuits support a single constant opcode/funct3/funct7, so they can be merged into a single constant with enum InsnKind.

This saves 1 fixed column, 1 fixed record field, and cleans up a bit.

Note: One day we might merge some opcode circuits together or split circuits for special cases, to improve the verifier or prover cost respectively. But that will probably not match the original opcode/etc classification. So there’s no drawback in removing it.

Double-check that opcode/funct3/funct7 are constants in all circuits.
Change the layout of the program table.
Change the calls to circuit_builder.lk_fetch.
Remove funct7 from imm_internal.


matthiasgoergens · 2024-11-02T02:51:43Z

Thanks!

I see you added a 'speed' label. Specifically, this would speed up proving a bit. But more important is the simplification of the circuits. Speed is just a welcome side effect. :⁠-⁠)

Do we have a label for that as well?

Simpler circuits are easier to understand and audit. Less likely to hide bugs and harbour exploits.

naure · 2024-11-02T02:59:52Z

We do now (cleanup).

matthiasgoergens · 2024-11-02T04:22:59Z

So my suggestion is to make this table as simple as possible. And then use the same data structure (or as much as the same as possible) to drive both the circuits and the emulator.

See how this table is essentially the same as DecodedInstruction in 'decode statically' PR with the same fields. And together with the 'named struct fields' PR we can have actually the same struct on the level of Rust code.

naure · 2024-11-02T06:53:12Z

> use the same data structure (or as much as the same as possible) to drive both the circuits and the emulator.

Of course. It is called DecodedInstruction. The circuits have an extra step, that is a serialization to a list of field elements: DecodedInstruction -> InsnRecord.
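To make the `DecodedInstruction -> InsnRecord` step concrete, here is a minimal sketch of such a serialization. Only the type names `DecodedInstruction`, `InsnRecord` and `InsnKind` come from this thread; the field list, the column order, the three example variants and the `Felt` alias (a plain `u64` standing in for a field element) are assumptions made for illustration and may not match the real code.

```rust
/// Hypothetical subset of instruction kinds; the real enum has many more variants.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
#[repr(u8)]
pub enum InsnKind {
    Add = 0,
    Srli = 1,
    Srai = 2,
}

/// Emulator-side view of one instruction (field list assumed for this sketch).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct DecodedInstruction {
    pub kind: InsnKind,
    pub rd: u32,
    pub rs1: u32,
    pub rs2: u32,
    pub imm: i32,
}

/// Stand-in for a field element; a real circuit would use its own field type.
pub type Felt = u64;

/// Circuit-side view: one program-table row as a flat list of field elements.
/// Assumed column order: pc, kind, rd, rs1, rs2, imm.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct InsnRecord(pub [Felt; 6]);

impl InsnRecord {
    /// Serialize a decoded instruction at address `pc` into one row.
    pub fn new(pc: u32, insn: &DecodedInstruction) -> Self {
        InsnRecord([
            pc as Felt,
            // A single `kind` column replaces separate opcode/funct3/funct7 constants.
            insn.kind as u8 as Felt,
            insn.rd as Felt,
            insn.rs1 as Felt,
            insn.rs2 as Felt,
            // Reinterpret the signed immediate as its 32-bit two's-complement pattern.
            insn.imm as u32 as Felt,
        ])
    }
}
```

With this shape, the emulator only ever touches `DecodedInstruction`, and any circuit-specific encoding choice stays inside `InsnRecord::new`.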

> this table is essentially the same as DecodedInstruction

My goal is to completely separate the standard and stable emulator logic from weird ZK tricks that change every day.

I had just been postponing some needed cleanup. Now fixed here: #536

Hope that clears it up.

matthiasgoergens · 2024-11-02T10:57:43Z

> My goal is to completely separate the standard and stable emulator logic from weird ZK tricks that change every day.

I suggest that the goal should be to make the emulator use the same logic as the circuits. Because we need that logic anyway, and that way it gets an additional pass of tests (and the emulator is easier to test than the circuits.)

That's especially necessary because (a) as you say the ZK tricks might change, thus need special scrutiny, and (b) emulator speed isn't as crucial as circuit correctness. (Even emulator correctness is less important than circuit correctness.)

naure · 2024-11-02T11:18:58Z

That is not how any of this works. Will explain some other time.

_Matthias:_ RISC-V instructions are encoded in several different 'types', each with its own fields. One particularly common field is `opcode`, which typically (but not always) occupies the 7 least significant bits. As the name suggests, the opcode helps encode _which_ operation is intended. I say 'helps' because some operations share the same opcode.

Our code before this PR assumed that bits `12..=14` of the instruction word are exactly enough extra information to determine the operation. (Confusingly enough, our code follows the Risc0 convention and calls that range of bits `funct3`. Only some operations have a field named `funct3` in the spec, and not all of them have it in the same place.)

That assumption is wrong, and our circuits are unsound. Specifically, the distinction between 'signed' (i.e. 'arithmetic') and 'unsigned' (i.e. 'logical') immediate right shifts is encoded in bit 30, as laid out in section 2.4.1 of the [spec](https://github.com/riscv/riscv-isa-manual/releases/download/20240411/unpriv-isa-asciidoc.pdf).

We could introduce a new column called 'bit 30' in the program table to fix the symptom. But instead of such a piecemeal approach, we attack the underlying problem: we switch the program table to record `InsnKind`. `InsnKind` directly reports the kind of instruction we are dealing with. We already have that information available in the decoder, and just re-use it here.

Fixes #539 and #533

---------

Co-authored-by: Aurélien Nicolas <[email protected]>
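The ambiguity described above can be reproduced in a few lines. The sketch below encodes `srli` and `srai` for RV32I and shows that they agree on both the opcode and bits `12..=14`, differing only in bit 30. All names here (`ShiftKind`, `encode_shift_right_imm`, `classify`) are hypothetical helpers written for this illustration, not functions from the codebase, and `classify` skips the checks on the remaining upper bits that a full decoder would perform.

```rust
/// Hypothetical, minimal instruction kinds for this sketch only.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ShiftKind {
    Srli, // logical right shift by immediate
    Srai, // arithmetic right shift by immediate
}

/// OP-IMM major opcode (bits 0..=6).
const OP_IMM: u32 = 0b001_0011;
/// Value of bits 12..=14 shared by SRLI and SRAI.
const FUNCT3_SHIFT_RIGHT: u32 = 0b101;

/// Encode `srli rd, rs1, shamt` or `srai rd, rs1, shamt` for RV32I.
pub fn encode_shift_right_imm(rd: u32, rs1: u32, shamt: u32, arithmetic: bool) -> u32 {
    let bit30: u32 = if arithmetic { 1 << 30 } else { 0 };
    bit30
        | ((shamt & 0x1f) << 20)
        | ((rs1 & 0x1f) << 15)
        | (FUNCT3_SHIFT_RIGHT << 12)
        | ((rd & 0x1f) << 7)
        | OP_IMM
}

/// Extract bits 0..=6.
pub fn opcode(word: u32) -> u32 {
    word & 0x7f
}

/// Extract bits 12..=14.
pub fn funct3(word: u32) -> u32 {
    (word >> 12) & 0b111
}

/// Tell the two right shifts apart; opcode and funct3 alone cannot do this.
pub fn classify(word: u32) -> Option<ShiftKind> {
    if opcode(word) != OP_IMM || funct3(word) != FUNCT3_SHIFT_RIGHT {
        return None;
    }
    if (word >> 30) & 1 == 1 {
        Some(ShiftKind::Srai)
    } else {
        Some(ShiftKind::Srli)
    }
}
```

A table keyed only on `(opcode, funct3)` would map both encodings to the same row, which is why recording the decoded kind directly closes the gap without adding a dedicated 'bit 30' column.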

naure mentioned this issue Nov 2, 2024

Prototype for ahead-of-time decoding #519

Closed

naure added the speed label Nov 2, 2024

naure added the cleanup Refactors, simplifications, hindsight 20/20 tasks. label Nov 2, 2024

matthiasgoergens changed the title ~~Switch to InsnKind in the program table~~ Switch to InsnKind in the program table Nov 2, 2024

naure self-assigned this Nov 2, 2024

naure mentioned this issue Nov 2, 2024

Switch to InsnKind in the program table #538

Merged

naure linked a pull request Nov 2, 2024 that will close this issue

Switch to InsnKind in the program table #538

Merged

hero78119 closed this as completed in #538 Nov 4, 2024
