FM-259: Sign all incomplete checkpoints #292

aakoshh · 2023-09-29T15:46:42Z

The PR changes the end-of-period checkpoint signature broadcasting by validators to apply on all incomplete (a.k.a. pending) checkpoints where they were validators.

So, instead of doing this on node start, it's done on each checkpoint period end. This has the following benefits:

main.rs stays simpler and the checkpointing logic is kept in the interpreter
a validator doesn't have to restart their node to fill in their signatures if e.g. they ran out of funds, they just need to get some tokens and it will automatically be retried

adlrocha · 2023-10-03T08:20:42Z

fendermint/vm/interpreter/src/fvm/checkpoint.rs

+            .find(|v| v.public_key.0 == validator_ctx.public_key)
+            .cloned()
+        {
+            // TODO: Code generation in the ipc-solidity-actors repo should cater for this.


@cryptoAtwill, can you double-check that this is the case once you migrate to the ipc-solidity-actors bindings?

I don't remember seeing that this has been implemented already, we just talked about the fact that the bindings generate as many versions of SubnetID and whatnot as there are facets. So it's just a note for the future that if this gets out of hand, we should try to solve it there.

adlrocha · 2023-10-03T08:23:59Z

fendermint/vm/interpreter/src/fvm/exec.rs

+                    let height = checkpoint.block_height;
+                    let validator_ctx = ctx.clone();
+
+                    tokio::spawn(async move {


Why is it possibel to do the broadcast of incomplete signatures in parallel here without resulting in nonce races? (I also ask because in the comment above you mention that broadcasts can't be done in parallel).

Sorry, it looks like my comment wasn't as clear as it could have been.

What I meant was that we should not try to kick off a separate background task for each pending checkpoint like this:

for cp in pending_checkpoints { tokio::spawn(async move { checkpoint::broadcast_signature(broadcaster, cp).await; }); }

That's because every time the broadcaster is asked to send a transaction it will query the state for the nonce, and in this case that will surely mean all of the checkpoint submissions will get the same nonce and only one will go through.

Instead, what we have is effectively this:

tokio::spawn(async move { for cp in pending_checkpoints { checkpoint::broadcast_signature(broadcaster, cp).await; } });

So it's not doing the submissions in parallel, they are done one after the other, but in the background.

aakoshh requested review from adlrocha, cryptoAtwill and dnkolegov September 29, 2023 15:46

dnkolegov approved these changes Oct 2, 2023

View reviewed changes

aakoshh marked this pull request as draft October 2, 2023 13:02

aakoshh marked this pull request as ready for review October 2, 2023 13:02

adlrocha approved these changes Oct 3, 2023

View reviewed changes

Base automatically changed from fm-255-no-broadcast-if-syncing to main October 3, 2023 09:07

FM-259: Send signatures for all pending checkpoints

808e14c

aakoshh force-pushed the fm-259-sign-incomplete-ckpt branch from ef135e5 to 808e14c Compare October 3, 2023 09:17

aakoshh merged commit b2ac4ff into main Oct 3, 2023
1 check passed

aakoshh deleted the fm-259-sign-incomplete-ckpt branch October 3, 2023 09:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FM-259: Sign all incomplete checkpoints #292

FM-259: Sign all incomplete checkpoints #292

aakoshh commented Sep 29, 2023

adlrocha Oct 3, 2023

aakoshh Oct 3, 2023

adlrocha Oct 3, 2023

aakoshh Oct 3, 2023 •

edited

Loading

FM-259: Sign all incomplete checkpoints #292

FM-259: Sign all incomplete checkpoints #292

Conversation

aakoshh commented Sep 29, 2023

adlrocha Oct 3, 2023

Choose a reason for hiding this comment

aakoshh Oct 3, 2023

Choose a reason for hiding this comment

adlrocha Oct 3, 2023

Choose a reason for hiding this comment

aakoshh Oct 3, 2023 • edited Loading

Choose a reason for hiding this comment

aakoshh Oct 3, 2023 •

edited

Loading