Add design for the leader validator loop #2650

aeyakovenko · 2019-02-02T22:52:27Z

Problem

Lack of design for a clear leader/validator fullnode loop.

Summary of Changes

This is a proposal for a different PoH interface to leaders and validators that allow the fullnode to switch modes as the PoH reaches the scheduled slot.

Fixes #

garious

It's not clear to me if this chapter is attempting to describe the existing behavior or the desired behavior. If the latter, how does it compare to the former?

book/src/leader-validator-transition.md

aeyakovenko · 2019-02-04T15:26:39Z

It's not clear to me if this chapter is attempting to describe the existing behavior or the desired behavior. If the latter, how does it compare to the former?

This is a proposal for the desired behavior. I am not sure what the current implementation does; it's hard to grok.

aeyakovenko · 2019-02-04T15:49:14Z

@garious or my understanding of the current code is that, there is no main control loop, and the leader and validators run concurrently, but try to not operate on the same slot concurrently.

book/src/leader-validator-transition.md

aeyakovenko · 2019-02-07T02:58:49Z

@rob-solana, I don’t have a strong opinion on TVU+TPU concurrency. The loop can just start the tpu asynchronously. But doing so means some weird interactions with voting and PoH reset in the TVU

book/src/leader-validator-transition.md

garious · 2019-02-07T23:08:06Z

@rob-solana, @aeyakovenko, I'm not seeing value in the distinction between PoH Generator and PoH Recorder. Seems like we can use only the PoH Recorder. There doesn't need to be this concept of waiting for a generator to finish. Instead, the TPU can use its Recorder as the timer. Once it's at a certain height, it can start using the same PoH to record entries.

rob-solana · 2019-02-07T23:31:32Z

@rob-solana, @aeyakovenko, I'm not seeing value in the distinction between PoH Generator and PoH Recorder. Seems like we can use only the PoH Recorder. There doesn't need to be this concept of waiting for a generator to finish. Instead, the TPU can use its Recorder as the timer. Once it's at a certain height, it can start using the same PoH to record entries.

ok, means we're also creating a bank_fork for the TPU at the time that the recorder is constructed?

garious · 2019-02-07T23:58:59Z

We reset the recorder after voting, so yes, makes sense to me that we'd also fork at that point too. We might want to fork off that parent if, for example, we reject our own TPU's fork later down the line.

aeyakovenko · 2019-02-08T00:43:52Z

@garious then the TPU and TVU can’t run concurrently. At least the voting part of the TVU can’t reset the tpu’s PoH, or that vote cancels the block.

My concern is that a faster asic could get this node to cancel its own block.

garious · 2019-02-08T01:05:46Z

Seems reasonable that a TVU would reject the TPUs block if a faster ASIC got its block to the TVU (and validated!) before the TPU finished its block. Feels a little awkward, but not wrong.

aeyakovenko · 2019-02-08T02:08:34Z

@garious, it would be great to somehow highlight the TVU vs TPU option as something we need more simulation with. We can’t enforce the behavior at the protocol layer, and I have no idea which is better for the network, or for the individual node.

aeyakovenko · 2019-02-08T05:26:45Z

@garious, @rob-solana. You would need to kill the TPU fork if the TVU votes for a different fork, or use another thread to generate PoH for the TPU.

There might also be races with an older fork completing while the TPU is still building its own fork.

garious · 2019-02-09T00:18:20Z

This is quite a bit different than the original design and could use a whole new review. @mvines, @sagar-solana, @carllin, looking in your general direction.

book/src/leader-validator-transition.md

aeyakovenko

Does the tvu run concurrently with the tpu? What happens to the PoH recorder that the tpu is using if the tvu votes while the tpu is running?

[saw the update, reset is blocked until tpu is done]

...where "live active chain" was changed to "active fork".

garious · 2019-02-13T01:33:58Z

@aeyakovenko, can you review? Since I've been pushing commits to your fork, GitHub won't let me add you as a reviewer.

garious · 2019-02-13T01:35:12Z

@mvines, any review comments? considering you've been knee deep in this in #2735.

aeyakovenko · 2019-02-13T15:19:36Z

@garious lgtm

garious · 2019-02-13T16:23:50Z

@aeyakovenko, no need to mention me or use the LGTM acronym. Just marking the PR as approved is sufficient.

aeyakovenko · 2019-02-13T17:07:32Z

@garious, I can’t approve my own pr

book/src/leader-validator-transition.md

* bank: add current_epoch_staked_nodes() Add current_epoch_staked_nodes() which returns the staked nodes for the current epoch. Remove Bank::staked_nodes() which used to return the bank's view of staked nodes updated to the last vote processed. The updated call sites don't really need super up to date stake info, and with this change we can stop updating staked node info for every vote on every bank. Instead we now compute it once per epoch. * bank: current_epoch_stakes: explain why self.epoch + 1

aeyakovenko requested review from garious, mvines, carllin, rob-solana and sagar-solana February 2, 2019 22:52

garious reviewed Feb 4, 2019

View reviewed changes

aeyakovenko changed the title ~~proposal for the leader validator loop~~ Propose a design for the leader validator loop Feb 4, 2019

garious reviewed Feb 4, 2019

View reviewed changes

book/src/leader-validator-transition.md Outdated Show resolved Hide resolved

garious self-assigned this Feb 5, 2019

garious force-pushed the leader_validator_loop branch from f260ca7 to f020c02 Compare February 6, 2019 23:22

garious changed the title ~~Propose a design for the leader validator loop~~ Add design for the leader validator loop Feb 7, 2019

garious reviewed Feb 7, 2019

View reviewed changes

garious reviewed Feb 9, 2019

View reviewed changes

book/src/leader-validator-transition.md Show resolved Hide resolved

aeyakovenko commented Feb 9, 2019

View reviewed changes

aeyakovenko added 5 commits February 8, 2019 20:20

proposal for the leader validator loop

bd9b487

more design

0f2a8bb

more docs

2831df5

docs

311bc67

fmt

6340214

aeyakovenko and others added 14 commits February 8, 2019 20:20

more docs

8b48c3f

comments

48c6ef4

docs

8775213

s/finalized/frozen/

53a211d

more docs

6c04193

update

9654759

removed bad rust and replaced with psudocode

8b858e0

Cleanup

6c70204

Add proposal to SUMMARY.md

ba1ecd2

Delete references to terminology defined in the Fork Deltas proposal

f3d2d71

...where "live active chain" was changed to "active fork".

Attempt to simplify the algorithm

496eac5

Fix numbers

49c5b12

Rewrite with just one PoH Recorder

685bd90

Review feedback

a8456d9

garious force-pushed the leader_validator_loop branch from e1df06b to a8456d9 Compare February 9, 2019 03:21

Delete the mostly unused list of components

79ed888

mvines reviewed Feb 13, 2019

View reviewed changes

Apply review feedback

704d3ec

mvines approved these changes Feb 13, 2019

View reviewed changes

garious merged commit aec44e3 into solana-labs:master Feb 13, 2019

adamlaska mentioned this pull request Apr 25, 2023

[Snyk] Security upgrade semantic-release from 19.0.3 to 20.0.1 adamlaska/solana#77

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add design for the leader validator loop #2650

Add design for the leader validator loop #2650

aeyakovenko commented Feb 2, 2019

garious left a comment

aeyakovenko commented Feb 4, 2019 •

edited by garious

Loading

aeyakovenko commented Feb 4, 2019

aeyakovenko commented Feb 7, 2019

garious commented Feb 7, 2019

rob-solana commented Feb 7, 2019

garious commented Feb 7, 2019

aeyakovenko commented Feb 8, 2019

garious commented Feb 8, 2019

aeyakovenko commented Feb 8, 2019

aeyakovenko commented Feb 8, 2019 •

edited

Loading

garious commented Feb 9, 2019

aeyakovenko left a comment •

edited

Loading

garious commented Feb 13, 2019

garious commented Feb 13, 2019

aeyakovenko commented Feb 13, 2019

garious commented Feb 13, 2019

aeyakovenko commented Feb 13, 2019

Add design for the leader validator loop #2650

Add design for the leader validator loop #2650

Conversation

aeyakovenko commented Feb 2, 2019

Problem

Summary of Changes

garious left a comment

Choose a reason for hiding this comment

aeyakovenko commented Feb 4, 2019 • edited by garious Loading

aeyakovenko commented Feb 4, 2019

aeyakovenko commented Feb 7, 2019

garious commented Feb 7, 2019

rob-solana commented Feb 7, 2019

garious commented Feb 7, 2019

aeyakovenko commented Feb 8, 2019

garious commented Feb 8, 2019

aeyakovenko commented Feb 8, 2019

aeyakovenko commented Feb 8, 2019 • edited Loading

garious commented Feb 9, 2019

aeyakovenko left a comment • edited Loading

Choose a reason for hiding this comment

garious commented Feb 13, 2019

garious commented Feb 13, 2019

aeyakovenko commented Feb 13, 2019

garious commented Feb 13, 2019

aeyakovenko commented Feb 13, 2019

aeyakovenko commented Feb 4, 2019 •

edited by garious

Loading

aeyakovenko commented Feb 8, 2019 •

edited

Loading

aeyakovenko left a comment •

edited

Loading