
BlocksByRange under WS #2131

Merged · 7 commits merged into dev on May 12, 2021

Conversation

@djrtwo (Contributor) commented Nov 12, 2020

Partially addresses #2116

  • Defines MIN_EPOCHS_FOR_BLOCK_REQUESTS as the minimum epoch range over which a node must serve blocks (and thus how far back a new node must backfill)
  • Does not define a way to advertise this in Req/Resp (e.g. in Status) or in ENR
    • I personally think it unwise to modify Status or MetaData (likely with a protocol ID update) this near to genesis launch. MIN_EPOCHS_FOR_BLOCK_REQUESTS gives us 4+ months to upgrade the req/resp to make this info available.

The main thing the above solution won't capture during the first 4 months is the case where a node is back-filling blocks from a checkpoint state to genesis. In that case, the peer might not be able to respond to BlocksByRange requests successfully. To handle this, I suggest we spec an error code for this case; it will carry forward in the future when BlocksByRange requests are made outside of the advertised range. We can either use 2 (ServerError) or define 3 (ResourceUnavailable).

Questions:

  1. Are we okay leaving out advertisement for earliest epoch for block requests until a couple of months after mainnet?
  2. Good with adding the error? If so, use 2 or define 3?

@mbaxter (Contributor) commented Nov 12, 2020

  1. I think it's fine to defer advertising the earliest available epoch - it does seem like the wrong time to be pushing this change in :).
  2. The error makes sense to me, I vote for 3 (ResourceUnavailable).

@AgeManning (Contributor)

  1. I'm fine with this
  2. Agree with 3 (ResourceUnavailable)

@arnetheduck (Contributor)

  1. OK.
  2. The wording of the text suggests that clients must offer up blocks. Adding the possibility to claim ResourceUnavailable allows clients to keep claiming that indefinitely; the correct behaviour would be to simply sync from MIN_EPOCHS_FOR_BLOCK_REQUESTS and onwards instead of "backfilling". This lets the client report a "correct" head in the existing Status request without the loophole, at the cost of slightly slowing down startup. We can deliberate the addition of an error code when extending Status/MetaData in the 4-5 months mentioned; until then, the language in this PR doesn't really allow a conforming and honest client to return it ("must backfill" should, however, be replaced by "must sync from"). One issue is that this slows down startup slightly because the client must now sync more blocks, but on the other hand it benefits the network because we more strongly enshrine a supply of synced nodes. Faster syncing lies in the domain of light clients and can/should be solved then, along with archive solutions.

@nisdas (Contributor) commented Nov 15, 2020

  1. This sounds good, gives us a bit of time to sort it out.
  2. 3 (ResourceUnavailable) is an appropriate error code for this.

@ajsutton (Contributor)

the correct behaviour would be to simply sync from MIN_EPOCHS_FOR_BLOCK_REQUESTS an onwards instead of "backfilling"

This isn't possible. The sync has to start from the state provided by the user and go forward from there. Clients that respond with ResourceUnavailable would very likely be immediately disconnected as useless, so there's no advantage to claiming that indefinitely.

@AgeManning (Contributor)

Yeah, for the record: our current strategy is going to be to ban any node that sends us this error (at least for the first 5 months).

The ban lasts ~40 minutes, which should be enough time for the peer to backfill, etc.

@djrtwo (Contributor, Author) commented Nov 16, 2020

This isn't possible. The sync has to start from the state provided by the user and go forward from there.

Yes, this is correct. We can't enforce that users must show up with a state that is exactly MIN_EPOCHS_FOR_BLOCK_REQUESTS in the past. And if we did, we would be forcing them to choose a WS state that is at the bounds of being safe (and for small validator sets, just simply unsafe).

Note that for a backfill of blocks only to that boundary, the only validity check a node can do (without fetching an old state) is that these blocks form a hash chain. Because the WS state is trusted, this check is enough to show validity of the blocks for storage and future serving.

Note that if you backfill all the way to genesis, you could then check other validity conditions and could produce historic states.

@arnetheduck (Contributor) commented Nov 16, 2020

This isn't possible.

We can't enforce that users must show up with a state that is exactly MIN_EPOCHS_FOR_BLOCK_REQUESTS in the past.

Why are the two related? When we ask for BlocksByRange, we can simply start at slot wallslot - MIN_EPOCHS_FOR_BLOCK_REQUESTS and should hit the WS state along the way - i.e., it's still a "backfill" in that sense, except that it's done in order, and therefore it's more likely that the network keeps the relevant blocks around, per the 'MUST' requirement in this PR.

Because the WS state is trusted, this check is enough to show validity of the blocks for storage and future serving.

The client should also do a signature check, else it can be poisoned with blocks that have the correct root but an invalid signature; this should be possible since we have the proposer index from the block and the validator set from the WS state.

The point here is that if we want to ensure blocks stay around, that's more likely to happen if the easiest thing to do is to fetch and store them. Starting range sync from the WS state and then "maybe" backfilling makes it less likely that the blocks will stay around. To make the sync requirement even stronger, it would make some sense to bake in "probe" range requests even when synced, and to disconnect any client that does not serve the blocks; otherwise already-synced clients have little to lose. (Being disconnected by an unsynced client is a small loss; being disconnected by a synced client means a clear and present risk that your attestations will not make it through.)

@djrtwo (Contributor, Author) commented Dec 9, 2020

Why are the two related?

Because you have no way to validate a block from wallslot - MIN_EPOCHS_FOR_BLOCK_REQUESTS. You have no start block to check that it forms a chain, and you have no state to check state transition validity.

You can check that the blocks being served to you form a chain, and you can even validate the proposer signature against the validator set you have from your WS state, but you cannot validate that the sequence of blocks being sent to you is valid wrt the state transition, nor can you validate that the blocks will ultimately connect to the chain you decided was valid through your input of the WS state.

Thus an attacker can serve you many blocks before you can know whether they are valuable/valid, and so can cause you to consume extra bandwidth.

It should also do a signature check, else it can be poisoned with blocks that have the correct root but invalid signature - it should be possible since we have the proposer index from the block and the validator set of the WS state.

Yes, agreed. Because you are storing the SignedBeaconBlock, you'll want to validate that outer container.

bors bot pushed commits to sigp/lighthouse that referenced this pull request (Dec 13-15, 2020):

## Issue Addressed

Related to #1891. The error is not in the spec yet (see ethereum/consensus-specs#2131).

## Proposed Changes

Implement the proposed error, banning peers that send it.

## Additional Info

NA
@djrtwo djrtwo requested a review from ralexstokes May 11, 2021 17:30
@ajsutton (Contributor) left a comment:

LGTM.

There was at one point an error code that could be returned for any BlocksByRange request the node couldn't fulfil because it doesn't yet have the blocks. I still think that would be very useful: it makes explicit that the node doesn't have the blocks, rather than an empty response, which could mean either that it lacks the blocks or that there were no blocks in the range.

Review thread (resolved): specs/phase0/p2p-interface.md
@ralexstokes (Member) left a comment:

Generally looks good! Left some minor notes/comments.

Review threads (resolved): README.md, specs/phase0/p2p-interface.md (×2), specs/phase0/weak-subjectivity.md
@arnetheduck (Contributor)

Because you have no way to validate a block from wallslot - MIN_EPOCHS_FOR_BLOCK_REQUESTS. You have no start block to check that it forms a chain, and you have no state to check state transition validity.

This is something we can fix, I think: by turning historical_roots into historical_block_roots (without mixing in the state), we could validate groups of 8192 blocks at a time, which would put a good upper bound on the damage one can cause.

@djrtwo (Contributor, Author) commented May 12, 2021

Added the 3: ResourceUnavailable response code. I know we are generally pretty conservative about adding such codes, but 3 of 4 client teams are in vocal support.

@djrtwo djrtwo merged commit 84830e8 into dev May 12, 2021
@djrtwo djrtwo deleted the bbr-ws branch May 12, 2021 14:51