Add creation time for DHT authority discovery records #91

alexggh · 2024-05-20T12:20:55Z

This adds a new creation time for authority discovery records stored in the DHT.

This RFC has already an implementation in polkadot-sdk here: paritytech/polkadot-sdk#3786

Signed-off-by: Alexandru Gheorghe <[email protected]>

tomaka · 2024-05-20T12:35:57Z

text/0091-dht-record-creation-time.md

+
+## Motivation
+
+Currently, we use the Kademlia DHT for storing records regarding the p2p address of an authority discovery key, the problem is that if the nodes decide to change its PeerId/Network key it will publish a new record, however because of the distributed and replicated nature of the DHT there is no way to tell which record is newer so both old PeerId and the new PeerId will live in the network until the old one expires(36h), that creates all sort of problem and leads to the node changing its address not being properly connected for up to 36h. 


the problem is that if the nodes decide to change its PeerId/Network key it will publish a new record, however because of the distributed and replicated nature of the DHT there is no way to tell which record is newer so both old PeerId and the new PeerId will live in the network until the old one expires(36h)

I don't think that's true? The new record is very much supposed to overwrite the old one.

"Putting a record" consists in finding the 20 peers whose PeerId is the closest to the relevant key (here the relevant key is the authority discovery key) and sending them the record. If later you want to modify the record, you find these 20 peers again (they are normally still the same) and send them a new record, which overwrites the old one.

I don't think that's true? The new record is very much supposed to overwrite the old one.

I've tested this several times on different network configurations and the old record still survives in the network until it expires, as far as I can understand this happens because you do publish the record initially to the closest 20 nodes, but then when the record reaches for different reasons other nodes will still store it locally with this formula: https://github.com/libp2p/rust-libp2p/blob/ad7ad5b3fc5b4bc9a431ece90e9a5ce8c33ca0e2/protocols/kad/src/behaviour.rs#L1796, so not just the initial 20 nodes would store a record.

when the record reaches for different reasons

Well, this isn't really supposed to happen. It can happen in some rare cases where the publisher has been lied to during its iterative query, for example.

Even if the record reaches nodes other than the initial 20 closest nodes, reading the record (i.e. querying it) is always done against the 20 closest nodes.

Alright, double checked again, nodes don't actually have perfect connectivity, so their view of what are peer ids are not always the same, especially after your restart, so we end up in a situation where we don't always publish to the same 20 closest nodes, so you need just one node to still have the old record and then because that will replicate the record to its view of 20 closest nodes, we end up in a situation where the old record will override the new record.

The issue can be replicated systematically, by running a network of around 30-40 nodes, you let it run for sometime then you restart one of the node with a different PeerID.

So, the exact scenario I'm testing is like this:

Run a stable polkadot network.

Just restart a single node with a new network key(PeerId) which makes the node to publish a new value on the dht for the key it is responsible.

What I'm observing is that the old record of the single node is actually on more than the 20 closest nodes that get updated at step 2) and because those nodes do replication https://github.com/libp2p/rust-libp2p/blob/ad7ad5b3fc5b4bc9a431ece90e9a5ce8c33ca0e2/protocols/kad/src/behaviour.rs#L2520 regularly, it is overwriting the newer record.

I consider this to be a valid use case since it is what happens when nodes upgrade and forget to persist their network key, it happened a few times in the past on kusama.

For the tests I ran the network does not seem to be split since I see everyone communicating with everyone on the other protocols, so what do you think am I missing here ?

This whole discussion is off-topic for this RFC, but the record could be on more than 20 nodes simply because during step 1 you have temporary net splits. What's important is that the 20 nodes where the record is updated at step 2 are the same as the ones queried by the rest of the network when they want to read the record. The logs would tell you that.

What's important is that the 20 nodes where the record is updated at step 2 are the same as the ones queried by the rest of the network when they want to read the record. The logs would tell you that

Right, but because the one(outside of the closest 20) still has the old record and does replication on the closest 20, all will have the old record.

This whole discussion is off-topic for this RFC

It is the reason I started the RFC in the first place, but I agree it shouldn't affect the RFC, I'll go ahead and apply your suggestion about using a single signature.

Right, but because the one(outside of the closest 20) still has the old record and does replication on the closest 20, all will have the old record.

As far as I can remember only the original publisher periodically re-publishes and can periodically re-publish.
When receiving a record, a node ensures that the PeerId of the sender of the record (i.e. the "live" PeerId of the connection where the record is being received) can be found in the list of PeerIds that is found in the record and that is covered by the signature.
Otherwise you would have a pretty obvious vulnerability, and we thought of these kind of things when designing the system.

It is the reason I started the RFC in the first place, but I agree it shouldn't affect the RFC, I'll go ahead and apply your suggestion about using a single signature.

I see this RFC as a hack to work around a bug somewhere, and it would be a better idea to actually fix the bug. However I'm personally not really involved in Polkadot anymore, so I'm not going to vote against.

Back to this.

As far as I can remember only the original publisher periodically re-publishes and can periodically re-publish.
When receiving a record, a node ensures that the PeerId of the sender of the record (i.e. the "live" PeerId of the connection where the record is being received) can be found in the list of PeerIds that is found in the record and that is covered by the signature.
Otherwise you would have a pretty obvious vulnerability, and we thought of these kind of things when designing the system.

Yes, you are correct, re-publishing is done only by the original published, but there is also replication which is done by all peers that store the record, the integrity of the record is guaranteed by the fact that the signature is checked, so you can't replicate anything, you need to replicate the original record.

I see this RFC as a hack to work around a bug somewhere, and it would be a better idea to actually fix the bug. However I'm personally not really involved in Polkadot anymore, so I'm not going to vote against.

I've investigated this and I couldn't find anything wrong with the way it works, the scenario this fixes it is just a consequence of the fact that not all nodes have the same routing table always, so at different point in time the 20 closest 20 nodes could differ slightly.

However I'm personally not really involved in Polkadot anymore, so I'm not going to vote against.

Then if you don't strongly oppose this RFC, I intend to move forward with adding this new field(creation_time) since I see benefits in having it, so we can use it to always pick the newest record.

text/0091-dht-record-creation-time.md

... to sign it only once Signed-off-by: Alexandru Gheorghe <[email protected]>

bkchr

Just two minor questions, otherwise it looks good to me 👍

bkchr · 2024-07-04T15:30:34Z

text/0091-dht-record-creation-time.md

+
+```
+
+Each time a node wants to resolve an authorithy ID it will issue a query with a certain redundancy factor, and from all the results it receives it will decide to pick only the newest record. Additionally, the nodes that answer with old records will be updated with the newer record.


Additionally, the nodes that answer with old records will be updated with the newer record.

So, the node that send the DHT request will inform these nodes?

Strictly speaking, I don't think that this needs to be part of the RFC, because even a node not doing this would behave well.

Yes, that's correct, I was just trying to explain how a well behaving validator should/could use this field, I can explicitly state that this part is optional or remove it.

text/0091-dht-record-creation-time.md

Signed-off-by: Alexandru Gheorghe <[email protected]>

bkchr · 2024-07-05T14:32:42Z

/rfc propose

paritytech-rfc-bot · 2024-07-05T14:32:55Z

Hey @bkchr, here is a link you can use to create the referendum aiming to approve this RFC number 0091.

Instructions

Open the link.
Switch to the Submission tab.

Adjust the transaction if needed (for example, the proposal Origin).
Submit the Transaction

It is based on commit hash fbd4296f2a8a14cb2a17b2b0f0260b1df6e80783.

The proposed remark text is: RFC_APPROVE(0091,3df081c71e09b40d9057410b431accdf8fd0a7b20e58d176d47411fb1aa7083d).

github-actions · 2024-07-05T18:19:29Z

Voting for this referenda is ongoing.

Vote for it here

github-actions · 2024-07-11T12:31:13Z

PR can be merged.

Write the following command to trigger the bot

/rfc process 0x997290e7221101d7fe625a2f4eb58dd73393845a696d3ac3682c9f546a25c00b

bkchr · 2024-07-15T09:33:31Z

/rfc process 0x997290e7221101d7fe625a2f4eb58dd73393845a696d3ac3682c9f546a25c00b

paritytech-rfc-bot · 2024-07-15T09:33:48Z

The on-chain referendum has approved the RFC.

alexggh added 4 commits May 20, 2024 15:17

Add creation time for DHT authority discovery records

bd404c6

Signed-off-by: Alexandru Gheorghe <[email protected]>

Add rfc number

15568a1

Signed-off-by: Alexandru Gheorghe <[email protected]>

Minor formatting issues

f5aed7e

Signed-off-by: Alexandru Gheorghe <[email protected]>

Fix typos

e78000a

Signed-off-by: Alexandru Gheorghe <[email protected]>

alexggh marked this pull request as ready for review May 20, 2024 12:28

alexggh mentioned this pull request May 20, 2024

authorithy-discovery: Make changing of peer-id while active a bit more robust paritytech/polkadot-sdk#3786

Merged

4 tasks

tomaka reviewed May 20, 2024

View reviewed changes

text/0091-dht-record-creation-time.md Outdated Show resolved Hide resolved

turuslan mentioned this pull request May 29, 2024

[Feature Request]: DHT Authority discovery record creation time qdrvm/kagome#2108

Closed

Move creation_time inside the authority record

73667cb

... to sign it only once Signed-off-by: Alexandru Gheorghe <[email protected]>

bkchr approved these changes Jul 4, 2024

View reviewed changes

Clearly state the optional behaviour

fbd4296

Signed-off-by: Alexandru Gheorghe <[email protected]>

paritytech-rfc-bot bot merged commit 65cf83e into polkadot-fellows:main Jul 15, 2024

turuslan mentioned this pull request Jul 17, 2024

audi timestamp qdrvm/kagome#2151

Merged

ggwpez mentioned this pull request Sep 9, 2024

Parent issue for stable2409 LTS release paritytech/polkadot-sdk#5583

Closed

anaelleltd added the Implemented Is merged or live as a feature/service. label Sep 10, 2024

TDemeco mentioned this pull request Oct 14, 2024

feat: ⏫ upgrade to Polkadot SDK stable2409 Moonsong-Labs/storage-hub#228

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add creation time for DHT authority discovery records #91

Add creation time for DHT authority discovery records #91

alexggh commented May 20, 2024 •

edited

Loading

tomaka May 20, 2024

tomaka May 20, 2024 •

edited

Loading

alexggh May 20, 2024

tomaka May 20, 2024 •

edited

Loading

alexggh May 24, 2024

alexggh May 24, 2024

tomaka May 24, 2024

alexggh May 24, 2024

tomaka May 24, 2024 •

edited

Loading

alexggh Jun 25, 2024 •

edited

Loading

bkchr left a comment

bkchr Jul 4, 2024

alexggh Jul 4, 2024

bkchr commented Jul 5, 2024

paritytech-rfc-bot bot commented Jul 5, 2024

github-actions bot commented Jul 5, 2024

github-actions bot commented Jul 11, 2024

bkchr commented Jul 15, 2024

paritytech-rfc-bot bot commented Jul 15, 2024


		## Motivation

		Currently, we use the Kademlia DHT for storing records regarding the p2p address of an authority discovery key, the problem is that if the nodes decide to change its PeerId/Network key it will publish a new record, however because of the distributed and replicated nature of the DHT there is no way to tell which record is newer so both old PeerId and the new PeerId will live in the network until the old one expires(36h), that creates all sort of problem and leads to the node changing its address not being properly connected for up to 36h.

Add creation time for DHT authority discovery records #91

Add creation time for DHT authority discovery records #91

Conversation

alexggh commented May 20, 2024 • edited Loading

Choose a reason for hiding this comment

tomaka May 20, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomaka May 20, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomaka May 24, 2024 • edited Loading

Choose a reason for hiding this comment

alexggh Jun 25, 2024 • edited Loading

Choose a reason for hiding this comment

bkchr left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bkchr commented Jul 5, 2024

paritytech-rfc-bot bot commented Jul 5, 2024

github-actions bot commented Jul 5, 2024

github-actions bot commented Jul 11, 2024

bkchr commented Jul 15, 2024

paritytech-rfc-bot bot commented Jul 15, 2024

alexggh commented May 20, 2024 •

edited

Loading

tomaka May 20, 2024 •

edited

Loading

tomaka May 20, 2024 •

edited

Loading

tomaka May 24, 2024 •

edited

Loading

alexggh Jun 25, 2024 •

edited

Loading