Peer backup storage (feature 40/41/42/43) #881

t-bast · 2021-06-28T08:26:56Z

Nodes can offer to altruistically store small amounts of data on behalf of their channel peers. It complements option_data_loss_protect and can let nodes that lost data fully recover their channels without any on-chain transaction.

The contents of the peer_backup is left unspecified to offer wide flexibility for implementations. It can for example contain an encrypted version of your internal channel state, or a mix of important data from several channels, it's really up to the implementations to go #reckless.

Please note that a drawback is that this uses a lot of space in the tlv stream of existing messages, which means there won't be a lot of additional space available if we want to enrich these messages with new tlv fields. We should choose the maximum size of backups carefully if we don't want to paint ourselves into a corner.

SomberNight · 2021-06-29T15:14:01Z

Thank you for writing this up!
In general I like this proposal very much, and we would definitely want to implement this (or whatever it evolves into) into Electrum.

On first read I had assumed that the backup blobs are per-peer but then realised they are actually per-channel.

One thing I would like to note is the usecase where, after restoring from seed, wallet software might be able to find some of their channel peers but not the channels themselves. i.e. Alice's wallet might know they have some channels with Bob but not know any of the channel ids (funding outpoints). In such a case Alice can establish a transport with Bob, and then wait for Bob to send one or more channel_reestablish messages. As there is no way for Alice to "request" the backup blobs (and she does not even know what the channel ids are), she must rely on Bob sending the channel_reestablish messages (regardless of who initated the transport).

One thing I am not clear is the (a)symmetry of this feature. I think this needs to be cleared up.
The PR text talks about "storage provider"s, and suggests an asymmetric relationship where e.g. a forwarding node offers storing backups for a light client. However, negotiating a feature such as peer_backup_storage is inherently symmetric with current BOLT-09. So how exactly does this look like?

Let's say there is a light client who wants to store backups with their peer (Carol); and let's say the light client does not want to store backups for the other party (or does it need to? it could I guess). Would the light client set peer_backup_storage in e.g. the init message? Would Carol set it? How would Carol know that they are the "service provider" in this relationship?
Or is it that they both must store blobs for the other one, and then the relationship is symmetric?

I am also wondering if the feature can even be used in a symmetric way.
If two counterparties of a channel, Alice and Bob, want to store backups with each other, and Alice loses state, then she will not be able to send the last backup sent by Bob. So assuming the backup blobs are used to in a Phoenix-like usecase (resume channel operation after restoring from seed); if the feature was used in a symmetric way (Alice storing blobs at Bob, and Bob storing blobs at Alice), then if either party loses state, the channel would get closed by the other party, I guess(?).

t-bast · 2021-06-30T08:35:36Z

Thanks @SomberNight this is very good feedback. I've started with the minimal approach we use in Phoenix, which is simpler than the general case because Phoenix is always fundee and initiates reconnections, but this PR is a good opportunity to offer a backup mechanism that supports more scenarios.

I think it would make sense to have two distinct types of backups, which would address the scenario described in your first paragraph:

channel backups (one per channel, as described in d873416), which may be updated at every commitment change
a node backup which can contain data about channels and peers

We could store in the node backup the list of our peers and the channel IDs we have with each peer (encrypted of course). That node backup could be sent to all of our peers: this way we only need to remember and reconnect to one of them to discover the others and the list of our channels.

We would need a new message to ask our peers to store an updated version of our node_backup_data. We would send that message for example whenever we've opened a new channel, to add the new channel ID to our backup. Our storage provider peers would send us back the latest version of that node_backup_data in init. At a high-level, does that sound like it would address your first comments?

Regarding the asymmetry of this feature, I think it is both desired and representative of real-world scenarios.
Let's detail for example the scenario where a mobile wallet Alice connects to a storage provider Bob.
Alice would not turn on the peer_backup_storage feature, because Alice doesn't offer to store people's data.
Bob would turn on the peer_backup_storage feature, which tells Alice that she can send her backups.

You're completely right to highlight that the case where both peers store each other's backups doesn't work: if one of them loses data, that will force the other to close channels. I don't think this feature is meant to be used between routing nodes running on servers. This type of node should implement DB redundancy internally to ensure they never lose data (instead of relying on their peers for that). I think this feature should only be used for mobile wallets (or more generally light clients) connecting to "service providers". But your comment made me realize that if two service providers connect to each other they will both have the peer_backup_storage feature activated but should not send backups to each other. I'm wondering if this should simply be covered by implementations (only mobile wallet implementations would actually send their backup data, server implementations would never send them nor expect peers to store their data) or if we should use distinct feature bits for "I can store your data" and "I'd like you to store my data" (but with the somewhat complex restriction that you cannot have two peers turn on both at the same time otherwise they'd run in the issue you mention if one of them loses data). What do you think?

SomberNight · 2021-06-30T11:21:10Z

We would need a new message to ask our peers to store an updated version of our node_backup_data. We would send that message for example whenever we've opened a new channel, to add the new channel ID to our backup. Our storage provider peers would send us back the latest version of that node_backup_data in init. At a high-level, does that sound like it would address your first comments?

Yes, that sounds good. I think it might not be strictly needed, as a workaround could be the light client always waiting for the remote sending channel_reestablish first; and this kind of data (list of your peers and list of channel ids) could be stored as part of the per-channel blob. However, that is a bit hackish and I think it would be worth it to introduce this new message for per-peer backups; IMHO it would even simplify implementations overall. Also, having both per-channel and per-peer backups could enable more use cases in the future. For example, not tying some backups to a specific channel could allow a market where nodes could pay peers they don't have a channel with to store data for them.

if two service providers connect to each other they will both have the peer_backup_storage feature activated but should not send backups to each other. I'm wondering if this should simply be covered by implementations (only mobile wallet implementations would actually send their backup data, server implementations would never send them nor expect peers to store their data) or if we should use distinct feature bits for "I can store your data" and "I'd like you to store my data" (but with the somewhat complex restriction that you cannot have two peers turn on both at the same time otherwise they'd run in the issue you mention if one of them loses data). What do you think?

I think one feature bit is sufficient. It could behave as follows, given a transport between Alice and Bob:

if Alice has the feature set, Bob does not: Bob can send backups to be stored at Alice, Alice must not
if Alice and Bob both have the feature set, neither of them is allowed to send backups to be stored by the other
So the feature bit means "I am willing to store your backups", but if both parties set it then neither can utilise it.
So light clients would never set the feature, "service providers" could always set it (if they implement it); and if two service providers peer, they would recognise it.

One thing to note here is that light clients might want to "require" this feature but there isn't really a way to communicate that fact. Maybe that is fine as they could simply not peer with nodes that do not set it.

Another point is about deleting backups. If Alice ("light client") and Bob ("service provider") have a single channel, and Alice stores backups with Bob, and Bob e.g. force-closes the channel, would Bob now delete Alice's backup? I think Bob could keep the backups for some time or maybe even indefinitely (on a best effort basis) - as even then the service seems hard to DOS: opening channels is probably costly enough even if they end up being closed. I suggest the spec include a recommendation about what to do if there no longer are any open channels with a peer but the peer has stored backups with us.

One higher level point re use cases: I am wondering what the assumptions are when using this for "resumable channels". e.g. restoring from seed and discovering your existing channels, keeping them open and keep using them. Phoenix does this of course; but in that case the clients probably already have some trust in the ACINQ node... What do you think would be the implications of using it with arbitrary nodes? I think it might only work well if there is at least some trust. Unfortunately I don't think the game theory works out well: if the channel funder is the light client (user), there is virtually no risk for the remote for sending an old blob to the client on reestablish. They only risk missing out on future forwarding fees... their potential gain compared to that is huge :/

t-bast · 2021-06-30T12:46:01Z

I think one feature bit is sufficient.

Yes, I think so as well, adding a second one wouldn't be a very clean "fix".

One thing to note here is that light clients might want to "require" this feature but there isn't really a way to communicate that fact. Maybe that is fine as they could simply not peer with nodes that do not set it.

Exactly, I think that instead of requiring it, light clients would just scan node_announcements and connect only to nodes that advertised the feature (if they need peers that can store their backups).

I suggest the spec include a recommendation about what to do if there no longer are any open channels with a peer but the peer has stored backups with us.

This is a good point, the spec should mention that storage providers should (best effort) keep the latest backups stored even after closing the channel. That's what eclair currently does, as it doesn't cost much space (and as you mention, it's not subject to DoS since the size of backups is limited by the message size and there is an economic cost in the channel creation).

I'll add a section mentioning that.

What do you think would be the implications of using it with arbitrary nodes? I think it might only work well if there is at least some trust. Unfortunately I don't think the game theory works out well: if the channel funder is the light client (user), there is virtually no risk for the remote for sending an old blob to the client on reestablish. They only risk missing out on future forwarding fees...

I agree with you, this backup mechanism isn't completely bullet-proof. You can check that your peer behaves correctly at each channel_reestablish and init, but if they misbehave they won't be directly economically penalized (there's no penalty fee that you can automatically claim). As you mention, the only drawback for the cheating storage provider is that you won't use their services, and will probably tell your friends to avoid them as well. Is that a good enough incentive? I don't know, but I'm not sure we can do better...

I'll work on adding a commit to include the node_backup_data and the new associated message.

t-bast · 2021-07-07T16:47:47Z

@SomberNight I add the node_backup and several other requirements and explanations in 18bd2af, let me know if that's better.

01-messaging.md

rustyrussell · 2021-07-19T03:11:09Z

I like this! But I think we should downgrade it to strictly "best effort", at least for peer_backup.

If they give you an old state, don't get upset. This can happen anyway, since there's no ack.
Require only best effort, not require them to perform a db commit on every new update.
Importantly, this lets peers provide it for each other, without fear.
Perhaps we should have an "append" flag to reduce data traffic? (The total size must be less than 32000 bytes still?).

The way I would use this feature is to store the commitment number in every peer_backup, and update at least three peers on every send of revoke_and_ack. That is the critical piece of information so we don't accidentally go backwards on restore.

Then have a rule on restart that insists on a threshold of init msgs before sending reestablish messages.

I also think it would be informative (perhaps in a new doc) to indicate the format implementations used: a TLV seems to fit pretty well here, IMHO, but details of encryption matter too.

rustyrussell · 2021-07-27T01:08:11Z

Features clash with #759 BTW!

t-bast · 2021-08-10T08:25:12Z

I like this! But I think we should downgrade it to strictly "best effort", at least for peer_backup.
If they give you an old state, don't get upset. This can happen anyway, since there's no ack.

You're right, I made it best effort in 3129979

Perhaps we should have an "append" flag to reduce data traffic?

It's worth discussing, if we think it makes sense we should include it now in the backup tlv (to avoid adding a new tlv later for it).
I'm afraid it's useless with encrypted backups, so it only makes sense for unencrypted data (array of (channel_id, commitment_number)), but I personally would only send encrypted data to avoid leaking anything (just in case).

I also think it would be informative (perhaps in a new doc) to indicate the format implementations used: a TLV seems to fit pretty well here, IMHO, but details of encryption matter too.

I agree, bLIPs are likely a good place for that.

Features clash with #759 BTW!

Fixed in 42e68e0

I realized that my calculation for the maximum size for backups is completely wrong...
I considered a commitment_signed message with 483 htlc signatures, but it can actually contain two times that since there can be 483 incoming htlcs and 483 outgoing htlcs 🤦 (even though nodes may configure a lower max_accepted_htlcs to ensure there is some space available for backup data).

When that happens, there isn't much space available in the commitment_signed message for backup data (especially if we want to leave some space for future additional tlv fields). Maybe that's ok as you will soon follow up with a revoke_and_ack where you'll have enough space to send your backup data...

Should we still define an arbitrary maximum size for backup data (e.g. 32768 bytes, powers of two are nice) and add a requirement that the sender must not include the backup if that would cause the message to be bigger than the 65535 bytes message size limit? Or should we completely drop the maximum size for backup data and let it be bounded by the message size limit (65535 bytes)?

akumaigorodski · 2021-08-23T08:43:50Z

@t-bast

We could store in the node backup the list of our peers and the channel IDs we have with each peer (encrypted of course). That node backup could be sent to all of our peers: this way we only need to remember and reconnect to one of them to discover the others and the list of our channels.

It seems to me this is actually the most important part at least for as long as we need to store revoked commit history for each channel. That history can't be attached to channel messages TLV and restoring and then keep using channels without it is obviously dangerous, they better be closed ASAP.

OTOH an encrypted list of peers is small and wallet user won't need to remember all peers they had channels with on wallet recovery, which is very useful.

rustyrussell · 2022-03-10T02:17:05Z

Don't add TLVs everywhere, simply use a separate message? That increases the limit (to its natural amount), as well.

t-bast commented Jul 7, 2021

View reviewed changes

01-messaging.md Outdated Show resolved Hide resolved

t-bast mentioned this pull request Jul 9, 2021

Asymmetric features #885

Closed

t-bast changed the title ~~Peer backup storage (feature 36/37)~~ Peer backup storage (feature 36/37/38/39) Jul 12, 2021

t-bast mentioned this pull request Jul 16, 2021

Lightning Specification Meeting 2021/07/19 #888

Closed

15 tasks

t-bast changed the title ~~Peer backup storage (feature 36/37/38/39)~~ Peer backup storage (feature 40/41/42/43) Aug 10, 2021

t-bast mentioned this pull request Aug 10, 2021

Lightning Specification Meeting 2021/08/16 #893

Closed

18 tasks

t-bast mentioned this pull request Aug 27, 2021

Lightning Specification Meeting 2021/08/30 #901

Closed

15 tasks

t-bast mentioned this pull request Sep 8, 2021

Lightning Specification Meeting 2021/09/13 #909

Closed

19 tasks

t-bast mentioned this pull request Sep 23, 2021

Lightning Specification Meeting 2021/09/27 #916

Closed

19 tasks

t-bast mentioned this pull request Oct 6, 2021

Lightning Specification Meeting 2021/10/11 #920

Closed

19 tasks

t-bast mentioned this pull request Oct 21, 2021

Lightning Specification Meeting 2021/10/25 #929

Closed

21 tasks

t-bast mentioned this pull request Nov 5, 2021

Lightning Specification Meeting 2021/11/08 #933

Closed

21 tasks

t-bast mentioned this pull request Nov 22, 2021

Lightning Specification Meeting 2021/11/22 #936

Closed

21 tasks

t-bast mentioned this pull request Dec 6, 2021

Lightning Specification Meeting 2021/12/06 #943

Closed

21 tasks

t-bast mentioned this pull request Dec 14, 2021

Lightning Specification Meeting 2021/12/20 #945

Closed

19 tasks

t-bast mentioned this pull request Jan 3, 2022

Lightning Specification Meeting 2022/01/03 #949

Closed

21 tasks

t-bast mentioned this pull request Jan 12, 2022

Lightning Specification Meeting 2022/01/17 #952

Closed

18 tasks

t-bast mentioned this pull request Jan 27, 2022

Lightning Specification Meeting 2022/01/31 #955

Closed

17 tasks

t-bast mentioned this pull request Feb 9, 2022

Lightning Specification Meeting 2022/02/14 #957

Closed

17 tasks

t-bast mentioned this pull request Feb 25, 2022

Lightning Specification Meeting 2022/02/28 #965

Closed

19 tasks

t-bast mentioned this pull request Aug 23, 2023

Lightning Specification Meeting 2023/08/28 #1103

Closed

23 tasks

t-bast mentioned this pull request Sep 8, 2023

Lightning Specification Meeting 2023/09/11 #1107

Closed

24 tasks

adi2011 mentioned this pull request Sep 15, 2023

Peer storage for nodes to distribute small encrypted blobs. #1110

Open

t-bast mentioned this pull request Sep 21, 2023

Lightning Specification Meeting 2023/09/25 #1114

Closed

24 tasks

t-bast mentioned this pull request Oct 5, 2023

Lightning Specification Meeting 2023/10/23 #1115

Closed

22 tasks

t-bast mentioned this pull request Oct 30, 2023

Lightning Specification Meeting 2023/11/06 #1116

Closed

20 tasks

t-bast mentioned this pull request Nov 16, 2023

Lightning Specification Meeting 2023/11/20 #1118

Closed

20 tasks

t-bast mentioned this pull request Dec 4, 2023

Lightning Specification Meeting 2023/12/04 #1122

Closed

20 tasks

t-bast mentioned this pull request Dec 13, 2023

Lightning Specification Meeting 2023/12/18 #1124

Closed

20 tasks

t-bast mentioned this pull request Jan 8, 2024

Lightning Specification Meeting 2024/01/15 #1127

Closed

21 tasks

t-bast mentioned this pull request Jan 22, 2024

Lightning Specification Meeting 2024/01/29 #1129

Closed

19 tasks

t-bast mentioned this pull request Feb 5, 2024

Lightning Specification Meeting 2024/02/12 #1134

Closed

22 tasks

t-bast mentioned this pull request Feb 21, 2024

Lightning Specification Meeting 2024/02/26 #1142

Closed

21 tasks

t-bast mentioned this pull request Mar 7, 2024

Lightning Specification Meeting 2024/03/11 #1146

Closed

21 tasks

t-bast mentioned this pull request Mar 20, 2024

Lightning Specification Meeting 2024/03/25 #1150

Closed

21 tasks

ellemouton mentioned this pull request Apr 2, 2024

Peer backup lightningnetwork/lnd#8490

Open

8 tasks

Roasbeef mentioned this pull request Apr 8, 2024

Lightning Specification Meeting 2024/04/08 #1152

Closed

21 tasks

t-bast mentioned this pull request Apr 18, 2024

Lightning Specification Meeting 2024/04/22 #1155

Closed

22 tasks

t-bast mentioned this pull request May 2, 2024

Lightning Specification Meeting 2024/05/06 #1161

Closed

25 tasks

t-bast mentioned this pull request May 14, 2024

Lightning Specification Meeting 2024/05/20 #1164

Closed

23 tasks

t-bast mentioned this pull request Jun 3, 2024

Lightning Specification Meeting 2024/06/03 #1167

Closed

23 tasks

t-bast mentioned this pull request Jun 17, 2024

Lightning Specification Meeting 2024/06/17 #1172

Closed

22 tasks

t-bast mentioned this pull request Jun 26, 2024

Lightning Specification Meeting 2024/07/01 #1175

Closed

23 tasks

t-bast mentioned this pull request Jul 12, 2024

Lightning Specification Meeting 2024/07/15 #1183

Closed

22 tasks

t-bast mentioned this pull request Jul 23, 2024

Lightning Specification Meeting 2024/07/29 #1185

Closed

23 tasks

t-bast mentioned this pull request Aug 9, 2024

Lightning Specification Meeting 2024/08/12 #1187

Closed

21 tasks

t-bast mentioned this pull request Aug 23, 2024

Lightning Specification Meeting 2024/08/26 #1191

Closed

20 tasks

t-bast mentioned this pull request Sep 6, 2024

Lightning Specification Meeting 2024/09/09 #1195

Closed

20 tasks

t-bast mentioned this pull request Oct 16, 2024

Lightning Specification Meeting 2024/11/04 #1206

Closed

20 tasks

t-bast mentioned this pull request Nov 22, 2024

Lightning Specification Meeting 2024/12/02 #1210

Open

19 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Peer backup storage (feature 40/41/42/43) #881

Peer backup storage (feature 40/41/42/43) #881

t-bast commented Jun 28, 2021 •

edited

Loading

SomberNight commented Jun 29, 2021 •

edited

Loading

t-bast commented Jun 30, 2021

SomberNight commented Jun 30, 2021

t-bast commented Jun 30, 2021

t-bast commented Jul 7, 2021

rustyrussell commented Jul 19, 2021 •

edited

Loading

rustyrussell commented Jul 27, 2021

t-bast commented Aug 10, 2021

akumaigorodski commented Aug 23, 2021 •

edited

Loading

rustyrussell commented Mar 10, 2022

Peer backup storage (feature 40/41/42/43) #881

Are you sure you want to change the base?

Peer backup storage (feature 40/41/42/43) #881

Conversation

t-bast commented Jun 28, 2021 • edited Loading

SomberNight commented Jun 29, 2021 • edited Loading

t-bast commented Jun 30, 2021

SomberNight commented Jun 30, 2021

t-bast commented Jun 30, 2021

t-bast commented Jul 7, 2021

rustyrussell commented Jul 19, 2021 • edited Loading

rustyrussell commented Jul 27, 2021

t-bast commented Aug 10, 2021

akumaigorodski commented Aug 23, 2021 • edited Loading

rustyrussell commented Mar 10, 2022

t-bast commented Jun 28, 2021 •

edited

Loading

SomberNight commented Jun 29, 2021 •

edited

Loading

rustyrussell commented Jul 19, 2021 •

edited

Loading

akumaigorodski commented Aug 23, 2021 •

edited

Loading