Only sync with channel peers #1587

t-bast · 2020-11-05T14:55:01Z

Our previous behaviour was to do a full routing table sync whenever we connected to a node (or a node connected to us).
This was wasting a lot of bandwidth for no reason.

Instead, we only sync with nodes that are either:

- in our sync whitelist
- have channels with us

On top of that, we trigger a sync whenever our first channel is opened with a new peer. This ensures the following flow works:

1. connect to a new node
2. we don't have channels together yet, so we're not syncing
3. open a channel
4. once the channel is open, sync the routing table
5. we can now make payments (yay!)
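
The rule and the flow above can be sketched as a small pure model. This is only an illustration under simplifying assumptions, not the code in this PR: `SyncPolicySketch`, `State`, `shouldSyncOnConnect` and `channelOpened` are hypothetical names, and node ids are plain strings.

```scala
// Hypothetical, simplified model of the sync policy; not eclair's actual API.
object SyncPolicySketch {
  type NodeId = String

  // whitelist: nodes we always sync with; channelCounts: number of channels per peer.
  final case class State(whitelist: Set[NodeId], channelCounts: Map[NodeId, Int])

  /** On connection, sync only with whitelisted nodes or nodes we have channels with. */
  def shouldSyncOnConnect(s: State, peer: NodeId): Boolean =
    s.whitelist.contains(peer) || s.channelCounts.getOrElse(peer, 0) > 0

  /** Records a newly opened channel. The flag is true only for the first channel
    * with that peer, which is when a routing table sync should be triggered. */
  def channelOpened(s: State, peer: NodeId): (State, Boolean) = {
    val previous = s.channelCounts.getOrElse(peer, 0)
    val updated = s.copy(channelCounts = s.channelCounts.updated(peer, previous + 1))
    (updated, previous == 0)
  }
}
```

In this model, connecting to an unknown node does not sync, opening the first channel returns `true` so the caller can start a sync, and later channels with the same peer return `false`.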

The syncing state is a bit hard to read. We've started discussions in the spec to make changes to channel queries (see lightning/bolts#804 and lightning/bolts#811); that will be a good opportunity to refactor the code later.

eclair-core/src/main/scala/fr/acinq/eclair/io/Peer.scala

eclair-core/src/main/scala/fr/acinq/eclair/remote/EclairInternalsSerializer.scala

codecov-io · 2020-12-08T18:08:36Z

Codecov Report

Merging #1587 (5d388ad) into master (34e901d) will decrease coverage by 0.01%.
The diff coverage is 93.33%.

@@            Coverage Diff             @@
##           master    #1587      +/-   ##
==========================================
- Coverage   85.93%   85.91%   -0.02%     
==========================================
  Files         151      151              
  Lines       11430    11452      +22     
  Branches      497      488       -9     
==========================================
+ Hits         9822     9839      +17     
- Misses       1608     1613       +5

Impacted Files	Coverage Δ
...cinq/eclair/remote/EclairInternalsSerializer.scala	`97.43% <ø> (ø)`
...a/fr/acinq/eclair/wire/LightningMessageTypes.scala	`86.36% <ø> (ø)`
...main/scala/fr/acinq/eclair/router/Validation.scala	`90.76% <33.33%> (-1.90%)`	⬇️
...src/main/scala/fr/acinq/eclair/router/Router.scala	`93.58% <50.00%> (+0.03%)`	⬆️
...main/scala/fr/acinq/eclair/io/PeerConnection.scala	`81.89% <81.81%> (-0.04%)`	⬇️
...-core/src/main/scala/fr/acinq/eclair/io/Peer.scala	`89.83% <100.00%> (+0.16%)`	⬆️
...rc/main/scala/fr/acinq/eclair/io/Switchboard.scala	`81.08% <100.00%> (+2.95%)`	⬆️
...e/src/main/scala/fr/acinq/eclair/router/Sync.scala	`98.36% <100.00%> (+0.06%)`	⬆️
...nq/eclair/blockchain/electrum/ElectrumWallet.scala	`80.25% <0.00%> (-0.26%)`	⬇️
... and 1 more

pm47

Just a first pass.

eclair-core/src/main/scala/fr/acinq/eclair/io/Switchboard.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Router.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Sync.scala

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Sync.scala

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Router.scala

This reduces the bandwidth used: it doesn't make sense to sync with every node that connects to us. We also better track sync requests, to reject unsolicited sync responses. To ensure that nodes don't need to explicitly reconnect after creating their first channel in order to get the routing table, we add a mechanism to trigger a sync when the first channel is created.

If our peer doesn't support gossip queries, we'll already have synced with them using `initial_routing_sync` right after `init`.

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerEvents.scala

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

eclair-core/src/main/scala/fr/acinq/eclair/remote/EclairInternalsSerializer.scala

eclair-core/src/main/scala/fr/acinq/eclair/io/Switchboard.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Router.scala

pm47 · 2021-02-03T15:50:46Z

eclair-core/src/main/scala/fr/acinq/eclair/router/Sync.scala

+      case Some(currentSync) if currentSync.remainingQueries.isEmpty && r.shortChannelIds.array.isEmpty =>
+        log.info("received empty reply_channel_range, sync is complete")
+        d.copy(sync = d.sync - origin.nodeId)


This isn't how the spec indicates that we should detect the last reply_channel_range message:

the final reply_channel_range message:
MUST have first_blocknum plus number_of_blocks equal or greater than the query_channel_range first_blocknum plus number_of_blocks.
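
For illustration, the spec condition quoted above can be written as a small predicate. This is a sketch, not eclair's code: the case classes and `isFinalReply` are hypothetical and model only the two block-range fields of the messages. The fields are held as `Long` so the sums cannot overflow.

```scala
// Hypothetical model of the block-range fields of query_channel_range / reply_channel_range.
object ReplyRangeSketch {
  final case class QueryChannelRange(firstBlockNum: Long, numberOfBlocks: Long)
  final case class ReplyChannelRange(firstBlockNum: Long, numberOfBlocks: Long)

  /** True when the reply reaches (or goes past) the end of the queried block range. */
  def isFinalReply(query: QueryChannelRange, reply: ReplyChannelRange): Boolean =
    reply.firstBlockNum + reply.numberOfBlocks >= query.firstBlockNum + query.numberOfBlocks
}
```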

Right, this is probably a leftover made to handle nodes that didn't correctly signal termination?
I'll look into it.
Note that we'll be bringing back the complete field which will greatly simplify our lives.

This case doesn't handle the normal termination of reply_channel_range, it deals with a special case where the first reply_channel_range is empty (meaning the remote node doesn't want to send us anything basically). Otherwise we'll have something in remainingQueries and we'll wait for these to complete before we consider the sync done. I had to add this because we have a test case for it.

An interesting note is that we don't detect the end of a reply_channel_range at all: we rely on reply_short_channel_ids_end and the fact that our remainingQueries is empty. That's different from the spec and should be fixed. In practice it will very likely work (we've always had this behavior and we've been fine) and it's orthogonal to this PR since it was our previous behavior. I think that can be fixed in a later PR (and we can wait to rely on the complete field for a cleaner signal that the sync is complete once we correctly set it again).
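
A rough model of the behavior described above, assuming a much simpler state than the real router: `SyncTrackingSketch`, `Syncing` and both handlers are hypothetical, and `remainingQueries` is represented as a list of short channel id batches still to be queried. It shows the three points from the discussion: replies from peers we never queried are ignored, an empty first `reply_channel_range` ends the sync, and otherwise the sync only ends once `reply_short_channel_ids_end` has drained `remainingQueries`.

```scala
// Hypothetical, simplified model; not the actual Sync.scala logic.
object SyncTrackingSketch {
  type NodeId = String

  // Batches of short channel ids that still have to be queried.
  final case class Syncing(remainingQueries: List[List[Long]])

  def onReplyChannelRange(sync: Map[NodeId, Syncing], origin: NodeId, shortChannelIds: List[Long]): Map[NodeId, Syncing] =
    sync.get(origin) match {
      case None =>
        sync // we never sent a query to this peer: ignore the unsolicited reply
      case Some(current) if current.remainingQueries.isEmpty && shortChannelIds.isEmpty =>
        sync - origin // empty first reply: the peer has nothing to send, sync is complete
      case Some(_) if shortChannelIds.isEmpty =>
        sync // nothing new to query, keep waiting for pending queries
      case Some(current) =>
        sync.updated(origin, Syncing(current.remainingQueries :+ shortChannelIds))
    }

  def onReplyShortChannelIdsEnd(sync: Map[NodeId, Syncing], origin: NodeId): Map[NodeId, Syncing] =
    sync.get(origin) match {
      case None => sync
      case Some(current) =>
        val rest = current.remainingQueries.drop(1)
        // The sync is considered done only when no queries remain.
        if (rest.isEmpty) sync - origin else sync.updated(origin, Syncing(rest))
    }
}
```

Note that nothing here looks at the block range of the reply, which is the gap with the spec pointed out above.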

Maybe you should include your reply as a TODO note here?

Done in 8f292b9 and added to my todo list.

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

pm47 · 2021-02-04T13:38:55Z


t-bast commented Nov 5, 2020

eclair-core/src/main/scala/fr/acinq/eclair/io/Peer.scala

t-bast requested a review from sstone November 5, 2020 14:56

t-bast force-pushed the sync-nodes-with-channels branch from 2aae03f to d1261e9 Compare December 8, 2020 17:54

t-bast commented Dec 8, 2020

eclair-core/src/main/scala/fr/acinq/eclair/remote/EclairInternalsSerializer.scala

t-bast requested a review from pm47 December 16, 2020 07:55

pm47 reviewed Dec 16, 2020

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Sync.scala

t-bast mentioned this pull request Jan 7, 2021

Correctly set gossip sync_complete #1668

Merged

t-bast force-pushed the sync-nodes-with-channels branch from daf7bbf to cceb084 Compare January 18, 2021 11:36

sstone reviewed Jan 18, 2021

eclair-core/src/main/scala/fr/acinq/eclair/io/PeerConnection.scala

eclair-core/src/main/scala/fr/acinq/eclair/router/Router.scala

sstone previously approved these changes Jan 22, 2021

t-bast added 3 commits January 22, 2021 11:19

First channel sync only with gossip queries

8255765

If our peer doesn't support gossip queries, we'll already have synced with them using `initial_routing_sync` right after `init`.

Switchboard remove peer when last channel closes

3b6819f

t-bast dismissed sstone’s stale review via 3b6819f January 22, 2021 10:20

t-bast force-pushed the sync-nodes-with-channels branch from 2b5c079 to 3b6819f Compare January 22, 2021 10:20

pm47 reviewed Feb 3, 2021

t-bast added 3 commits February 3, 2021 17:21

PeerLastChannelClosed -> LastChannelClosed

a632b71

More functional Switchboard internals

c87bbe4

Remove duplicate sync code in PeerConnection

ea4f2a3

t-bast mentioned this pull request Feb 4, 2021

Remove support for initial_routing_sync #1683

Merged

pm47 previously approved these changes Feb 4, 2021

nits

8f292b9

t-bast dismissed pm47’s stale review via 8f292b9 February 4, 2021 14:12

pm47 approved these changes Feb 4, 2021

t-bast merged commit ac054a2 into master Feb 4, 2021

t-bast deleted the sync-nodes-with-channels branch February 4, 2021 14:27
