Handle monitor update failures in two more places + new fuzz test #290

TheBlueMatt · 2019-01-13T23:24:16Z

Based on #285, #286, and #288, this adds handling of monitor update failures in two more places, with some trivial tests to check sanity thereof. It also finally adds the fuzz test which found the issues in #286 and #288, though it really wants rust-bitcoin/rust-secp256k1#89 to fully test things. This leaves only three non-fee-handling unimplemented!()s in ChannelManager (and they're all during channel setup, so are much easier to handling).

ariard · 2019-01-21T00:22:46Z

src/ln/channelmanager.rs

@@ -2601,6 +2601,25 @@ impl ChannelMessageHandler for ChannelManager {
 					true
 				})
 			}
+			pending_msg_events.retain(|msg| {


Hmm wandering, if client send stale message before channel_reestablish ones, isn't this mean that a code path is wrong somewhere ? Or it's not something we can be sure of because of the library architecture ?

Not strictly. In theory a user could get a disconnect from a peer, reconnect to that peer, and then process events, seeing the message send events for a given node_id (and not connection). I'm not sure if there aren't other related races, but at least this improves things.

Okay. Yes sure better to be careful and it cost nothing there. Moreover if user doesn't take our peer_handler, it may break some of our assumptions.

ariard

More learnt than reviewed, monitor failures interaction with channel state machine are really interesting.
As you said, there is only left pre-ChannelFunded unimplemented! cases (and fee). Had a look to be sure, monitor update failure doesn't interfere with closing phase, at least after there is no HLTCs left on commiment_transaction. We can add a little test with a failure after shutdown have been exchanged on both side, but should be alright.

ariard · 2019-01-21T01:08:13Z

src/ln/channelmanager.rs

-								commitment_signed: commitment_msg,
-							},
-						});
+					} else {


Build without unreachable branch (1.22.0)

Sure, it will build without it, but having it in there means if there is a refactor in the future that breaks things we'll see panics instead of losing funds :p.

Ah astute! Maybe add a comment there with your answer

I'm actually gonna leave it. We use this pattern in a few places, and hopefully "unreachable!()" is rather self-documenting. In general, across the library, we'd prefer to panic than have a potentially money-losing bug, so this isn't particularly out of the ordinary.

ariard · 2019-01-21T01:09:41Z

src/ln/channelmanager.rs

-						//TODO: Do something with e?
-						return
-					},
+					} else { unreachable!(); }


Build without unreachable branch (1.22.0)

src/ln/channelmanager.rs

Best reviewed with -b

This shouldn't be required, but it may help prevent some downstream race conditions due to clients not sending message events quickly enough and trying to send stale messages before new channel_reestablish messages.

This is an oversight as the MessageSendEvent is otherwise entirely useless.

Sadly this requires reducing the honggfuzz iterations to fit within Travis' runtime limits.

TheBlueMatt changed the title ~~2019 01 monitor update handle fuzz~~ Handle monitor update failures in two more places + new fuzz test Jan 13, 2019

TheBlueMatt force-pushed the 2019-01-monitor-update-handle-fuzz branch from 691bf82 to 72dd991 Compare January 15, 2019 21:43

TheBlueMatt mentioned this pull request Jan 18, 2019

Upgrade to secp256k1 v12, bitcoin v16, and crates bitcoin_hashes #294

Merged

ariard reviewed Jan 21, 2019

View reviewed changes

TheBlueMatt added this to the 0.0.8 milestone Jan 21, 2019

TheBlueMatt force-pushed the 2019-01-monitor-update-handle-fuzz branch 9 times, most recently from 4a2c20d to bfe9f1d Compare January 24, 2019 16:51

TheBlueMatt added 5 commits January 24, 2019 13:16

Handle monitor update failures in two more places

a138a9a

Best reviewed with -b

Drop pending outbound messages on peer disconnection

1bc190c

This shouldn't be required, but it may help prevent some downstream race conditions due to clients not sending message events quickly enough and trying to send stale messages before new channel_reestablish messages.

Expose CommitmentUpdate contents

9a72207

This is an oversight as the MessageSendEvent is otherwise entirely useless.

Take the logger from test_utils into fuzz::test_utils

aa9a848

Add a fuzz target to test monitor update failure handling

49d6330

Sadly this requires reducing the honggfuzz iterations to fit within Travis' runtime limits.

TheBlueMatt force-pushed the 2019-01-monitor-update-handle-fuzz branch from bfe9f1d to 49d6330 Compare January 24, 2019 18:19

TheBlueMatt merged commit 4ccb1e4 into lightningdevkit:master Jan 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle monitor update failures in two more places + new fuzz test #290

Handle monitor update failures in two more places + new fuzz test #290

TheBlueMatt commented Jan 13, 2019

ariard Jan 21, 2019

TheBlueMatt Jan 21, 2019

ariard Jan 21, 2019

ariard left a comment

ariard Jan 21, 2019

TheBlueMatt Jan 21, 2019

ariard Jan 21, 2019

TheBlueMatt Jan 22, 2019

ariard Jan 21, 2019

Handle monitor update failures in two more places + new fuzz test #290

Handle monitor update failures in two more places + new fuzz test #290

Conversation

TheBlueMatt commented Jan 13, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ariard left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment