lnd+walletunlocker: make unlock/init operations synchronous #4349

Roasbeef · 2020-06-03T02:52:08Z

In this commit, we fix a deadlock that can happen if a user attempts to
init then rapidly unlock a wallet right after. In my profiles, it seems
the lnd gets caught up on the bbolt flock, which deadlocks the entire
process. We fix this issue by making the Init/Unlock calls now fully
synchronous. Only a single outstanding request can exist across the
entire wallet unlocker service now.

Fixes #4340.

Fixes #3631.

Roasbeef · 2020-06-03T05:18:02Z

As is, this makes unlock/init fully synchronous. If we want to leave it as async, then a goroutine can be added in the unlocker service to release the mutex in the background after the operation is complete.

wpaulino · 2020-06-04T01:14:42Z

walletunlocker/service.go

@@ -313,6 +342,9 @@ func (u *UnlockerService) InitWallet(ctx context.Context,
 func (u *UnlockerService) UnlockWallet(ctx context.Context,
 	in *lnrpc.UnlockWalletRequest) (*lnrpc.UnlockWalletResponse, error) {

+	u.Lock()
+	defer u.Unlock()


I think we may also need another signal/bool at the UnlockerService level that indicates the wallet has been initialized/unlocked so that we can check it here and everywhere else, otherwise it seems like we still risk attempting to open the wallet twice.

What do you mean? Push things down even further? With where things are atm, we won't return back to the caller until the wallet has been fully initialized. However, for unlock we return a bit earlier once we have all the credentials. In my testing, the concurrent init was was ended up tripping things up.

For unlock, things only become "fully finalized" once we create the chain control.

Actually, as soon as this method returns, unlocking isn't even possible since the listener service only has a lifetime of this call.

walletunlocker/service.go

walletunlocker/service_test.go

In this commit, we fix a deadlock that can happen if a user attempts to init then rapidly unlock a wallet right after. In my profiles, it seems the lnd gets caught up on the bbolt flock, which deadlocks the entire process. We fix this issue by making the Init/Unlock calls now fully synchronous. Only a single outstanding request can exist across the entire wallet unlocker service now. Fixes lightningnetwork#4330. Fixes lightningnetwork#3631.

cfromknecht · 2020-06-10T10:41:19Z

walletunlocker/service_test.go

@@ -472,7 +504,20 @@ func TestChangeWalletPassword(t *testing.T) {
 			t.Fatalf("expected to receive password %x, got %x",
 				testPassword, unlockMsg.Passphrase)
 		}
+
+		// We'll now close the done channel to unlock the service to be
+		// able to accept another request.


from your comment on the pr it seems the unlocker can only process one request, but several comments in the code refer to processing more requests. is it the case that the wallet unlocker could process more requests, but we only allow one due to the defer cancel() in waitForWalletPassword?

cfromknecht · 2020-06-10T10:46:58Z

walletunlocker/service.go

@@ -302,7 +323,19 @@ func (u *UnlockerService) InitWallet(ctx context.Context,
 		initMsg.ChanBackups = *chansToRestore
 	}

-	u.InitMsgs <- initMsg
+	select {


add helper to avoid having 3 copies of the same code?

guess there are really two versions, one for init and one for unlock...

wpaulino · 2020-06-10T23:31:30Z

walletunlocker/service.go

-	u.UnlockMsgs <- walletUnlockMsg
+	select {
+	case u.UnlockMsgs <- walletUnlockMsg:
+	case <-u.quitChan:


This is never closed.

Ah I see, the main shutdown chan is passed to it.

wpaulino · 2020-06-10T23:37:42Z

lnd.go

@@ -1124,6 +1130,13 @@ func waitForWalletPassword(cfg *Config, restEndpoints []net.Addr,
 	// The wallet has already been created in the past, and is simply being
 	// unlocked. So we'll just return these passphrases.
 	case unlockMsg := <-pwService.UnlockMsgs:
+


I was still able to reproduce the issue with the current changes. The issue doesn't seem to be related to the init call being asynchronous, but rather with the UnlockerService having a queued UnlockWallet call that it doesn't cancel after InitWallet has been called. If we had a UnlockerService.Quit method, we could call it here and check that it's been closed after the lock has been acquired at the UnlockerService level.

Roasbeef · 2021-03-11T19:57:25Z

Replaced by #4985

Roasbeef added rpc Related to the RPC interface wallet The wallet (lnwallet) which LND uses bug fix v0.11 labels Jun 3, 2020

Roasbeef added this to the 0.11.0 milestone Jun 3, 2020

Roasbeef requested review from wpaulino and carlaKC June 3, 2020 02:52

wpaulino reviewed Jun 4, 2020

View reviewed changes

carlaKC reviewed Jun 4, 2020

View reviewed changes

walletunlocker/service.go Show resolved Hide resolved

walletunlocker/service_test.go Show resolved Hide resolved

Roasbeef requested review from carlaKC and wpaulino June 9, 2020 21:09

Roasbeef force-pushed the sync-wallet-unlock branch from 3b4f025 to d533851 Compare June 9, 2020 21:09

cfromknecht reviewed Jun 10, 2020

View reviewed changes

wpaulino reviewed Jun 10, 2020

View reviewed changes

Roasbeef modified the milestones: 0.11.0, 0.12.0 Jul 2, 2020

Roasbeef removed the v0.11 label Jul 2, 2020

Roasbeef modified the milestones: 0.12.0, 0.13.0 Nov 4, 2020

Roasbeef added P2 should be fixed if one has time P1 MUST be fixed or reviewed labels Jan 28, 2021

cfromknecht added the v0.13 label Feb 18, 2021

halseth removed the P1 MUST be fixed or reviewed label Feb 25, 2021

Roasbeef closed this Mar 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lnd+walletunlocker: make unlock/init operations synchronous #4349

lnd+walletunlocker: make unlock/init operations synchronous #4349

Roasbeef commented Jun 3, 2020 •

edited by wpaulino

Loading

Roasbeef commented Jun 3, 2020

wpaulino Jun 4, 2020

Roasbeef Jun 9, 2020

Roasbeef Jun 9, 2020

cfromknecht Jun 10, 2020

cfromknecht Jun 10, 2020

cfromknecht Jun 10, 2020

wpaulino Jun 10, 2020

wpaulino Jun 10, 2020

wpaulino Jun 10, 2020

Roasbeef commented Mar 11, 2021

lnd+walletunlocker: make unlock/init operations synchronous #4349

lnd+walletunlocker: make unlock/init operations synchronous #4349

Conversation

Roasbeef commented Jun 3, 2020 • edited by wpaulino Loading

Roasbeef commented Jun 3, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Roasbeef commented Mar 11, 2021

Roasbeef commented Jun 3, 2020 •

edited by wpaulino

Loading