ChainDB: add blocks asynchronously #1709

mrBliss · 2020-02-27T12:39:56Z

Instead of adding blocks synchronously, they are now put into a queue, after
which addBlockAsync returns an AddBlockResult, which can be used to wait
until the block has been processed.

A background thread will read the blocks from the queue and add them
synchronously to the ChainDB. The queue is limited in size; when it is full,
callers of addBlockAsync might still have to wait.

With this asynchronous approach, threads adding blocks asynchronously can be
killed without worries, the background thread processing the blocks
synchronously won't be killed. Only when the whole ChainDB shuts down will
that background thread get killed. But since there will be no more in-memory
state, it can't get out of sync with the file system state. On the next
startup, a correct in-memory state will be reconstructed from the file system
state.

By letting the BlockFetchClient add blocks asynchronously, we also get a
20-40% bulk chain sync speed-up in some microbenchmarks.

See `lengthTBQueueDefault` and `MonadSTMTxExtended` for more information.

ouroboros-consensus/src/Ouroboros/Consensus/Storage/ChainDB/API.hs

ouroboros-consensus/src/Ouroboros/Consensus/Storage/ChainDB/Impl/Types.hs

ouroboros-consensus/src/Ouroboros/Consensus/Storage/ChainDB/Impl/ChainSel.hs

ouroboros-consensus/src/Ouroboros/Consensus/Storage/ChainDB/Impl/Args.hs

Fixes #1463. Instead of adding blocks synchronously, they are now put into a queue, after which `addBlockAsync` returns an `AddBlockPromise`, which can be used to wait until the block has been processed. A background thread will read the blocks from the queue and add them synchronously to the ChainDB. The queue is limited in size; when it is full, callers of `addBlockAsync` might still have to wait. With this asynchronous approach, threads adding blocks asynchronously can be killed without worries, the background thread processing the blocks synchronously won't be killed. Only when the whole ChainDB shuts down will that background thread get killed. But since there will be no more in-memory state, it can't get out of sync with the file system state. On the next startup, a correct in-memory state will be reconstructed from the file system state. By letting the BlockFetchClient add blocks asynchronously, we also get a 20-40% bulk chain sync speed-up in some microbenchmarks.

edsko

Reviewed over hangout (made comments off-line as GitHub was having difficulties).

edsko · 2020-02-27T15:55:26Z

bors r+

1709: ChainDB: add blocks asynchronously r=edsko a=mrBliss Fixes #1463. Instead of adding blocks synchronously, they are now put into a queue, after which `addBlockAsync` returns an `AddBlockResult`, which can be used to wait until the block has been processed. A background thread will read the blocks from the queue and add them synchronously to the ChainDB. The queue is limited in size; when it is full, callers of `addBlockAsync` might still have to wait. With this asynchronous approach, threads adding blocks asynchronously can be killed without worries, the background thread processing the blocks synchronously won't be killed. Only when the whole ChainDB shuts down will that background thread get killed. But since there will be no more in-memory state, it can't get out of sync with the file system state. On the next startup, a correct in-memory state will be reconstructed from the file system state. By letting the BlockFetchClient add blocks asynchronously, we also get a 20-40% bulk chain sync speed-up in some microbenchmarks. Co-authored-by: Thomas Winant <[email protected]>

iohk-bors · 2020-02-27T16:12:10Z

Build succeeded

mrBliss · 2020-02-27T17:40:27Z

This PR actually got merged, but because of GitHub's recent reliability problems, this PR is not aware of that.

Previously, we knew the current slot and were able to tell that a block was from the future by comparing the block's slot against the current slot. For such blocks we would schedule a chain selection at the block's slot, which would be performed by a background thread. Now, we no longer know the current slot. Instead, we validate candidate chains and use the resulting ledgers to call `CheckInFuture`, which returns the headers in the candidate fragment that are from the future. We truncate these headers from the fragment, record that they're from the future (`cdbFutureBlocks`), and repeat chain selection without them. Headers that are too far from the future, i.e., exceeding the max clock skew, are recorded as invalid blocks (with `InFutureExceedsClockSkew` as the `InvalidBlockReason`). For each new block we receive, we perform chain selection for all future blocks before performing chain selection for the new block. * Split off `CandidateSuffix` into a separate module and use it throughout chain selection instead of only partially. A `CandidateSuffix` is the number of headers to roll back the current chain + a fragment containing the new headers to add, like a diff w.r.t. the current chain. Previously, we converted such a `CandidateSuffix` to a `ChainAndLedger`, i.e., a fragment starting from the immutable tip (typically containing >= k headers) + a ledger matching the tip. Now, we stick to the `CandidateSuffix` until the end, when we actually install the candidate as the new chain by applying the diff. Also introduce `CandidateSuffixAndLedger` and use that instead of `ChainAndLedger` for the validated candidate. We still use `ChainAndLedger` for the current chain. * Simplify `trySwitchTo` because there is no concurrency thanks to the queue introduced in #1709. Remove the obsolete trace message `ChainChangedInBg`. * New trace messages: - `ChainSelectionForFutureBlock` - `CandidateContainsFutureBlocks` - `CandidateContainsFutureBlocksExceedingClockSkew` * Remove `chainSelectionPerformed` from `AddBlockPromise` as it was not really used and complicated our new handling of blocks from the future.

Previously, we knew the current slot and were able to tell that a block was from the future by comparing the block's slot against the current slot. For such blocks we would schedule a chain selection at the block's slot, which would be performed by a background thread. Now, we no longer know the current slot. Instead, we validate candidate chains and use the resulting ledgers to call `CheckInFuture`, which returns the headers in the candidate fragment that are from the future. We truncate these headers from the fragment, record that they're from the future (`cdbFutureBlocks`), and repeat chain selection without them. Headers that are too far into the future, i.e., exceeding the max clock skew, are not recorded in `cdbFutureBlocks`, but are recorded as invalid blocks (with `InFutureExceedsClockSkew` as the `InvalidBlockReason`). For each new block we receive, we perform chain selection for all future blocks before performing chain selection for the new block. * Rename `CandidateSuffix` to `ChainDiff`, split it off into a separate module, and use it throughout chain selection instead of only partially. A `ChainDiff` is the number of headers to roll back the current chain + a fragment containing the new headers to add, i.e., a diff w.r.t. the current chain. Previously, we converted such a `ChainDiff` to a `ChainAndLedger`, i.e., a fragment starting from the immutable tip (typically containing >= k headers) + a ledger matching the tip. Now, we stick to the `ChainDiff` until the end, when we actually install the candidate as the new chain by applying the diff. Also introduce `ValidatedChainDiff` and use that instead of `ChainAndLedger` for the validated candidate. We still use `ChainAndLedger` for the current chain. * Simplify `trySwitchTo` because there is no concurrency thanks to the queue introduced in #1709. Remove the obsolete trace message `ChainChangedInBg`. * New trace messages: - `ChainSelectionForFutureBlock` - `CandidateContainsFutureBlocks` - `CandidateContainsFutureBlocksExceedingClockSkew` * Remove `chainSelectionPerformed` from `AddBlockPromise` as it was not really used and complicated our new handling of blocks from the future. * Don't mark successors of an invalid block as invalid, as this is redundant, see why in `ChainDB.md`. This means we remove the `InChainAfterInvalidBlock` constructor of `InvalidBlockReason`. * Introduce `ChainSelEnv` to reduce the number of parameters to pass around.

1709: ChainDB: add blocks asynchronously r=edsko a=mrBliss Fixes #1463. Instead of adding blocks synchronously, they are now put into a queue, after which `addBlockAsync` returns an `AddBlockResult`, which can be used to wait until the block has been processed. A background thread will read the blocks from the queue and add them synchronously to the ChainDB. The queue is limited in size; when it is full, callers of `addBlockAsync` might still have to wait. With this asynchronous approach, threads adding blocks asynchronously can be killed without worries, the background thread processing the blocks synchronously won't be killed. Only when the whole ChainDB shuts down will that background thread get killed. But since there will be no more in-memory state, it can't get out of sync with the file system state. On the next startup, a correct in-memory state will be reconstructed from the file system state. By letting the BlockFetchClient add blocks asynchronously, we also get a 20-40% bulk chain sync speed-up in some microbenchmarks. Co-authored-by: Thomas Winant <[email protected]>

mrBliss added 2 commits February 27, 2020 13:17

IOLike: re-export link

9232004

Add lengthTBQueue

096a612

See `lengthTBQueueDefault` and `MonadSTMTxExtended` for more information.

mrBliss added the consensus issues related to ouroboros-consensus label Feb 27, 2020

mrBliss requested review from edsko and dcoutts February 27, 2020 12:39

mrBliss commented Feb 27, 2020

View reviewed changes

edsko mentioned this pull request Feb 27, 2020

Avoid deserialization of blocks in BlockFetchClient IntersectMBO/ouroboros-consensus#709

Open

mrBliss mentioned this pull request Dec 1, 2023

Chain DB: use addBlock queue as BlockCache IntersectMBO/ouroboros-consensus#708

Open

mrBliss force-pushed the mrBliss/chaindb-async-addblock branch from 8f9e6d7 to 6996372 Compare February 27, 2020 13:17

mrBliss commented Feb 27, 2020

View reviewed changes

edsko approved these changes Feb 27, 2020

View reviewed changes

mrBliss closed this Feb 27, 2020

mrBliss deleted the mrBliss/chaindb-async-addblock branch February 27, 2020 17:40

mrBliss mentioned this pull request Mar 11, 2020

ImmutableDB: use TempResourceRegistry in modifyOpenState #1787

Merged

nfrisby mentioned this pull request Mar 24, 2020

Refine the BF logic for updating the PeerFetchState with async block adds #1845

Closed

mrBliss mentioned this pull request Apr 27, 2020

Handle async exceptions better #1681

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChainDB: add blocks asynchronously #1709

ChainDB: add blocks asynchronously #1709

mrBliss commented Feb 27, 2020

edsko left a comment

edsko commented Feb 27, 2020

iohk-bors bot commented Feb 27, 2020

mrBliss commented Feb 27, 2020

ChainDB: add blocks asynchronously #1709

ChainDB: add blocks asynchronously #1709

Conversation

mrBliss commented Feb 27, 2020

edsko left a comment

Choose a reason for hiding this comment

edsko commented Feb 27, 2020

iohk-bors bot commented Feb 27, 2020

Build succeeded

mrBliss commented Feb 27, 2020