
[DNM] PIBD Task / Issue Tracker #3695

Merged: 22 commits merged into master on Oct 18, 2022

Conversation

@yeastplume (Member) commented Feb 21, 2022

This PR is a task tracker for PIBD work, which also handily displays all PIBD implementation related changes on the pibd_impl branch thus far. Outstanding tasks or issues that need investigation are outlined below, followed by a list of recently completed (and thus far unreviewed) PRs on the pibd_impl branch. Note that considerable work was done on the segmentation and network messaging side prior to this round of work.

This posting will be kept up to date as work progresses, feel free to discuss, ask questions or raise issues in the comments.

General Progress

  • Code for Segmentation and desegmentation based on an archive block header is in place, as well as a first version of all related peer and network functions.
  • Sync code is in place that performs a PIBD reconstruction of the txhashset instead of the txhashset.zip download when running on testnet only.
  • As of [PIBD_IMPL] PIBD Stats + Retry on validation errors #3694, the first rough-and-ready version is contained on the pibd_impl branch, and a node has successfully synced from scratch on testnet via PIBD.
  • As of [PIBD_IMPL] TxHashset Fallback Implementation #3704, I'm happy enough with the state of it and think it's progressed enough for review and merge into master (with PIBD enabled on testnet only at the moment, or possibly behind a config flag that defaults to running PIBD on testnet only).

RFCs

Outstanding issues

  • The segmenter code (all work performed before I picked this up) is currently sending a number of spent outputs within segments that need to be manually removed from the leaf set when reconstructing the tree. Determine why this is happening and whether this should be adjusted.
    • See first comment below for a history of this
  • Related to the above, determine if the segmenter is also sending spent rangeproofs, which would be extremely wasteful in terms of network activity.
    • See first comment below, short answer: Yes it is.
  • Determine the earlier reasoning behind why a seemingly redundant list of output positions needs to be sent as part of a segment, as this information should be available via the bitmap combined with segment size calculations.
  • Based on the above, decide if the existing protocol needs to be changed. Note this would delay deployment, as (Rust) nodes on the existing network already respond to PIBD segment requests according to the existing protocol outlined in the RFC. Note that Grin++ nodes don't have PIBD messages implemented, nor do they do any pruning.
  • Check that the process is able to properly continue if a segment request times out or doesn't complete. At the moment the networking layer keeps track of which segments it has already requested in order to prevent them being requested again; this list isn't currently being updated on a timeout or other request error, as far as I can tell.
  • Check and document threading assumptions. At the moment the entire process is simply continued via continue_pibd in the main sync thread, and each sync loop compiles a list of required PIBD segments and requests them if they haven't been requested (a sketch of this request tracking follows this list). This keeps things simple and within the existing design, which is good. However, a separate thread is also started to periodically check how 'complete' the trees are, and kick off validation when they're complete. This will be updated as per the issue below, but we still want to ensure this model will suit our needs.
    • In keeping with the existing sync design, this extra thread was removed and all requests + validation happen within the sync thread (with actual kernel checking + application in peer response threads). All seems to work well enough at the moment, and likely no need to complicate threading further unless a reason to do so is identified.
  • At the moment all 3 PMMRs are reconstructed, and only when they match the archive header is the whole thing validated. Much of the validation can obviously occur in a separate thread while waiting for segments (particularly rangeproof + kernel validation), and the body head can be updated as the process continues, allowing for partial rollbacks if errors occur or the horizon window rolls over. The DB will need to keep track of the state of the validation process.
    • To clarify, PMMR (i.e. segment Merkle-proof) validation happens during PMMR reconstruction, and rangeproof + kernel sig validation + kernel sum validation happen after the txhashset download is complete. After some experimentation, the continual locks on PMMRs during PIBD mean there doesn't appear to be much time regained by trying to multithread the rangeproof validation, so unless we run into severe performance issues it's probably best to leave the final RP + kernel sig validation to a separate step as we currently do.
  • The validation process thread isn't killed properly on server shutdown (note this is true for the txhashset validation method as well). Stopping the server during the (long) validation process requires the entire thing to be restarted on next startup. The validation process should be made more granular in keeping with the issue above.
    • Longer-running validation processes, such as rangeproof + kernel sig checking, now pay attention to the stop state, mitigating this somewhat (a sketch of this stop-state check also follows this list). This still restarts the entire validation if the process is killed, but I think it's usable in the current state.
  • Determine behaviour when the horizon window changes during the PIBD process. At the moment, the thinking is to roll back everything and start again, but there might be something more clever we can do here, particularly if we validate the txhashset state up to a certain block height as PIBD is occurring (i.e., we could continue the process using the updated archive header so long as the PMMRs reconstructed thus far are valid).
  • The bitmap accumulator downloaded by the PIBD process is currently not being stored anywhere, and is re-requested on every startup. This doesn't actually appear to be particularly undesirable as it's only 2 or 3 segments at the moment, but determine if we want to cache this for use between invocations (may have to consider future-proofing).
    • I think this is fine as is; the time added to the process is marginal and it helps ensure the node is using the correct bitmap on startup without any added complexity, which helps if someone stops and restarts their node over a horizon boundary change.
  • At the moment when any error is encountered while reconstructing the txhashset, all PMMRs are rolled back to the genesis entry and the entire process starts again. Determine if this is desirable or whether a partial rollback can be implemented.
  • Rollback of the prune list does not appear to ever have been implemented, the assumption being that it never had to be, as rollbacks never happened beyond the horizon. At the moment, when an error occurs in PIBD and the state of the txhashset is rolled back to block 0, the prune list state is retained. An extra function has currently been implemented to reset the prune list under these circumstances, but we need to determine whether a proper rollback is required, particularly if we want to support partial rollbacks when errors are found while reconstructing the txhashset.
  • Determine how/whether and under what conditions we want to 'give up' on PIBD during a sync, and revert to a txhashset.zip download
  • Verify behaviour when syncing the body after a node has been offline for a while: whereas previously it might have reconstructed the entire txhashset via txhashset.zip, now it can simply request the segments it needs and validate up to the new archive header. Investigate this.
  • Think about / determine number of peers we want to request segments from at a time
  • Peers are just chosen at random at the moment; determine whether this is okay.
  • Determine how and whether to ban a peer on receiving a bad segment
  • Determine how we want to display PIBD progress in the TUI. Right now it displays a notion of 'segments downloaded up to representative block x'; would it make more sense to display a total segment count here instead?
    • Currently displays total number of leaves, which is the only thing that makes sense in all cases given segment sizes can change.
  • Determine if segment height defaults can be left as is, or whether they need to be dynamically adjusted
    • Note: from comments in PMMR segment creation and validation #3453: There is an optimization in the above, if the requester can specify the segment size/height. Given the output bitmap they can determine which segments are empty and request larger segments (output and rangeproof) that cover multiple empty segments (or a combination of empty and adjacent non-empty segments) of the MMR.
  • Do we need flags in config file to turn off/on PIBD and/or force it?
  • Determination of whether and how NRD kernels need to be considered in the context of PIBD
  • Many TODOs throughout the code to deal with unwraps or clean up redundant code, etc
  • Much testing on testnet
  • Documentation / RFC of sync process when progress of all above is satisfactory
  • Much reviewing of all code before merging into master and turning on PIBD sync on mainnet
  • Much testing on mainnet
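
A quick sketch of the request tracking mentioned in the threading item above: each sync loop pass asks the desegmenter which segments it still needs and skips any already in flight, retrying them only after a timeout (the part noted above as not yet handled). Names and types here are hypothetical stand-ins rather than grin's actual API:

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Hypothetical identifier for a PIBD segment (tree + segment index);
/// stands in for grin's real segment identifier type.
#[derive(Clone, Hash, PartialEq, Eq)]
struct SegmentId {
    tree: u8, // e.g. 0 = bitmap, 1 = output, 2 = rangeproof, 3 = kernel
    idx: u64, // segment index within that tree
}

/// Tracks which segments are in flight so the sync loop doesn't re-request
/// them on every pass, but can retry them once a timeout has elapsed.
struct RequestTracker {
    in_flight: HashMap<SegmentId, Instant>,
    timeout: Duration,
}

impl RequestTracker {
    fn new(timeout: Duration) -> Self {
        Self { in_flight: HashMap::new(), timeout }
    }

    /// Returns true if the segment should be requested on this sync-loop pass.
    fn should_request(&mut self, id: &SegmentId) -> bool {
        match self.in_flight.get(id) {
            // A request is still pending and hasn't timed out yet.
            Some(sent) if sent.elapsed() < self.timeout => false,
            // Never requested, or the previous request timed out: (re)request it.
            _ => {
                self.in_flight.insert(id.clone(), Instant::now());
                true
            }
        }
    }

    /// Called from the peer response path when a segment actually arrives.
    fn mark_received(&mut self, id: &SegmentId) {
        self.in_flight.remove(id);
    }
}
```

Keeping this bookkeeping inside the single sync thread matches the existing design: only the peer response path (which would call something like mark_received here) runs outside it.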
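
Similarly, the stop-state mitigation noted above for long-running validation amounts to checking a shared shutdown flag between batches of expensive checks, so a server shutdown doesn't have to wait for the full rangeproof/kernel pass. A minimal sketch under that assumption (grin's actual code uses its own stop-state and validator types, not these):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

/// Run an expensive validation pass in batches, bailing out cleanly if a
/// shutdown has been requested. `validate_batch` stands in for rangeproof
/// or kernel-signature verification.
fn validate_in_batches<T>(
    items: &[T],
    batch_size: usize,
    stop: &AtomicBool,
    validate_batch: impl Fn(&[T]) -> Result<(), String>,
) -> Result<bool, String> {
    for chunk in items.chunks(batch_size) {
        if stop.load(Ordering::Relaxed) {
            // Shutdown requested: stop early. As noted above, the whole
            // validation currently restarts on the next startup.
            return Ok(false);
        }
        validate_batch(chunk)?;
    }
    Ok(true) // fully validated
}
```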

Completed (Mostly unreviewed but merged into pibd_impl branch)

* experimental addition of pibd download state for testnet only

* fixes to bitmap number of segments calculation + conversion of bitmap accumulator to bitmap

* attempt to call a test message

* add p2p methods for receiving bitmap segment and applying to desegmenter associated with chain

* fixes to state sync
* add functions to desegmenter to report next desired segments, begin to add state to determine which segments have been requested

* add segmentidentifier type to id requested segments uniquely

* make a call on where to keep track of which PIBD segments have been requested

* move segmenttype definition, add functions to manipulate peer segment list

* remove desegmenter state enum

* change chain desegmenter function to provide rwlock

* trace, warning cleanup

* update to test compilation
…ruction (#3689)

* application of received bitmap segments to local accumulator

* add all required elements to send/receive output segment requests and responses

* testing of output sync

* add special cases to pmmr segment request
* update pibd copy test to use new desegmenter structure

* begin reconstruction of output pmmr

* clean up hash/leaf insertion logic

* push pruned subtree appears to be working, now also calculates left hand hashes correctly

* factor out ordering of segment/hash order array

* refactor for pmmr application code

* test of chain copy appears to be working

* add rangeproof functions to desegmenter

* add kernel functions, attempt refactor

* small test cleanup, reconstruction of live chain working in manual copy test
…3691)

* add functions to determine latest verifiable block height for the given pibd state

* attempting to allow for pibd to resume after killing process

* fix to ensure prune list is properly flushed during pibd sync

* removal of unneeded code

* ignore test for now (fix before full merge)
…#3692)

* investigations as to why a slight rewind is needed on startup during PIBD

* move validation code into desegmenter validation thread (for now)

* ensure genesis entries in pmmrs are removed if they're removed in the first segment

* validation all working except for verifying kernel sums

* remove unneeded pmmr rollbacks on resume now root cause was found

* updates to remove unpruned leaves from leaf set when rebuilding pmmr

* remove + 1 to segment traversal iter length
* start to add stats and reset chain state after errors detected

* add functions to reset prune list when resetting chain pibd state

* debug statement

* remove test function
@yeastplume changed the title from [DNM] PIBD Task Tracker to [DNM] PIBD Task / Issue Tracker on Feb 21, 2022
@yeastplume (Member, Author) commented Feb 23, 2022

Some thinking and GitHub archaeology around the 'pruning coupling' issue above:

  • Compaction originally removed all pruned leaves

  • Merkle proofs desired originally to allow spending of timelocked coinbase outputs without having access to the full block:

  • Merkle proof implementation here

    • Merkle Proofs  #716
    • Spending a coinbase output requires the block hash and merkle proof to be provided in the input (to verify coinbase maturity without requiring full block)
  • In the original implementation, where all pruned leaves were removed, it was impossible to reliably generate a Merkle proof for a coinbase after a rewind.

  • It seems that the alternative solution could have been to store hashes of pruned leaves with siblings instead of retaining all data.

  • In the current code, the wallet is not required to provide Merkle proofs to spend coinbase outputs; the node checks their maturity by looking up the height in the chain instead (made possible as output_mmr_size is now committed to in the block header).

  • So the Merkle proofs and 'prune coupling' changes are no longer needed at this stage.

  • Merkle proofs are, however, needed for PIBD, but these could also be generated from hashes instead of stored siblings. They are also beyond the rollback horizon, so there shouldn't be any rewind concerns.

  • Other (possibly relevant) PRs:

  • Questions are:

    • Will Merkle proofs be required in the 'rewindable' portion of the chain (for which original blocks are stored in the DB anyhow, and from which hashes can be reconstructed if absolutely required)?
    • Are there any properties of a rewind, or cases, in which proofs can't be generated from hashes (or, in a pinch, reconstructed from the block DB)?
  • So to summarize:

    • Coupling behaviour was implemented to allow for merkle proofs in all cases, with the intention that Merkle proofs would be used to validate maturity of coinbase outputs.
    • It's not clear to me whether the decision to keep sibling data around, as opposed to storing hashes for pruned siblings, was made for any reason other than ease of implementation.
    • With output_mmr_size added to the block header, coinbase maturity could be validated by nodes without wallets supplying a Merkle proof.
    • Merkle proofs unused by anything until the advent of PIBD.
    • Rollback or rewind concerns won't be an issue with PIBD as it all happens beyond the horizon, so it's difficult to see why we can't just store and transmit a hash instead of complete leaf data for a pruned node with an unpruned sibling (see the sketch after this list).
  • Based on the above, I can only really conclude that the 'pruning coupling' behaviour is legacy behaviour, which may have had less of an impact when the only adverse implication was (cheap) local node storage. However, with PIBD this choice affects the amount of network data that has to be sent, particularly unneeded rangeproofs.

  • PIBD segmenter sends these unpruned siblings in their segment data, and the recipient needs to apply and then immediately prune them based on their presence in the output bitmap.

  • The coupling behaviour is fairly deep-rooted in the implementation, and changing it would be very time consuming, with an RFC likely needed.

  • Caveat that I could be missing something.
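
To make the hashes-vs-full-siblings point concrete, here is a minimal, self-contained toy (a two-leaf tree with a stand-in hash function, not grin's PMMR code): generating or verifying a Merkle proof for an unspent leaf only ever consumes the hash of its pruned sibling, so retaining and transmitting the sibling's full output and rangeproof data buys nothing for proof purposes.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Toy hash standing in for the MMR's real hash function.
fn h<T: Hash>(t: &T) -> u64 {
    let mut s = DefaultHasher::new();
    t.hash(&mut s);
    s.finish()
}

fn hash_leaf(data: &[u8]) -> u64 { h(&data) }
fn hash_parent(left: u64, right: u64) -> u64 { h(&(left, right)) }

fn main() {
    // Leaf `a` is unspent; its sibling `b` has been spent (pruned).
    let a: &[u8] = b"unspent output + rangeproof";
    let b: &[u8] = b"spent output + rangeproof";

    let root = hash_parent(hash_leaf(a), hash_leaf(b));

    // Proving `a` against `root` needs only the hash of `b`, never its
    // full leaf data; storing/sending that hash would be sufficient.
    let sibling_hash = hash_leaf(b);
    assert_eq!(hash_parent(hash_leaf(a), sibling_hash), root);
    println!("proof for `a` verifies with only the sibling hash");
}
```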

…3696)

* cleanup of segment request list

* allow for more simultaneous requests during state sync

* up number of simultaneous peer requests for segments
@DavidBurkett (Contributor) commented

Is this ready for review, or should I wait until the outstanding issues are resolved?

@yeastplume (Member, Author) commented

> Is this ready for review, or should I wait until the outstanding issues are resolved?

The code as it stands is nowhere near ready for proper review, but everything about PIBD and all issues listed above are ripe for discussion and comment.

…pagation (#3698)

* change pibd stat display to show progress as a percentage of downloaded leaves

* attempt some inline rp validation

* propagate shutdown state through kernel validation

* change validation loop timing

* simplify validator threading

* add more detailed tracking of kernel history validation to tui, allow stop state during

* adding more stop state + tui progress indication

* remove progressive validate

* test fix
* ensure desegmenter attempts to apply correct block after a resume

* ensure txhashset's committed implementation takes into account output bitmap for summing purposes

* remove check to de-apply outputs during segment application

* return removal of spent outputs during pibd

* remove unneeded status

* remove unneeded change to rewind function
* fix for writing / calculating incorrect length for negative indices

* update capabilities with new version of PIBD hist

* remove incorrect comment

* fix capabilities flag, trace output

* test fix
* update Cargo.lock for next release

* visibility scope tweaks to aid seed test utilities (#3707)
@tromp (Contributor) commented Oct 17, 2022

I think this is ready for merging into master, with PIBD used only for testnet (unless custom compiled). We'll encourage more people to run testnet nodes, and stop them from time to time for a few weeks. Maybe with regular forum posts to remind people.

@yeastplume merged commit 030bd0e into master on Oct 18, 2022
@phyro mentioned this pull request on Feb 24, 2023
bayk added a commit to mwcproject/mwc-node that referenced this pull request Jun 23, 2024
* [PIBD_IMPL] Introduce PIBD state into sync workflow (mimblewimble#3685)
* experimental addition of pibd download state for testnet only
* fixes to bitmap number of segments calculation + conversion of bitmap accumulator to bitmap
* attempt to call a test message
* add p2p methods for receiving bitmap segment and applying to desegmenter associated with chain
* fixes to state sync
* add pibd receive messages to network, and basic calls to desegmenter from each (mimblewimble#3686)
* [PIBD_IMPL] PIBD Desegmenter State (mimblewimble#3688)
* add functions to desegmenter to report next desired segments, begin to add state to determine which segments have been requested
* add segmentidentifier type to id requested segments uniquely
. . . and more . . .