The output_pos index not being cleaned up correctly during compaction #2606

antiochp · 2019-02-19T16:53:40Z

During pruning/compaction we read raw bytes from the data file and use them to clean up the output_pos index.

For each pruned chunk of bytes we attempt to call delete_output_pos, but these bytes do not simply represent a commitment; they represent an output_identifier.
The chunk of bytes is actually empty...
It is not actually safe to do this, as we can have duplicate outputs in the TXO set if one is spent and one is unspent. In this scenario it is not safe to clean up the output_pos index based purely on spent outputs.
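
To make the duplicate-output hazard concrete, here is a minimal sketch. It assumes the index behaves like a map from commitment to the latest MMR position; the `Commit` alias, the `HashMap` stand-in and both cleanup functions are hypothetical illustrations, not grin's actual types or the fix that was merged.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a commitment (not grin's real type).
type Commit = [u8; 4];

/// Naive cleanup: drop the index entry for every pruned (spent) commitment.
fn naive_cleanup(index: &mut HashMap<Commit, u64>, pruned: &[(Commit, u64)]) {
    for (commit, _pos) in pruned {
        index.remove(commit);
    }
}

/// Position-aware cleanup: drop the entry only if it still points at the
/// pruned position, so a newer unspent duplicate is left alone.
fn position_aware_cleanup(index: &mut HashMap<Commit, u64>, pruned: &[(Commit, u64)]) {
    for (commit, pos) in pruned {
        if index.get(commit) == Some(pos) {
            index.remove(commit);
        }
    }
}

/// The scenario from this issue: the same commitment appears at pos 5 (spent)
/// and pos 9 (unspent). The index keeps the latest position; pos 5 is pruned.
fn duplicate_scenario() -> (HashMap<Commit, u64>, Vec<(Commit, u64)>) {
    let c: Commit = [0xAA; 4];
    let mut index = HashMap::new();
    index.insert(c, 5);
    index.insert(c, 9); // overwrites the entry for the spent duplicate
    (index, vec![(c, 5)])
}
```

With the naive version the entry for the unspent output at pos 9 disappears as well, which is the failure mode described above.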


antiochp · 2019-02-19T17:04:29Z

I'm kind of tempted to get rid of the callback mechanism entirely and just brute force it via rebuild_index every time we compact the chain.

See #2607
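
A rough sketch of the brute-force idea, assuming the outputs can be walked as a flat list with a spent flag. `OutputEntry`, `Commit` and `rebuild_output_pos` are hypothetical names for illustration and do not mirror the real rebuild_index signature.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a commitment (not grin's real type).
type Commit = [u8; 4];

/// Hypothetical flattened view of one output in the output MMR.
struct OutputEntry {
    commit: Commit,
    pos: u64,
    spent: bool,
}

/// Throw the old index away and rebuild it from the unspent outputs only.
/// Spent duplicates never enter the new index, so nothing has to be deleted.
fn rebuild_output_pos(outputs: &[OutputEntry]) -> HashMap<Commit, u64> {
    outputs
        .iter()
        .filter(|o| !o.spent)
        .map(|o| (o.commit, o.pos))
        .collect()
}
```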

Or do something like delete_peers() in the p2p store, where we iterate over entries in the db by prefix and remove those that meet some defined criteria (in this case, spent outputs).

This isn't really any more "brute force" than how we currently iterate over the entire data file, pruning as we go.
We'd basically just do two passes: one to prune the file, then another over the output_pos index (which grows with the UTXO set) to check whether each entry is still valid and remove it where necessary.
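
The second pass could look roughly like the sketch below. It assumes the index is an in-memory map and that the set of still-unspent positions is available after the file has been pruned; `prune_output_pos` and `unspent_positions` are hypothetical names, and a real implementation would iterate db entries by prefix instead.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical stand-in for a commitment (not grin's real type).
type Commit = [u8; 4];

/// Second pass: walk the whole index and drop every entry whose position is
/// no longer an unspent output. Returns the number of entries removed.
fn prune_output_pos(
    index: &mut HashMap<Commit, u64>,
    unspent_positions: &HashSet<u64>,
) -> usize {
    let before = index.len();
    index.retain(|_commit, pos| unspent_positions.contains(&*pos));
    before - index.len()
}
```

Because validity is checked per index entry rather than per pruned chunk, an entry pointing at an unspent duplicate survives even when an older copy of the same commitment was spent.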

antiochp added the bug label Feb 19, 2019

antiochp added this to the 1.0.2 milestone Feb 19, 2019

antiochp self-assigned this Feb 19, 2019

This was referenced Feb 20, 2019

[WIP] problem identified with clean_output_index callback #2603

Closed

Simplify (and fix) output_pos cleanup during chain compaction #2609

Merged

ignopeverell modified the milestones: 1.0.2, 1.0.3 Feb 25, 2019

antiochp closed this as completed in #2609 Feb 27, 2019
