backupccl: memory monitored restore processor erroneously restores deleted data #103334

stevendanna · 2023-05-15T17:36:46Z

Describe the problem

The new memory-monitored restore processor attempts to limit the number of SSTs added to a single iterator. When a given span has more SSTs than the memory monitor would allow, it attempts to process one set of SSTs first, and then the second set.

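This, however, poses a problem for deletions. Currently, we do not write deletions directly; rather, we depend on the iterator never returning a deleted key. This assumption was correct when all SSTs related to a key were definitely in the same iterator. It is no longer correct when the SSTs for a given span can be split over multiple iterators: if the SST holding a key's older value and the SST holding its deletion end up in different iterators, the iterator that sees only the older value returns it, and nothing later removes it.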
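The failure mode can be illustrated with a toy model. This is only a sketch: the `entry` and `sst` types and the `restoreVisible` and `restoreInBatches` functions are hypothetical stand-ins invented for illustration and are not the backupccl types or APIs. It assumes a single iterator surfaces only the newest non-deleted version of each key, as described above.

```go
package main

import "fmt"

// entry is a toy stand-in for one versioned key in a backup SST.
// tombstone marks a deletion (hypothetical encoding for this sketch).
type entry struct {
	key       string
	ts        int
	value     string
	tombstone bool
}

// sst is a toy stand-in for one backup file.
type sst []entry

// restoreVisible merges the given SSTs the way a single iterator would:
// it keeps the newest version of each key and silently drops the key
// when that newest version is a tombstone.
func restoreVisible(ssts ...sst) map[string]string {
	newest := map[string]entry{}
	for _, s := range ssts {
		for _, e := range s {
			if cur, ok := newest[e.key]; !ok || e.ts > cur.ts {
				newest[e.key] = e
			}
		}
	}
	out := map[string]string{}
	for k, e := range newest {
		if !e.tombstone {
			out[k] = e.value
		}
	}
	return out
}

// restoreInBatches runs one iterator per batch of SSTs and writes each
// iterator's output into the same target. Deletions are never written.
func restoreInBatches(batches ...[]sst) map[string]string {
	target := map[string]string{}
	for _, b := range batches {
		for k, v := range restoreVisible(b...) {
			target[k] = v
		}
	}
	return target
}

func main() {
	full := sst{{key: "a", ts: 1, value: "v1"}, {key: "b", ts: 1, value: "v1"}}
	incr := sst{{key: "a", ts: 2, tombstone: true}} // "a" deleted later

	// One iterator over both SSTs: "a" stays deleted.
	fmt.Println(restoreVisible(full, incr)) // map[b:v1]

	// Two iterators: the first never sees the tombstone, so "a" comes back.
	fmt.Println(restoreInBatches([]sst{full}, []sst{incr})) // map[a:v1 b:v1]
}
```

In this model the second iterator sees only the tombstone for `a` and therefore emits nothing for it, so the value written by the first iterator survives.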

Proposed work to fix this

To fix this, we think we need to (1) ensure that all SSTs for a given layer are inside the same iterator and (2) change our usage of the iterator to raise deletion tombstones (both point and range keys) and then explicitly write those deletions during the restore process.

Unit test that reproduces the between-layer issue
Unit test that reproduces the in-layer issue
Pass layer information in RestoreSpanSpec
Ensure all SSTs from a single layer are added to a restore iterator
Raise point deletion tombstones in ReadAsOfIterator
Raise range tombstones in ReadAsOfIterator
Correctly write range tombstones during restore
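
The idea behind the proposed fix can be sketched in the same kind of toy model. The types and functions below (`entry`, `sst`, `newestWithTombstones`, `restoreInBatches`, `visible`) are hypothetical and invented for illustration; they are not the real ReadAsOfIterator or restore processor code, and range tombstones are not modeled. The sketch assumes each batch stands in for one layer, that the iterator raises point tombstones instead of hiding them, and that the restore writes them with newer-timestamp-wins semantics.

```go
package main

import "fmt"

// entry is a toy stand-in for one versioned key in a backup SST.
type entry struct {
	key       string
	ts        int
	value     string
	tombstone bool
}

// sst is a toy stand-in for one backup file.
type sst []entry

// newestWithTombstones merges one batch of SSTs and returns the newest
// version of every key, surfacing tombstones instead of dropping them.
func newestWithTombstones(ssts ...sst) map[string]entry {
	newest := map[string]entry{}
	for _, s := range ssts {
		for _, e := range s {
			if cur, ok := newest[e.key]; !ok || e.ts > cur.ts {
				newest[e.key] = e
			}
		}
	}
	return newest
}

// restoreInBatches runs one iterator per batch and writes every surfaced
// version, deletions included, into the target. A write only replaces an
// existing version when its timestamp is newer.
func restoreInBatches(batches ...[]sst) map[string]entry {
	target := map[string]entry{}
	for _, b := range batches {
		for k, e := range newestWithTombstones(b...) {
			if cur, ok := target[k]; !ok || e.ts > cur.ts {
				target[k] = e
			}
		}
	}
	return target
}

// visible returns what a reader of the restored data would see:
// every key whose newest restored version is not a tombstone.
func visible(target map[string]entry) map[string]string {
	out := map[string]string{}
	for k, e := range target {
		if !e.tombstone {
			out[k] = e.value
		}
	}
	return out
}

func main() {
	full := sst{{key: "a", ts: 1, value: "v1"}, {key: "b", ts: 1, value: "v1"}}
	incr := sst{{key: "a", ts: 2, tombstone: true}} // "a" deleted later

	// The tombstone is written explicitly, so "a" stays deleted even
	// though the two SSTs are handled by different iterators.
	fmt.Println(visible(restoreInBatches([]sst{full}, []sst{incr}))) // map[b:v1]
}
```

Because the deletion is written with its own timestamp, the result in this model does not depend on the order in which the batches are processed.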

See

Jira issue: CRDB-27949

Epic CRDB-28050

blathers-crl · 2023-05-15T17:44:56Z

cc @cockroachdb/disaster-recovery

stevendanna added the C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. label May 15, 2023

shermanCRL added the A-disaster-recovery label May 15, 2023

blathers-crl bot added the T-disaster-recovery label May 15, 2023

msbutler mentioned this issue May 19, 2023

backupccl: allow more restore workers for large nodes #103604

Open
