storage,kv: tolerate corruption of sideloaded sstables #91029

jbowens · 2022-10-31T22:04:53Z

If an AddSSTable's sstable is sideloaded and becomes corrupted (eg, due to a bad disk), the operator has no recourse other than to replace the node.

This issue is intended to track isolation of corruption of the raft log / sideloaded sstables, in contrast to #67568 which tracks recovery from corruption of already-applied state.

See #90834 for an example.

Jira issue: CRDB-21080

blathers-crl · 2022-10-31T22:04:56Z

cc @cockroachdb/replication

erikgrinaker · 2022-11-03T15:52:25Z

This is related to #75903, in that any failure to apply a Raft command will crash the node anyway. The proposed solution there is to cordon the replica (and then discard the faulty replica and upreplicate elsewhere, unless all replicas are faulty), which is likely the preferable approach here as well. That said, if the SST is corrupt then the disks are likely faulty, so we may not want to keep the node running and risk further corruption anyway.

github-actions · 2024-04-29T11:04:37Z

We have marked this issue as stale because it has been inactive for
18 months. If this issue is still relevant, removing the stale label
or adding a comment will keep it active. Otherwise, we'll close it in
10 days to keep the issue queue tidy. Thank you for your contribution
to CockroachDB!

github-actions bot added the no-issue-activity label Apr 29, 2024

jbowens removed the no-issue-activity label Apr 29, 2024

jbowens added this to [Deprecated] Storage Jun 4, 2024

jbowens moved this to Backlog in [Deprecated] Storage Jun 4, 2024

jlinder added the T-kv KV Team label Jun 28, 2024

jlinder removed the T-kv-replication label Jun 28, 2024

exalate-issue-sync bot removed the T-kv KV Team label Jul 16, 2024

github-project-automation bot added this to KV Aug 28, 2024

github-project-automation bot moved this to Incoming in KV Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

storage,kv: tolerate corruption of sideloaded sstables #91029

storage,kv: tolerate corruption of sideloaded sstables #91029

jbowens commented Oct 31, 2022 •

edited by cockroach-jira-scripts

Loading

blathers-crl bot commented Oct 31, 2022

erikgrinaker commented Nov 3, 2022

github-actions bot commented Apr 29, 2024

storage,kv: tolerate corruption of sideloaded sstables #91029

storage,kv: tolerate corruption of sideloaded sstables #91029

Comments

jbowens commented Oct 31, 2022 • edited by cockroach-jira-scripts Loading

blathers-crl bot commented Oct 31, 2022

erikgrinaker commented Nov 3, 2022

github-actions bot commented Apr 29, 2024

jbowens commented Oct 31, 2022 •

edited by cockroach-jira-scripts

Loading