storage: fatal on corruption encountered in background #102252

jbowens · 2023-04-25T15:56:42Z

Previously, on-disk corruption would only fatal the node if an iterator observed it. Corruption encountered by a background job like a compaction would not fatal the node. This could result in busy churning through compactions that repeatedly fail, impacting cluster stability and user query latencies.

Now, on-disk corruption results in immediately exiting the node.

Epic: none
Fixes: #101101
Release note (ops change): When local corruption of data is encountered by a background job, a node will now exit immediately.

blathers-crl · 2023-04-25T15:56:45Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.

cockroach-teamcity · 2023-04-25T15:56:50Z

This change is

RaduBerinde

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @itsbilal)

jbowens · 2023-04-25T17:57:49Z

TFTR!

bors r=RaduBerinde

craig · 2023-04-25T18:45:46Z

Build succeeded:

Bazel Essential CI (Cockroach)

jbowens added the backport-22.2.x and backport-23.1.x labels Apr 25, 2023

jbowens requested a review from a team as a code owner April 25, 2023 15:56

jbowens requested a review from itsbilal April 25, 2023 15:56

RaduBerinde approved these changes Apr 25, 2023

craig bot merged commit 605382e into cockroachdb:master Apr 25, 2023

This was referenced Apr 25, 2023

release-22.2: storage: fatal on corruption encountered in background #102273

Merged

release-23.1: storage: fatal on corruption encountered in background #102274

Merged

jbowens deleted the fatal-corruption branch April 25, 2023 20:04

cockroach-teamcity mentioned this pull request Apr 26, 2023

PR #102252 - storage: fatal on corruption encountered in background cockroachdb/docs#16845

Open
