storage: add WAL failover configuration #120509
Conversation
It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR? 🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.
Force-pushed from a20b128 to d62c7ae.
Reviewed 11 of 11 files at r1, 8 of 8 files at r2, all commit messages.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @itsbilal and @jbowens)
pkg/server/config.go
line 761 at r1 (raw file):
defer storeEnvs.CloseAll()
walFailoverConfig := storage.WALFailover(cfg.WALFailover, storeEnvs)

I don't see storage.WALFailover on master or in this commit.
pkg/storage/open.go
line 292 at r2 (raw file):
UnhealthyOperationLatencyThreshold: func() time.Duration { return walFailoverUnhealthyOpThreshold.Get(&cfg.Settings.SV) },
We should probably reduce FailoverOptions.UnhealthySamplingInterval to 25ms, given the default for the cluster setting is 100ms; polling every 25ms is cheap enough. Also, FailoverOptions.ElevatedWriteStallThresholdLag doesn't seem to have a default in Pebble (oversight?); something like 60s may be ok. And the HealthyInterval default of 2min is too long, I think; something like 15s seems more reasonable. Of course, these are all guesses without experimental validation under real conditions.
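For concreteness, here is a minimal, self-contained sketch of the values being proposed. The struct below only mirrors the Pebble FailoverOptions fields named in this thread rather than importing Pebble itself, and the 25ms/60s/15s numbers are the unvalidated guesses from the comment above, not verified defaults.

```go
package main

import (
	"fmt"
	"time"
)

// failoverOptions mirrors, for illustration only, the Pebble FailoverOptions
// fields discussed in this review; it is not Pebble's actual type.
type failoverOptions struct {
	// How often WAL health is sampled when deciding whether to fail over.
	UnhealthySamplingInterval time.Duration
	// Latency above which a single WAL operation is considered unhealthy. In
	// CockroachDB this is read from the storage.wal_failover.unhealthy_op_threshold
	// cluster setting (default 100ms), hence a func rather than a constant.
	UnhealthyOperationLatencyThreshold func() time.Duration
	// Extra write-stall headroom tolerated while running on the secondary.
	ElevatedWriteStallThresholdLag time.Duration
	// How long the primary must look healthy before failing back to it.
	HealthyInterval time.Duration
}

func main() {
	opts := failoverOptions{
		UnhealthySamplingInterval:          25 * time.Millisecond, // polling every 25ms is cheap enough
		UnhealthyOperationLatencyThreshold: func() time.Duration { return 100 * time.Millisecond },
		ElevatedWriteStallThresholdLag:     60 * time.Second, // no Pebble default; 60s suggested above
		HealthyInterval:                    15 * time.Second, // instead of the 2min default
	}
	fmt.Printf("sampling=%s healthy=%s\n", opts.UnhealthySamplingInterval, opts.HealthyInterval)
}
```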
This turned out very clean!
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @itsbilal and @jbowens)
Force-pushed from b3df667 to 96de542.
TFTR!
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @itsbilal and @sumeerbhola)
pkg/server/config.go
line 761 at r1 (raw file):
Previously, sumeerbhola wrote…
I don't see storage.WALFailover on master or in this commit.

you found it, right?
pkg/storage/open.go
line 292 at r2 (raw file):
Previously, sumeerbhola wrote…
We should probably reduce FailoverOptions.UnhealthySamplingInterval to 25ms, given the default for the cluster setting is 100ms; polling every 25ms is cheap enough. Also, FailoverOptions.ElevatedWriteStallThresholdLag doesn't seem to have a default in Pebble (oversight?); something like 60s may be ok. And the HealthyInterval default of 2min is too long, I think; something like 15s seems more reasonable. Of course, these are all guesses without experimental validation under real conditions.

Put up cockroachdb/pebble#3413 to change them there.
Reviewed 5 of 8 files at r4.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @itsbilal and @jbowens)
pkg/server/config.go
line 761 at r1 (raw file):
Previously, jbowens (Jackson Owens) wrote…
you found it, right?

yep. But isn't it in the wrong commit, in that CockroachDB won't build after the first commit?
TFTR!
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @itsbilal and @sumeerbhola)
pkg/server/config.go
line 761 at r1 (raw file):
Previously, sumeerbhola wrote…
yep. But isn't it in the wrong commit, in that CockroachDB won't build after the first commit?

Whoops, sorry, I somehow missed this. Moved it into the second commit and verified that both commits build.
bors r=sumeerbhola
Build failed (retrying...)
Build failed (retrying...)
bors r-
This looks like merge skew to me.
Canceled.
I believe the other commit within the batch (from #119885) contains the skew. Edit: They both did.

bors r=sumeerbhola
Add reference counting to fs.Envs, so that multiple Engines can take on references to an Env. The underlying Env's resources won't be released until all references are released. This will be used by WAL failover to ensure that an fs.Env used as a failover destination isn't closed prematurely.

Epic: none
Release note: none
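To illustrate the reference-counting scheme this commit message describes, here is a hypothetical sketch; the type and method names are invented for this example and are not CockroachDB's actual fs.Env API. Each Engine, including a WAL-failover destination, holds a reference, and the Env's resources are released only when the last reference is dropped.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// refCountedEnv is a hypothetical stand-in for a reference-counted fs.Env.
// Engines call Ref before using the Env and Unref when they close; the
// underlying resources are released only when the count reaches zero.
type refCountedEnv struct {
	refs    atomic.Int32
	release func() // releases the underlying resources
}

func newRefCountedEnv(release func()) *refCountedEnv {
	e := &refCountedEnv{release: release}
	e.refs.Store(1) // the creator holds the initial reference
	return e
}

// Ref adds a reference, e.g. when a WAL-failover destination adopts this Env.
func (e *refCountedEnv) Ref() { e.refs.Add(1) }

// Unref drops a reference and releases resources once no Engine holds one.
func (e *refCountedEnv) Unref() {
	if e.refs.Add(-1) == 0 {
		e.release()
	}
}

func main() {
	env := newRefCountedEnv(func() { fmt.Println("env resources released") })
	env.Ref()   // a second engine (the failover destination) takes a reference
	env.Unref() // first engine closes; resources stay live
	env.Unref() // last reference dropped; resources released now
}
```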
Introduce support for configuring a multi-store CockroachDB node to failover a store's write-ahead log (WAL) to another store's data directory. Failing over the write-ahead log may allow some operations against a store to continue to complete despite temporary unavailability of the underlying storage.

Customers must opt into WAL failover by passing `--wal-failover=among-stores` to `cockroach start` or setting the env var `COCKROACH_WAL_FAILOVER=among-stores`. On start, cockroach will assign each store another store to be its failover destination. Cockroach will begin monitoring the latency of all WAL writes. If latency to the WAL exceeds the value of the storage.wal_failover.unhealthy_op_threshold cluster setting, Cockroach will attempt to write WAL entries to its secondary store's volume. If a user wishes to disable WAL failover, they must restart the node setting `--wal-failover=disabled`.

Close cockroachdb#119418.
Informs cockroachdb/pebble#3230

Epic: CRDB-35401

Release note (ops change): Introduces a new start option (--wal-failover or COCKROACH_WAL_FAILOVER env var) to opt into failing over WALs between stores in multi-store nodes. Introduces a new storage.wal_failover.unhealthy_op_threshold cluster setting for configuring the latency threshold at which a WAL write is considered unhealthy.
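As a rough, self-contained illustration of the monitoring behavior the commit message describes, the sketch below observes WAL write latencies and switches to the secondary once a write exceeds the unhealthy-op threshold. All names here are hypothetical; the real logic lives in Pebble's WAL failover machinery, not in a struct like this.

```go
package main

import (
	"fmt"
	"time"
)

// walMonitor is a hypothetical illustration: if the latency of a WAL write
// exceeds the configured threshold (the storage.wal_failover.unhealthy_op_threshold
// cluster setting, 100ms by default), writes fail over to the secondary
// store's volume.
type walMonitor struct {
	unhealthyOpThreshold time.Duration
	usingSecondary       bool
}

// observeWrite records the latency of one WAL write and decides whether to
// fail over to the secondary.
func (m *walMonitor) observeWrite(latency time.Duration) {
	if !m.usingSecondary && latency > m.unhealthyOpThreshold {
		m.usingSecondary = true
		fmt.Printf("WAL write took %s (> %s); failing over to secondary\n",
			latency, m.unhealthyOpThreshold)
	}
}

func main() {
	m := &walMonitor{unhealthyOpThreshold: 100 * time.Millisecond}
	m.observeWrite(2 * time.Millisecond)   // healthy
	m.observeWrite(450 * time.Millisecond) // unhealthy: triggers failover
}
```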
Canceled.
bors r=sumeerbhola |
120783: storage: support WAL failover to an explicit path r=sumeerbhola a=jbowens

This commit expands on #120509, introducing a WAL failover mode that allows an operator of a node with a single store to configure WAL failover to failover to a particular path (rather than another store's directory). This is configured via the --wal-failover flag:

--wal-failover=path=/mnt/data2

When disabling or changing the path, the operator is required to pass the previous path. Eg,

--wal-failover=path=/mnt/data3,prev_path=/mnt/data2

or

--wal-failover=disabled,prev_path=/mnt/data2

Informs #119418.
Informs cockroachdb/pebble#3230

Epic: CRDB-35401

Release note (ops change): Adds an additional option to the new (in 24.1) --wal-failover CLI flag allowing an operator to specify an explicit path for WAL failover for single-store nodes.

Co-authored-by: Jackson Owens <[email protected]>
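The flag syntax described in that follow-up (among-stores, disabled, path=&lt;dir&gt;, plus the prev_path=&lt;dir&gt; qualifier required when changing or disabling an explicit path) could be parsed along the lines of the sketch below. The type, function, and error messages are illustrative assumptions, not CockroachDB's actual CLI code.

```go
package main

import (
	"fmt"
	"strings"
)

// walFailoverConfig is an illustrative representation of the --wal-failover
// flag values described above; it is not CockroachDB's actual config type.
type walFailoverConfig struct {
	Mode     string // "among-stores", "disabled", or "path"
	Path     string // set when Mode == "path"
	PrevPath string // required when disabling or changing an explicit path
}

// parseWALFailover parses values such as:
//
//	among-stores
//	path=/mnt/data2
//	path=/mnt/data3,prev_path=/mnt/data2
//	disabled,prev_path=/mnt/data2
func parseWALFailover(v string) (walFailoverConfig, error) {
	var cfg walFailoverConfig
	for i, part := range strings.Split(v, ",") {
		switch {
		case i == 0 && (part == "among-stores" || part == "disabled"):
			cfg.Mode = part
		case i == 0 && strings.HasPrefix(part, "path="):
			cfg.Mode, cfg.Path = "path", strings.TrimPrefix(part, "path=")
		case i > 0 && strings.HasPrefix(part, "prev_path="):
			cfg.PrevPath = strings.TrimPrefix(part, "prev_path=")
		default:
			return cfg, fmt.Errorf("malformed --wal-failover value %q", v)
		}
	}
	return cfg, nil
}

func main() {
	for _, v := range []string{
		"among-stores",
		"path=/mnt/data2",
		"disabled,prev_path=/mnt/data2",
	} {
		cfg, err := parseWALFailover(v)
		fmt.Println(v, "=>", cfg, err)
	}
}
```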