backupccl: add verify_backup_table_data option to RESTORE #86136

msbutler · 2022-08-15T13:23:22Z

Release note (sql change): this patch adds the verify_backup_table_data flag to
RESTORE. When the user passes this flag, along with the required schema_only
flag, a schema_only RESTORE will get run and all user data will get read from
external storage, checksummed, and disarded before getting written to disk.

This flag provides two additional validation steps that a regular schema_only
RESTORE and a SHOW BACKUP with check_files cannot provide: This RESTORE
verifies that all data can get read and rekeyed to the Restoring Cluster, and
that all data passes a checksum check.

Release justification: low risk, high impact change to improve restore
validation

cockroach-teamcity · 2022-08-15T13:23:32Z

This change is

dt · 2022-08-18T22:45:17Z

pkg/sql/exec_util.go

@@ -1612,6 +1612,9 @@ type BackupRestoreTestingKnobs struct {
 	// testing. This is typically the bulk mem monitor if not
 	// specified here.
 	BackupMemMonitor *mon.BytesMonitor
+
+	// RecoverFromIterClosePanic prevents the node from panicing during ReadAsOfIterator.Close
+	RecoverFromIterPanic bool


as discussed offline, we shouldn't need this if we add a no-panic option to pebble iter

planning to rebase on #86423 which will have the fix.

dt · 2022-08-18T22:49:05Z

pkg/ccl/backupccl/restore_data_processor.go

 	// sendIter sends a multiplexed iterator covering the currently accumulated files over the
 	// channel.
 	sendIter := func(iter storage.SimpleMVCCIterator, dirsToSend []cloud.ExternalStorage) error {
 		readAsOfIter := storage.NewReadAsOfIterator(iter, rd.spec.RestoreTime)

 		cleanup := func() {
+			if recoverFromIterPanic {


as discussed offline, we shouldn't need this once we fix the pebble iterator.

pkg/ccl/backupccl/restore_data_processor.go

dt · 2022-08-18T22:54:33Z

pkg/ccl/backupccl/restore_job.go

+		return nil, nil, nil, err
+	}
+	if !backupCodec.TenantPrefix().Equal(p.ExecCfg().Codec.TenantPrefix()) {
+		// Ensure old processors fail if this is a previously unsupported restore of


I think this x-version code isn't needed anymore, but I don't remember if we rely on the zero rekey for anything else

I grepped for rekey.OldID == 0 ,poison-pill, and execinfrapb.TableRekey{} throughout the codebase and no other application turned up.

Given that all 22.1 processors know how to rekey, I think it's safe remove introducing the poison pill in restore_job.go. 22.2 processors still need to filter over the poison pill if a 22.1 processor planned the job.

God forbid if a RESTORE can run over 2 version upgrades.

pkg/ccl/backupccl/restore_job.go

This small refactor pushes tenant rekeying logic from the main restore_job Resume() function into createImportingDescriptors. Release note: None

Release note (sql change): this patch adds the verify_backup_table_data flag to RESTORE. When the user passes this flag, along with the required schema_only flag, a schema_only RESTORE will get run _and_ all user data will get read from external storage, checksummed, and disarded before getting written to disk. This flag provides two additional validation steps that a regular schema_only RESTORE and a SHOW BACKUP with check_files cannot provide: This RESTORE verifies that all data can get read and rekeyed to the Restoring Cluster, and that all data passes a checksum check. Release justification: low risk, high impact change to improve restore validation

msbutler · 2022-08-21T14:46:04Z

bors r=dt

craig · 2022-08-21T15:59:50Z

Build succeeded:

Bazel Essential CI (Cockroach)

msbutler self-assigned this Aug 15, 2022

msbutler changed the title ~~Butler refactor tenant rekey~~ backupccl: add verify_backup_table_data option to RESTORE Aug 15, 2022

msbutler requested review from rhu713 and dt August 15, 2022 14:23

msbutler marked this pull request as ready for review August 15, 2022 14:23

msbutler requested review from a team as code owners August 15, 2022 14:23

msbutler requested a review from a team August 15, 2022 14:23

msbutler requested review from a team as code owners August 15, 2022 14:23

msbutler force-pushed the butler-refactor-tenant-rekey branch 2 times, most recently from 8b4579a to ca97c04 Compare August 18, 2022 15:09

dt reviewed Aug 18, 2022

View reviewed changes

dt approved these changes Aug 19, 2022

View reviewed changes

msbutler force-pushed the butler-refactor-tenant-rekey branch 2 times, most recently from 6c7b433 to febc9aa Compare August 19, 2022 22:48

msbutler added 2 commits August 20, 2022 12:57

backupccl: push tenant rekeying planning into createImportingDescriptors

96c9491

This small refactor pushes tenant rekeying logic from the main restore_job Resume() function into createImportingDescriptors. Release note: None

msbutler force-pushed the butler-refactor-tenant-rekey branch from febc9aa to 58df28d Compare August 20, 2022 18:57

craig bot merged commit 8ed07d1 into cockroachdb:master Aug 21, 2022

cockroach-teamcity mentioned this pull request Aug 21, 2022

backupccl: add verify_backup_table_data option to RESTORE cockroachdb/docs#14900

Closed

msbutler deleted the butler-refactor-tenant-rekey branch August 21, 2022 16:43

msbutler mentioned this pull request Aug 24, 2022

backupccl: add verify_backup_table_data option to RESTORE #83671

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

backupccl: add verify_backup_table_data option to RESTORE #86136

backupccl: add verify_backup_table_data option to RESTORE #86136

msbutler commented Aug 15, 2022 •

edited

Loading

cockroach-teamcity commented Aug 15, 2022

dt Aug 18, 2022

msbutler Aug 19, 2022

dt Aug 18, 2022

dt Aug 18, 2022

msbutler Aug 19, 2022

msbutler commented Aug 21, 2022

craig bot commented Aug 21, 2022

backupccl: add verify_backup_table_data option to RESTORE #86136

backupccl: add verify_backup_table_data option to RESTORE #86136

Conversation

msbutler commented Aug 15, 2022 • edited Loading

cockroach-teamcity commented Aug 15, 2022

dt Aug 18, 2022

Choose a reason for hiding this comment

msbutler Aug 19, 2022

Choose a reason for hiding this comment

dt Aug 18, 2022

Choose a reason for hiding this comment

dt Aug 18, 2022

Choose a reason for hiding this comment

msbutler Aug 19, 2022

Choose a reason for hiding this comment

msbutler commented Aug 21, 2022

craig bot commented Aug 21, 2022

msbutler commented Aug 15, 2022 •

edited

Loading