backupccl: fingerprint 15GB restore roachtests #99792

Merged

msbutler
Collaborator

Previously, restore roachtests had little ability to detect data corruption regressions across runs. This patch introduces this ability. Specifically, this commit allows the restore roachtest writer to easily run a stripped fingerprint after a restore, and assert a match to the hardcoded fingerprint in the test spec.

For now, the fingerprint check runs only on the restore roachtests that restore 15GB of data. The check takes about as long as the restore itself (around 3 minutes), so before we use it on larger tests, we ought to consider performance improvements to the fingerprinting tool. The 15GB tests are:

  • restore/nodeShutdown/coordinator
  • restore/pause/tpce/15GB/aws/nodes=4/cpus=8 (used to restore 80GB)
  • restore/tpce/15GB/aws/nodes=4/cpus=8 (new test)
  • restore/nodeShutdown/worker (used to restore 80GB)
  • restore/nodeShutdown/coordinator (used to restore 80GB)

This patch also changes the node shutdown tests and the paused restore test to run the smaller 15GB tpce fixture, as it speeds the test run up.

Informs #98779

Release note: none

@msbutler msbutler requested review from rhu713 and lidorcarmel March 28, 2023 12:25
@msbutler msbutler self-assigned this Mar 28, 2023
@msbutler msbutler requested a review from a team as a code owner March 28, 2023 12:25
@msbutler msbutler requested review from smg260 and renatolabs and removed request for a team March 28, 2023 12:25
@cockroach-teamcity
Member

This change is Reviewable

@msbutler msbutler force-pushed the butler-fingerprint-roachtest-restore branch from 1c903e4 to c284398 Compare March 28, 2023 13:00
@msbutler msbutler added the backport-23.1.x Flags PRs that need to be backported to 23.1 label Mar 28, 2023
@msbutler msbutler force-pushed the butler-fingerprint-roachtest-restore branch from c284398 to 2030b98 Compare March 30, 2023 11:59
@msbutler
Collaborator Author

TFTR!

bors r=rhu713

@craig
Contributor

craig bot commented Mar 30, 2023

Build failed (retrying...):

@craig
Contributor

craig bot commented Mar 30, 2023

Build succeeded:

@craig craig bot merged commit f9ac216 into cockroachdb:master Mar 30, 2023
Labels
backport-23.1.x Flags PRs that need to be backported to 23.1 T-disaster-recovery

3 participants