migration,backupccl: deal with full cluster backup/restore and migration jobs #60307

ajwerner · 2021-02-10T05:10:06Z

Describe the problem

In #59760 we introduce a job to run cluster version migrations. This is great because it offers many benefits of the jobs ecosystem, like pausing and mutual exclusion. A downside is that a full cluster backup is going to pick up those jobs, which may not be safe to resume on the restored cluster.

Possible Solution

I'm thinking a simple solution would be to encode the cluster ID of the creating cluster into the job. Then, when the job is resumed on a different cluster, it can just error out.

Epic CRDB-8816

Jira issue: CRDB-3191

ajwerner · 2021-03-22T15:08:42Z

@dt should this be a GA blocker? I just realized this was missing triage labels :(

dt · 2021-03-22T22:33:56Z

I don't think so. We won't restore a newer cluster version to an older cluster thanks to #61737, so it'd just be restoring an older one, and then it'd migrate up, which is probably what we want. There may be some weirdness if there are system tables that are not restored, so they're post-migration and then we run that migration again, but so far all migrations have had to make themselves idempotent anyway.

There's still the issue of non-cluster restore and bringing a non-migrated table into a migrated cluster, but I believe we're no worse off than we were before 21.1, in that we still expect authors of such migrations to fix it during restore.

github-actions · 2023-09-05T11:07:57Z

We have marked this issue as stale because it has been inactive for
18 months. If this issue is still relevant, removing the stale label
or adding a comment will keep it active. Otherwise, we'll close it in
10 days to keep the issue queue tidy. Thank you for your contribution
to CockroachDB!

ajwerner added the C-bug label (Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.) Feb 10, 2021

ajwerner mentioned this issue Feb 10, 2021

migration,jobs: refactor long-running migrations and hook up job #59760

Merged

pbardea added the A-disaster-recovery label Mar 29, 2021

mwang1026 added the T-disaster-recovery label May 19, 2021

github-actions bot added the no-issue-activity label Sep 5, 2023

github-actions bot added the X-stale label Sep 18, 2023

github-actions bot closed this as not planned Sep 18, 2023

exalate-issue-sync bot closed this as completed Sep 18, 2023

github-project-automation bot added this to Disaster Recovery Backlog Aug 28, 2024

github-project-automation bot moved this to Done in Disaster Recovery Backlog Aug 28, 2024
