Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

backupccl: flake in TestClusterRestoreFailCleanup/retry-during-custom-system-table-restore/settings #93460

Closed
yuzefovich opened this issue Dec 12, 2022 · 5 comments
Labels
C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-disaster-recovery

Comments

@yuzefovich
Copy link
Member

yuzefovich commented Dec 12, 2022

On this bors run.

=== RUN   TestClusterRestoreFailCleanup/retry-during-custom-system-table-restore/settings
*
* ERROR: a panic has occurred!
* expected *DOid, found tree.dNull
* (1) attached stack trace
*   -- stack trace:
*   | runtime.gopanic
*   | 	GOROOT/src/runtime/panic.go:884
*   | [...repeated from below...]
* Wraps: (2) assertion failure
* Wraps: (3) attached stack trace
*   -- stack trace:
*   | github.com/cockroachdb/cockroach/pkg/sql/sem/tree.MustBeDOid
*   | 	github.com/cockroachdb/cockroach/pkg/sql/sem/tree/datum.go:5244
*   | github.com/cockroachdb/cockroach/pkg/ccl/backupccl.roleIDSeqRestoreFunc
*   | 	github.com/cockroachdb/cockroach/pkg/ccl/backupccl/system_schema.go:362
*   | github.com/cockroachdb/cockroach/pkg/ccl/backupccl.(*restoreResumer).restoreSystemTables.func3
*   | 	github.com/cockroachdb/cockroach/pkg/ccl/backupccl/restore_job.go:2947
...

Jira issue: CRDB-22347

@yuzefovich yuzefovich added the C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. label Dec 12, 2022
@blathers-crl
Copy link

blathers-crl bot commented Dec 12, 2022

cc @cockroachdb/disaster-recovery

@msbutler
Copy link
Collaborator

pretty sure this is #92848. I saw a similar error here #91991 (comment)

@rytaft
Copy link
Collaborator

rytaft commented Dec 19, 2022

This test flaked again in a slightly different way:

Failed
=== RUN   TestClusterRestoreFailCleanup
    test_log_scope.go:161: test logs captured to: /artifacts/tmp/_tmp/a3d5b6281df336d2ed5d8f2bd8c98231/logTestClusterRestoreFailCleanup3603781448
    test_log_scope.go:79: use -show-logs to present logs inline
    full_cluster_backup_restore_test.go:618: error executing 'BACKUP TO 'nodelocal://1/missing-ssts'': pq: failed to run backup: writing forward-compat metadata sst: pebble: keys must be added in strictly increasing order: "stats/\x8c\x88"/0,0#0,SET, "stats/\x8c\x88"/0,0#0,SET
    testutils.go:199: no Invalid Descriptors
    panic.go:522: -- test log scope end --
test logs left over in: /artifacts/tmp/_tmp/a3d5b6281df336d2ed5d8f2bd8c98231/logTestClusterRestoreFailCleanup3603781448
--- FAIL: TestClusterRestoreFailCleanup (227.24s)

Let me know if you want me to open a separate issue.

@msbutler
Copy link
Collaborator

aha, just fixed that failure mode here #93837. Ty for flagging!

@stevendanna
Copy link
Collaborator

I'm going to close this as we've fixed the issues implicated here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-disaster-recovery
Projects
No open projects
Archived in project
Development

No branches or pull requests

4 participants