Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

backupccl: TestBackupRestoreJobTagAndLabel fails under race when running in a tenant #107336

Closed
stevendanna opened this issue Jul 21, 2023 · 1 comment · Fixed by #107412
Closed
Assignees
Labels
C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-multitenant Issues owned by the multi-tenant virtual team

Comments

@stevendanna
Copy link
Collaborator

stevendanna commented Jul 21, 2023

Describe the problem

This test was recently enable for tenant testing. It has since occasionally failed under race with an assertion failure:

I230721 09:57:17.481189 134873 1@kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2151  the server is terminating due to a fatal error (see the DEV channel for details)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152  observed raft log position less than per-stream lower bound (log-position=0/0 <= log-position=6/3189)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !goroutine 134873 [running]:
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !runtime/debug.Stack()
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	GOROOT/src/runtime/debug/stack.go:24 +0x72
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/util/log.(*loggerT).outputLogEntry(0xc015df4f60, {{{0xc00cc48690, 0x24}, {0x968dfdd, 0x1}, {0x968dfdd, 0x1}, {0x968dfdd, 0x1}}, 0x1773d981fb22f376, ...})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/util/log/clog.go:261 +0xb7
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/util/log.logfDepthInternal({0xd76a340, 0xc00a858fc0}, 0x2, 0x4, 0x0, 0x0, {0x97c2cf1, 0x46}, {0xc034597700, 0x2, ...})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/util/log/channels.go:106 +0x6e6
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/util/log.logfDepth(...)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/util/log/channels.go:39
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/util/log.Fatalf({0xd76a340, 0xc00a858fc0}, {0x97c2cf1, 0x46}, {0xc034597700, 0x2, 0x2})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/bazel-out/k8-fastbuild/bin/pkg/util/log/log_channels_generated.go:848 +0x9d
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowtokentracker.(*Tracker).Track(0xc003ce3110, {0xd76a340, 0xc00a858fc0}, 0x0, 0x331, {0xc014808e50?, 0x4?})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 +0x85b
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowhandle.(*Handle).deductTokensForInner(0xc009a744d0, {0xd76a340, 0xc00a858fc0}, 0x0?, {0x13860320?, 0xc018493380?}, 0x331)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowhandle/kvflowhandle.go:162 +0x394
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowhandle.(*Handle).DeductTokensFor(0xc009a744d0, {0xd76a340, 0xc00a858fc0}, 0x77?, {0x2c?, 0xc014808f30?}, 0x1?)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/kvflowcontrol/kvflowhandle/kvflowhandle.go:141 +0x72
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.maybeDeductFlowTokens({0xd76a340, 0xc008f8fcb0}, {0xd7b08c0, 0xc009a744d0}, {0xc0252d2700, 0x12, 0x12}, {0xc01e231880, 0x12, 0x12})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_proposal_buf.go:1049 +0x2e5
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.proposeBatch({0xd76a340, 0xc008f8fcb0}, {0x7f52e49b2b00, 0xc004f12d80}, {0xd78a0f0, 0xc00cb32870}, {0xc01e231880, 0x12, 0x12}, {0xc0252d2700, ...}, ...)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_proposal_buf.go:1026 +0x24f
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*propBuf).FlushLockedWithRaftGroup(0xc004f135d0, {0xd76a340, 0xc008f8fcb0}, {0xd78a0f0, 0xc00cb32870})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_proposal_buf.go:619 +0x1c97
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*Replica).handleRaftReadyRaftMuLocked.func2(0xc00ccbacb8?)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_raft.go:775 +0xfd
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*Replica).withRaftGroupLocked(0xc004f12d80, 0xc01480a760)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_raft.go:1988 +0xae
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*Replica).handleRaftReadyRaftMuLocked(_, {_, _}, {{0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, ...}, ...})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_raft.go:772 +0x39b
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*Replica).handleRaftReady(_, {_, _}, {{0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, ...}, ...})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/replica_raft.go:728 +0x210
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*Store).processReady(0xc009758000, 0xd76a340?)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/store_raft.go:670 +0x1d0
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*raftSchedulerShard).worker(0xc008285500, {0xd76a340, 0xc01be7f1a0}, {0xd74d9b0, 0xc009758000}, 0xc0096df500)
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/scheduler.go:418 +0x2f6
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/kv/kvserver.(*raftScheduler).Start.func2({0xd76a340, 0xc01be7f1a0})
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/kv/kvserver/pkg/kv/kvserver/scheduler.go:321 +0x9b
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !github.com/cockroachdb/cockroach/pkg/util/stop.(*Stopper).RunAsyncTaskEx.func2()
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/util/stop/stopper.go:484 +0x1f7
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !created by github.com/cockroachdb/cockroach/pkg/util/stop.(*Stopper).RunAsyncTaskEx
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !	github.com/cockroachdb/cockroach/pkg/util/stop/stopper.go:475 +0x61a
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !For more context, check log files in: /artifacts/tmp/_tmp/95e138e66d69292427dfb9528cf06d04/logTestBackupRestoreJobTagAndLabel1075111918
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !****************************************************************************
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !This node experienced a fatal error (printed above), and as a result the
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !process is terminating.
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !Fatal errors can occur due to faulty hardware (disks, memory, clocks) or a
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !problem in CockroachDB. With your help, the support team at Cockroach Labs
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !will try to determine the root cause, recommend next steps, and we can
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !improve CockroachDB based on your report.
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !Please submit a crash report by following the instructions here:
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !    https://github.com/cockroachdb/cockroach/issues/new/choose
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !If you would rather not post publicly, please contact us directly at:
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 â‹® [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !    [email protected]
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !
F230721 09:57:17.481448 134873 kv/kvserver/kvflowcontrol/kvflowtokentracker/tracker.go:95 ⋮ [T1,n1,tenant=10,s1,r64/1:‹/Tenant/1{0-1}›] 2152 !The Cockroach Labs team appreciates your feedback.

Jira issue: CRDB-29995

@stevendanna stevendanna added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-multitenant Issues owned by the multi-tenant virtual team labels Jul 21, 2023
@stevendanna stevendanna self-assigned this Jul 21, 2023
stevendanna added a commit to stevendanna/cockroach that referenced this issue Jul 21, 2023
This test fails under race when running in a tenant
occasionally. Requires further investigation.

Informs: cockroachdb#107336

Release note: None
@stevendanna
Copy link
Collaborator Author

@irfansharif I've heard a rumor you might already know what is up here and have a fix?

craig bot pushed a commit that referenced this issue Jul 21, 2023
107334: backupccl: possibly deflake TestBackupRestoreAppend r=adityamaru a=stevendanna

In CI we've seen this test fail with:

    backup_test.go:670: error scanning '&{<nil> 0xc019c5b580}': pq:
    restart transaction: TransactionRetryWithProtoRefreshError:
    TransactionRetryError: retry txn (RETRY_SERIALIZABLE - failed
    preemptive refresh): "sql txn" meta={id=77054b51 key=/Table/121/1
    pri=0.03587869 epo=0 ts=1689916873.568949604,1
    min=1689916873.436580640,0 seq=1000} lock=true stat=PENDING
    rts=1689916873.436580640,0 wto=false gul=1689916873.936580640,0

The `RETURNING` clauses on these two `UPDATE` statements prevent automatic transaction retries.

Here we wrap the queries in an explicit transaction with a retry loop which should prevent the test failure test, assuming that the update isn't so contended it won't ever complete.

I am unable to stress this enough locally to reproduce this error.

Probably Fixes #107330

Epic: none

Release note: None

107338: backupccl: skip TestBackupRestoreJobTagAndLabel under race r=dt a=stevendanna

This test fails under race when running in a tenant occasionally. Requires further investigation.

Informs: #107336

Release note: None

Co-authored-by: Steven Danna <[email protected]>
craig bot pushed a commit that referenced this issue Jul 24, 2023
107412: kvserver: fix 'observed raft log position' assertion r=irfansharif a=irfansharif

Fixes #107336.
Fixes #106123.
Fixes #107156.
Fixes #106589.

It's possible to hit this assertion under --stress --race when the proposing replica is starved enough for raft ticks that it loses leadership right when it steps proposals through raft. We're relying on undocumented API semantics in the etcd raft library whereby it mutates stepped entries with the term+index its to end up in. But that's only applicable if stepping through entries as a leader. Simply relax this assertion instead.

Release note: None

Co-authored-by: irfan sharif <[email protected]>
@craig craig bot closed this as completed in deb538f Jul 24, 2023
THardy98 pushed a commit to THardy98/cockroach that referenced this issue Jul 24, 2023
This test fails under race when running in a tenant
occasionally. Requires further investigation.

Informs: cockroachdb#107336

Release note: None
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-multitenant Issues owned by the multi-tenant virtual team
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants