changefeedccl: TestSchemachangeDoesNotBreakSinklessFeed/sinkless is flaky #86777

Closed
nicktrav opened this issue Aug 24, 2022 · 2 comments
Labels: A-cdc (Change Data Capture); C-bug (Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.); T-cdc

nicktrav commented Aug 24, 2022

Describe the problem

The test TestSchemachangeDoesNotBreakSinklessFeed/sinkless is flaky.

Example failure during CI.

To Reproduce

Run the following locally to observe the same error from the TC build:

$ ./dev test ./pkg/ccl/changefeedccl --filter TestSchemachangeDoesNotBreakSinklessFeed/sinkless --stress
...
W220824 16:12:43.379186 588 kv/txn.go:704  [n1] 142  failure aborting transaction: node unavailable; try another peer; abort caused by: context canceled
E220824 16:12:43.391362 588 jobs/registry.go:763  [n1] 146  error claiming jobs: context canceled
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143  job 790649776371138561: running execution encountered retriable error: non-cancelable: job 790649776371138561: select-job: node unavailable; try another peer
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +(1) attached stack trace
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  -- stack trace:
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/jobs.(*Registry).stepThroughStateMachine
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  |       github.com/cockroachdb/cockroach/pkg/jobs/registry.go:1271
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/jobs.(*Registry).runJob
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  |       github.com/cockroachdb/cockroach/pkg/jobs/adopt.go:410
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/jobs.(*Registry).resumeJob.func1
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  |       github.com/cockroachdb/cockroach/pkg/jobs/adopt.go:330
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/util/stop.(*Stopper).RunAsyncTaskEx.func2
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  |       github.com/cockroachdb/cockroach/pkg/util/stop/stopper.go:489
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +Wraps: (2) non-cancelable
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +Wraps: (3) forced error mark
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | "retriable job error"
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/errors/withstack/*withstack.withStack::
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +Wraps: (4) attached stack trace
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  -- stack trace:
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/jobs.(*Job).update
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  |       github.com/cockroachdb/cockroach/pkg/jobs/update.go:296
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | [...repeated from below...]
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +Wraps: (5) job 790649776371138561
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +Wraps: (6) attached stack trace
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  -- stack trace:
I220824 16:12:43.380590 1640 jobs/registry.go:1273  [n1] 143 +  | github.com/cockroachdb/cockroach/pkg/sql.(*InternalExecutor).execInternal.func1.1
...
ERROR: exit status 1

17 runs completed, 1 failures, over 9s
...
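
For a rough sense of how flaky the test is, the summary line from the stress run above can be turned into a failure rate. This is only a sketch: the helper name `parse_stress_summary` is hypothetical, and the regular expression assumes the summary always has the exact shape seen in this one run (whole seconds, no minutes component).

```python
import re

# Summary line copied verbatim from the stress output above.
SUMMARY = "17 runs completed, 1 failures, over 9s"


def parse_stress_summary(line):
    """Return (runs, failures, seconds) from a stress summary line.

    Hypothetical helper: it only handles the format seen in this issue.
    """
    m = re.search(r"(\d+) runs completed, (\d+) failures, over (\d+)s", line)
    if m is None:
        raise ValueError(f"unrecognized summary line: {line!r}")
    runs, failures, seconds = (int(g) for g in m.groups())
    return runs, failures, seconds


runs, failures, seconds = parse_stress_summary(SUMMARY)
failure_rate = failures / runs
print(f"{failures}/{runs} runs failed ({failure_rate:.1%}) in {seconds}s")
# prints "1/17 runs failed (5.9%) in 9s"
```

A single failure in 17 runs is a small sample, so the rate is indicative at best; a longer stress run would give a steadier number.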

Environment:

Observed on master @ bdadde2.

Jira issue: CRDB-18926

Epic CRDB-11732

nicktrav added the C-bug, A-cdc, and T-cdc labels on Aug 24, 2022

blathers-crl bot commented Aug 24, 2022

cc @cockroachdb/cdc

amruss commented Aug 31, 2022

Closing due to: #86794

amruss closed this as completed on Aug 31, 2022