Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ccl/changefeedccl: TestNoStopAfterNonTargetAddColumnWithBackfill failed #129045

Closed
cockroach-teamcity opened this issue Aug 15, 2024 · 4 comments
Closed
Assignees
Labels
A-cdc Change Data Capture branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. P-2 Issues/test failures with a fix SLA of 3 months T-cdc X-unactionable This was closed because it was unactionable.

Comments

@cockroach-teamcity
Copy link
Member

cockroach-teamcity commented Aug 15, 2024

ccl/changefeedccl.TestNoStopAfterNonTargetAddColumnWithBackfill failed on master @ 575cdd4696dfcac8f311d1ea546683271102f73e:

test logs left over in: outputs.zip/logTestNoStopAfterNonTargetAddColumnWithBackfill2410879673
--- FAIL: TestNoStopAfterNonTargetAddColumnWithBackfill (15.18s)
=== RUN   TestNoStopAfterNonTargetAddColumnWithBackfill/sinkless
    helpers_test.go:872: making server as secondary tenant
    helpers_test.go:951: making sinkless feed factory
    helpers_test.go:1016: pgURL sinkless SinklessFeedUser
    helpers_test.go:1016: pgURL sinkless root
    testfeed_test.go:273: sinkless feed creating changefeed: CREATE CHANGEFEED FOR TABLE hasfams FAMILY b_and_c WITH schema_change_policy='stop'
    changefeed_test.go:1787: 
        	Error Trace:	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:200
        	            				github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:275
        	            				github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/changefeed_test.go:1787
        	            				github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:1123
        	            				github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:1158
        	Error:      	Received unexpected error:
        	            	expected message
        	            	(1) assertion failure
        	            	Wraps: (2) attached stack trace
        	            	  -- stack trace:
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.readNextMessages
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:120
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.assertPayloadsBaseErr
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:211
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.assertPayloadsBase.func1
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:203
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.withTimeout.func1
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:268
        	            	  | github.com/cockroachdb/cockroach/pkg/util/timeutil.RunWithTimeout
        	            	  | 	github.com/cockroachdb/cockroach/pkg/util/timeutil/timeout.go:33
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.withTimeout
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:264
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.assertPayloadsBase
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:201
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.assertPayloads
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:275
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.TestNoStopAfterNonTargetAddColumnWithBackfill.func1
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/changefeed_test.go:1787
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.cdcTestNamed.func1
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:1123
        	            	  | github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl.cdcTestNamedWithSystem.func1
        	            	  | 	github.com/cockroachdb/cockroach/pkg/ccl/changefeedccl/helpers_test.go:1158
        	            	  | testing.tRunner
        	            	  | 	GOROOT/src/testing/testing.go:1689
        	            	  | runtime.goexit
        	            	  | 	src/runtime/asm_amd64.s:1695
        	            	Wraps: (3) expected message
        	            	Error types: (1) *assert.withAssertionFailure (2) *withstack.withStack (3) *errutil.leafError
        	Test:       	TestNoStopAfterNonTargetAddColumnWithBackfill/sinkless
    testfeed_test.go:280: closing sinkless feed
    --- FAIL: TestNoStopAfterNonTargetAddColumnWithBackfill/sinkless (15.17s)

Parameters:

  • attempt=1
  • deadlock=true
  • run=1
  • shard=6
Help

See also: How To Investigate a Go Test Failure (internal)

/cc @cockroachdb/cdc

This test on roachdash | Improve this report!

Jira issue: CRDB-41352

@cockroach-teamcity cockroach-teamcity added branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. release-blocker Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked. T-cdc labels Aug 15, 2024
@blathers-crl blathers-crl bot added the A-cdc Change Data Capture label Aug 15, 2024
@andyyang890 andyyang890 self-assigned this Aug 16, 2024
@andyyang890 andyyang890 removed the release-blocker Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked. label Aug 16, 2024
@andyyang890
Copy link
Collaborator

Reviewing the logs, it didn't hit either of the core changefeed error logs in #127530 nor the timeout log in #127553. It seems like the test server just shut down randomly with a server shutting down: instructing cmux to stop accepting message. Spot-checking a few similar past failures we had, they were all running with the secondary tenant:

Asked for help from #multi-tenant here: https://cockroachlabs.slack.com/archives/C02HWA24541/p1723839743273609

@stevendanna
Copy link
Collaborator

Looking at the logs from just this failure, it looks to me like the schema change stopped the feed despite our expectation that it wouldn't.

I240815 09:30:27.016476 14927345 ccl/changefeedccl/kvfeed/kv_feed.go:155 ⋮ [T10,Vcluster-10,nsql1,client=127.0.0.1:60082,hostssl,user=‹sinklessfeeduser›] 404  stopping kv feed due to schema change at 1723714222.413838743,1

@andyyang890
Copy link
Collaborator

Thanks for taking a look.

My interpretation (which might be wrong) was that the changefeed was going to restart, but I guess we can't really tell from the error message since the same error is returned for both restart and exit (aside: we have an issue to improve observability for this #124635):

if boundaryType == jobspb.ResolvedSpan_RESTART || boundaryType == jobspb.ResolvedSpan_EXIT {
return schemaChangeDetectedError{highWater.Next()}
}

I think some other evidence that it might not be because the changefeed stopped is that I don't see the logs that were added in this PR: #127530

@rharding6373 rharding6373 added the P-2 Issues/test failures with a fix SLA of 3 months label Sep 3, 2024
craig bot pushed a commit that referenced this issue Oct 2, 2024
131732: ci: update bazel builder image r=rickystewart a=cockroach-teamcity

Release note: None
Epic: None


131780: changefeedccl: enable verbose logs for a few schema change unit tests r=rharding6373 a=andyyang890

**changefeedccl/kvfeed: add debug logs around schema change boundary types**

Release note: None

----

**changefeedccl: add debug logs around when processors move to draining**

Release note: None

----

**changefeedccl: enable verbose logs for a few schema change unit tests**

We're seeing that some changefeed unit tests involving schema changes
are sometimes flaking when using a core changefeed. This patch enables
verbose logs that will provide more information about what the kv feed
is doing when it sees the schema change and when processors are moving
to draining.

Release note: None

----

Informs #131186
Informs #130404
Informs #129777
Informs #129226
Informs #129045

Co-authored-by: cockroach-teamcity <[email protected]>
Co-authored-by: Andy Yang <[email protected]>
@rharding6373
Copy link
Collaborator

The test artifacts have expired. We added more observability and more verbose logging in case it happens again. Closing as unactionable.

@rharding6373 rharding6373 added the X-unactionable This was closed because it was unactionable. label Jan 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-cdc Change Data Capture branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. P-2 Issues/test failures with a fix SLA of 3 months T-cdc X-unactionable This was closed because it was unactionable.
Projects
None yet
Development

No branches or pull requests

4 participants