Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sql: flake in TestCCLLogic_regional_by_row due to error in colflow.(*vectorizedFlow).Cleanup() #91845

Closed
knz opened this issue Nov 14, 2022 · 0 comments · Fixed by #91969
Closed
Assignees
Labels
A-sql-execution Relating to SQL execution. C-test-failure Broken test (automatically or manually discovered). T-sql-queries SQL Queries Team

Comments

@knz
Copy link
Contributor

knz commented Nov 14, 2022

Found here:
https://teamcity.cockroachdb.com/buildConfiguration/Cockroach_BazelEssentialCi/7503537?showRootCauses=false&expandBuildChangesSection=true&expandBuildDeploymentsSection=true&expandBuildProblemsSection=true&expandBuildTestsSection=true

panic: expected 2 components to be closed, but found that only 1 were
goroutine 98566 [running]:
github.com/cockroachdb/cockroach/pkg/sql/colexecerror.InternalError(...)
  github.com/cockroachdb/cockroach/pkg/sql/colexecerror/error.go:193
github.com/cockroachdb/cockroach/pkg/sql/colflow.(*vectorizedFlow).Cleanup(0xc0207a1db0, {0x6916fb8, 0xc02be73ec0})
  github.com/cockroachdb/cockroach/pkg/sql/colflow/vectorized_flow.go:395 +0x405
github.com/cockroachdb/cockroach/pkg/sql/flowinfra.(*RemoteFlowRunner).RunFlow.func1.2()
  github.com/cockroachdb/cockroach/pkg/sql/flowinfra/remote_flow_runner.go:106 +0x85
github.com/cockroachdb/cockroach/pkg/sql/flowinfra.(*RemoteFlowRunner).RunFlow.func1.3()
  github.com/cockroachdb/cockroach/pkg/sql/flowinfra/remote_flow_runner.go:114 +0x38
created by github.com/cockroachdb/cockroach/pkg/sql/flowinfra.(*RemoteFlowRunner).RunFlow.func1
  github.com/cockroachdb/cockroach/pkg/sql/flowinfra/remote_flow_runner.go:112 +0x2ae

cc @yuzefovich for triage.

Jira issue: CRDB-21455

@knz knz added C-test-failure Broken test (automatically or manually discovered). A-sql-execution Relating to SQL execution. T-sql-execution labels Nov 14, 2022
@blathers-crl blathers-crl bot added the T-sql-queries SQL Queries Team label Nov 14, 2022
craig bot pushed a commit that referenced this issue Nov 14, 2022
91635: jobs: add a comment about the Resumer contract r=andreimatei a=andreimatei

Release note: None
Epic: None

91754: zip: move `--redact-logs` deprecation warning to end of zip output r=dhartunian a=abarganier

This patch simply moves the deprecation notice for the `--redact-logs` flag to the bottom of the debug zip output. Previously, the message was logged at the beginning of the output, which was quickly drowned out by the rest of the output indicating the debug zip progress.

Release note: none

Addresses #91685

Epic: CRDB-12732

91830: sql/importer: skip a flaky test r=rafiss a=knz

Informs #91828.
Informs  #91845.
Informs #91850.

Co-authored-by: Andrei Matei <[email protected]>
Co-authored-by: Alex Barganier <[email protected]>
Co-authored-by: Raphael 'kena' Poss <[email protected]>
@yuzefovich yuzefovich self-assigned this Nov 14, 2022
craig bot pushed a commit that referenced this issue Nov 14, 2022
91160: sql: do not print stack trace when logging if txn is not open r=yuzefovich a=yuzefovich

After executing each statement, that statement might be logged. If there
were any audit events, then we attempt to resolve the table names for
which the audit events have occurred. To do the resolution we're using
the current txn. Previously, if that txn has been aborted or committed,
it would result in a scary-looking stack trace added to the log, and
this commit fixes it.

Epic: None

Release note: None

91563: jobs: clear job claim after execution r=ajwerner a=stevendanna

Since #89014 the job system reset a job's claim when transitioning it from pause-requested to paused and from cancel-requested to reverting. The job system signals these transitions to the running Resumer by cancelling the job's context and does not wait for the resumer to exit. Once the claim is clear, another node can adopt the job and start running it's OnFailOrCancel callback. As a result, clearing the context makes it more likely that OnFailOrCancel executions will overlap with Resume executions.

In general, Jobs need to assume that Resume may still be running while OnFailOrCancel is called. But, making it more likely isn't in our interest.

Here, we only clear the lease when we exit the job state machine. This makes it much more likely that OnFailOrCancel doesn't start until Resume has returned.

Epic: None

Release note: None

91874: colflow: temporarily disable test assertion about closers r=yuzefovich a=yuzefovich

Informs: #91845.

Epic: None

Release note: None

Co-authored-by: Yahor Yuzefovich <[email protected]>
Co-authored-by: Steven Danna <[email protected]>
@craig craig bot closed this as completed in 7ffaece Nov 22, 2022
@mgartner mgartner moved this to Done in SQL Queries Jul 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-sql-execution Relating to SQL execution. C-test-failure Broken test (automatically or manually discovered). T-sql-queries SQL Queries Team
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

2 participants