kv/kvserver: TestFlowControlUnquiescedRange failed #107864

Closed
cockroach-teamcity opened this issue Jul 30, 2023 · 1 comment · Fixed by #108454

@cockroach-teamcity
Member

cockroach-teamcity commented Jul 30, 2023

kv/kvserver.TestFlowControlUnquiescedRange failed with artifacts on master @ f295bd861a3a427652b19c2254d2401ebb4a3c8e:

=== RUN   TestFlowControlUnquiescedRange
    test_log_scope.go:167: test logs captured to: /artifacts/tmp/_tmp/33e1d369c27b9c01b2b6009c561815a3/logTestFlowControlUnquiescedRange3338863982
    test_log_scope.go:81: use -show-logs to present logs inline
    test_server_shim.go:98: 
        Test server was configured to route SQL queries to a secondary tenant (virtual cluster).
        If you are only seeing a test failure when this message appears, there may be a problem
        specific to cluster virtualization or multi-tenancy.
        
        To investigate, consider using "COCKROACH_TEST_TENANT=true" to force-enable just
        the secondary tenant in all runs (or, alternatively, "false" to force-disable), or use
        "COCKROACH_INTERNAL_DISABLE_METAMORPHIC_TESTING=false" to disable all random test variables altogether.
*
* WARNING: test tenant requested by configuration, but code organization prevents start!
* use of SQL servers requires an enterprise license. see https://cockroachlabs.com/pricing?cluster=0f23fe1c-018b-42cd-9531-6edc34b309f7 for details on how to enable enterprise features
*
*
* WARNING: test tenant requested by configuration, but code organization prevents start!
* use of SQL servers requires an enterprise license. see https://cockroachlabs.com/pricing?cluster=0f23fe1c-018b-42cd-9531-6edc34b309f7 for details on how to enable enterprise features
*
*
* WARNING: test tenant requested by configuration, but code organization prevents start!
* use of SQL servers requires an enterprise license. see https://cockroachlabs.com/pricing?cluster=0f23fe1c-018b-42cd-9531-6edc34b309f7 for details on how to enable enterprise features
*
    flow_control_integration_test.go:2405: condition failed to evaluate within 3m45s: expected to track 2097152 tokens in aggregate, got 6291456
    panic.go:540: -- test log scope end --
test logs left over in: /artifacts/tmp/_tmp/33e1d369c27b9c01b2b6009c561815a3/logTestFlowControlUnquiescedRange3338863982
--- FAIL: TestFlowControlUnquiescedRange (240.35s)

Parameters: TAGS=bazel,gss,race , stress=true

Help

See also: How To Investigate a Go Test Failure (internal)

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

Jira issue: CRDB-30229

@cockroach-teamcity cockroach-teamcity added branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. T-kv KV Team labels Jul 30, 2023
@cockroach-teamcity cockroach-teamcity added this to the 23.2 milestone Jul 30, 2023
@pav-kv
Collaborator

pav-kv commented Aug 2, 2023

This is a fairly new test (April). It has flaked a few times in CI recently too, so it fails both with and without --race:

flow_control_integration_test.go:2405: condition failed to evaluate within 45s: expected to track 2097152 tokens in aggregate, got 4194304

@irfansharif Looks like some kind of incorrect token counting / leak?

Reproduces easily with dev test --stress pkg/kv/kvserver --filter=TestFlowControlUnquiescedRange.
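
For reference, the token counts in the two failure messages are whole mebibyte values. The sketch below only does the arithmetic on the numbers quoted in the logs; the helper name surplus is made up for illustration and is not part of the test or of kvserver.

```go
package main

import "fmt"

// mib is one mebibyte in bytes.
const mib = 1 << 20

// surplus is a hypothetical helper (not from the CockroachDB code base).
// It returns how many bytes over the expected token count were tracked,
// and how many whole MiB that is.
func surplus(expected, got int64) (extraBytes, extraMiB int64) {
	extraBytes = got - expected
	return extraBytes, extraBytes / mib
}

func main() {
	// Expected value taken from the failure messages in this issue.
	const expected = 2097152
	// Observed values: the CI flake (4194304) and the nightly failure (6291456).
	for _, got := range []int64{4194304, 6291456} {
		extra, n := surplus(expected, got)
		fmt.Printf("expected %d MiB, got %d MiB: %d extra bytes (%d MiB)\n",
			expected/mib, got/mib, extra, n)
	}
}
```

Both observed values exceed the expected 2 MiB by a whole multiple of 2 MiB (2 MiB and 4 MiB extra). That pattern might point to whole extra units of work being tracked rather than a partial accounting error, but that reading is an inference from the numbers, not something the logs confirm.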

@pav-kv pav-kv added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. A-admission-control A-kv Anything in KV that doesn't belong in a more specific category. labels Aug 2, 2023
craig bot pushed a commit that referenced this issue Aug 9, 2023
107355: cli: enhance diagnosing contention with redacted debug zips r=fqazi a=fqazi

Previously, in a redacted debug zip, we excluded retries and key information for contended keys, since they could contain PII. This patch does the following:

- Adds a new builtin is_system_table_key, which reports whether a key belongs to a system table
- Modifies redacted debug zips to include contention-table data for keys that belong to system tables
- Includes retries and last_retry_reason information for queries in cluster insights to help diagnose contention

Fixes: #104593

108454: Revert "kvprober: metamorphically enable / configure kvprober" r=andrewbaptist,joshimhoff a=nvanbenschoten

This reverts (most of) commit 769ba1c.

That commit metamorphically enabled kvprober. This has been observed to be destabilizing to unit tests. When the metamorphic constant is enabled (50% of the time) and when kvprober is fast enough, random ranges will see extra requests that they aren’t expecting. This adds nondeterminism which can trip up tests in any number of different ways.
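
The effect described above can be sketched with a toy simulation. Everything here is an assumption made for illustration: metamorphicBool and requestsSeen are hypothetical helpers, not CockroachDB's API, and the model ignores the "fast enough" timing condition by treating every enabled run as one that sends an extra request.

```go
package main

import (
	"fmt"
	"math/rand"
)

// metamorphicBool is a hypothetical stand-in for a metamorphic test constant.
// Outside test builds it returns defaultValue; in test builds it returns
// true with probability 1/2, matching the "50% of the time" in the commit
// message.
func metamorphicBool(rng *rand.Rand, testBuild bool, defaultValue bool) bool {
	if !testBuild {
		return defaultValue
	}
	return rng.Intn(2) == 0
}

// requestsSeen models a range that receives the requests the test issued
// plus, if the prober is enabled, one extra probe request (a simplification).
func requestsSeen(expected int, proberEnabled bool) int {
	if proberEnabled {
		return expected + 1
	}
	return expected
}

func main() {
	rng := rand.New(rand.NewSource(1))
	const runs, expected = 1000, 3
	failures := 0
	for i := 0; i < runs; i++ {
		enabled := metamorphicBool(rng, true, false)
		// A test asserting an exact request count fails whenever the
		// prober happened to be enabled for this run.
		if requestsSeen(expected, enabled) != expected {
			failures++
		}
	}
	fmt.Printf("%d of %d simulated runs saw unexpected requests\n", failures, runs)
}
```

In this simplified model roughly half of the runs see a count the test did not plan for, which is the kind of nondeterminism the revert removes.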

All of the following flakes have been tracked back to kvprober:

Fixes #107864.
Fixes #108242.
Fixes #108441.
Fixes #108349.
Fixes #108124.
Closes #108366.

Release note: None

Co-authored-by: Faizan Qazi
Co-authored-by: Nathan VanBenschoten
@craig craig bot closed this as completed in 1374908 Aug 9, 2023