-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
teamcity: show_trace logic test fails due to unexpected txn push #33720
Comments
@nvanbenschoten @andreimatei the problem here is that there's an extra |
I guess the right thing to do is... start logging to see what that intent is, and then take it from there. As long as all intents are on one range (as I guess they should be for that small table), each commit shouldn't leave intents behind. But maybe the intent that the update is running into is around the schema (which I guess would also be indicated by the preceding |
If the txn isn't 1pc, the intent can be observed by others, and if that's
true there can be a push if the txn record is gced before the waiter
observes it. I haven't looked at the flake in detail but maybe that
explains something.
…On Tue, Jan 15, 2019, 19:01 Andrei Matei ***@***.*** wrote:
I guess the right thing to do is... start logging to see what that intent
is, and then take it from there. As long as all intents are on one range
(as I guess they should be for that small table), each commit shouldn't
leave intents behind. But maybe the intent that the update is running into
is around the schema (which I guess would also be indicated by the
preceding querying next range at /Table/SystemConfigSpan/Start line.
Or bisect and see where it started being flaky and blame that loser. I
can't say I really want the issue at the moment :)
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#33720 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AE135LbiKIlbD-EOzxB8ePL-J2YB4p1_ks5vDheEgaJpZM4Z_nn0>
.
|
what's the key |
(I don't think we're supposed to change issue titles for test flakes cause then TC will open new issues. Right?)
|
That sounds reasonable (when I wrote my comment, I hadn't looked into the failure mode). The test could run the same query once without the trace to clean up any errant intents, assuming it is guaranteed that the schema change txn is done at that point. Of course that's just a band aid, tests of that kind are bound to be flaky. |
I don't know what to do here other than delete the test. |
35521: sql: reduce non-determinism in the show_trace logic tests r=knz a=knz Fixes #33720. First commit from #35519. The test is really about asserting the KV operations sent on behalf of SQL statements. The distsender traffic is really irrelevant. Since that's where most of the test flakes / non-determinism is coming from, simply remove it. Release note: None 35548: sqlbase: avoid a race in sqlbase.CancelChecker r=knz a=knz Fixes #35539. Prior to the distsql-ification of planNode execution, all the execution steps in a local query were interleaved (using coroutine-style concurrency), so that the calls to `(*CancelChecker).Check()` were all sequential. With distsql now it's possible for different planNode `Next()` or `startExec()` methods to be running in concurrent goroutines on multiple cores, i.e. truly parallel. This in turn requires atomic access to the counter in the cancel checker. Without atomic access, there is a race condition and the possibility that the cancel checker does not work well (some increments can be performed two times, which could cause the condition of the check to occasionally fail). Found with SQLSmith. Release note: None Co-authored-by: Raphael 'kena' Poss <[email protected]>
The following tests appear to have failed on master (test): TestLogic/local: TestLogic/local/show_trace, TestLogic/local, TestLogic
You may want to check for open issues.
#1092793:
Please assign, take a look and update the issue accordingly.
The text was updated successfully, but these errors were encountered: