-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sql: lease acquisition can repeatedly contend with DDL statement when closed_timestamp target_duration and side_transport_interval are very low #89900
Comments
This test was skipped because it timed out. I believe cockroachdb#89900 is the likely cause of the timeout. Since this test doesn't depend on the shorter closed timestamp setting, we can reset them to make the timeout much less likely. Fixes cockroachdb#87129 Release note: None
89901: backupccl: unskip restore-grants r=adityamaru a=stevendanna This test was skipped because it timed out. I believe #89900 is the likely cause of the timeout. Since this test doesn't depend on the shorter closed timestamp setting, we can reset them to make the timeout much less likely. Fixes #87129 Release note: None Co-authored-by: Steven Danna <[email protected]>
Hold on, the bug here is that the last statement acquires a lease on testdb. That shouldn't happen if we've already modified testdb in the current transaction. That's a severe bug. |
@ajwerner Should this be a GA blocker then? I haven't tested it on other branches. |
I'm unable to repro this. Putting this back in triage because I don't have time to look into this further right now. |
@postamar I just tried this again, and for reasons that aren't clear to me, this is nearly impossible to reproduce for me on my macOS. But on a gceworker I can reproduce it with:
Note that the -test.timeout 3m is needed because I think often the condition clears eventually. Anyway, I trust y'alls judgement on whether this is worth looking into, just mentioned this since I found it odd that I wasn't able to reproduce it for a bit either. |
I wonder if this is caused by #91116 |
That definitely surprises me. I'd expect it to be #91116. |
This test passed under stress for 5 hours, which seems sufficient to unskip it. The underlying issue described in cockroachdb#89900 appears to be much harder to reproduce and may now be resolved. Informs cockroachdb#89900 Fixes cockroachdb#90444 Epic: none Release note: None
106115: backupccl: enable restore-grants test on tenant r=rhu713 a=stevendanna This test passed under stress for 5 hours, which seems sufficient to unskip it. The underlying issue described in #89900 appears to be much harder to reproduce and may now be resolved. Informs #89900 Fixes #90444 Epic: none Release note: None Co-authored-by: Steven Danna <[email protected]>
Describe the problem
A transaction with a DDL statement can contend with a lease-acquisition transaction when
kv.closed_timestamp.target_duration
andkv.closed_timestamp.side_transport_interval
are set very low.The following test, when run under stress, will typically hit a timeout if you set the per-test timeout to 3-5 minutes:
When running:
The first statement writes to the descriptor for testdb. The last statement results in a lease acquisition attempt on testdb. That least acquisition attempt runs in another txn that pushes the txn containing the DDL statements. But, it seems that it never pushes it far enough.
Jira issue: CRDB-20491
The text was updated successfully, but these errors were encountered: