-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: failover/chaos/read-write/lease=expiration failed #119361
Comments
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ a36097be277adef635f55d317579ca79b450bfef:
Parameters:
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ c9c3cc5f3c3a4a6ab556f4b9d5b6ec0381901bdb:
Parameters:
Same failure on other branches
|
Dupe of #119085 |
Previously, the multiple failures were started and finished independently. This caused a problem if the ability to recover from one failure depended on a different failure recovering first. To mitigate this and to add a little more chaos, start and recover each failure in a seperate goroutine. This will allow the "most important" failure to recover first so that the others can recover if they depend on each other. Note that this is more important today while we don't support all the failure modes that the chaos implements. Specifically we don't handle partial partitions handling yet. Epic: none Fixes: cockroachdb#119085 Fixes: cockroachdb#119347 Fixes: cockroachdb#119361 Fixes: cockroachdb#119454 Release note: None
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ bf013ea0a5311726e65d37e8f047ce39ea2d5f10:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ ce5f34ea97475f45fa354e58aacf424779d0de49:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ 067e48d29b9093038f6fcf2074cd761ffdcd4fe2:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ c561383c9d86a52cf63a2c8aaa3ee20270635f11:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ c994982a8be5af89f594e115e897dd6d62cf99d8:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ c43f54cdde5b7578f4a0ca61de41463f0d690993:
Parameters:
Same failure on other branches
|
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ 7dfff9430b0aedaac5bb57e06d704a527336863e:
Parameters:
Same failure on other branches
|
119650: roachtest: make failure recovery independent r=nvanbenschoten a=andrewbaptist Previously, the multiple failures were started and finished independently. This caused a problem if the ability to recover from one failure depended on a different failure recovering first. To mitigate this, recover each failure in a separate goroutine. This will allow the "most important" failure to recover first so that the others can recover if they depend on each other. This is more important today while we don't recover from all the failure modes that chaos implements. Specifically we don't handle partial partitions fully with epoch leases. Epic: none Fixes: #119085 Fixes: #119347 Fixes: #119361 Fixes: #119454 Release note: None 122283: server: don't log for missing locality r=yuzefovich a=andrewbaptist Previously we would always log a message that the locality was unknown for requests from sql gateways. We should remove unnecessary logs from traces. Epic: none Release note: None Co-authored-by: Andrew Baptist <[email protected]>
Previously, the multiple failures were started and finished independently. This caused a problem if the ability to recover from one failure depended on a different failure recovering first. To mitigate this, recover each failure in a separate goroutine. This will allow the "most important" failure to recover first so that the others can recover if they depend on each other. This is more important today while we don't recover from all the failure modes that chaos implements. Specifically we don't handle partial partitions fully with epoch leases. Epic: none Fixes: #119085 Fixes: #119347 Fixes: #119361 Fixes: #119454 Release note: None
Previously, the multiple failures were started and finished independently. This caused a problem if the ability to recover from one failure depended on a different failure recovering first. To mitigate this, recover each failure in a separate goroutine. This will allow the "most important" failure to recover first so that the others can recover if they depend on each other. This is more important today while we don't recover from all the failure modes that chaos implements. Specifically we don't handle partial partitions fully with epoch leases. Epic: none Fixes: #119085 Fixes: #119347 Fixes: #119361 Fixes: #119454 Release note: None
roachtest.failover/chaos/read-write/lease=expiration failed with artifacts on master @ e39dafe6d8c153301ff43ed2b3ed3e13af9ec72a:
Parameters:
ROACHTEST_arch=amd64
ROACHTEST_cloud=gce
ROACHTEST_coverageBuild=false
ROACHTEST_cpu=2
ROACHTEST_encrypted=false
ROACHTEST_fs=ext4
ROACHTEST_localSSD=false
ROACHTEST_metamorphicBuild=false
ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
See: Grafana
This test on roachdash | Improve this report!
Jira issue: CRDB-36166
The text was updated successfully, but these errors were encountered: