-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: replicagc-changed-peers/restart=false failed #98966
Comments
Digging the logs, I can see that remaining replicas listed in failure message are being removed, but removals are happening longer than the test waits. Removal happens past the retry duration. |
Seems fine. |
I rerun that locally and i think the problem is not that we don't give it enough time (5 min), but moving replicas off node only starts after 5 minutes. So if we are lucky (because of exponential backoff), then once 5 min hits, we quickly remove everything and test succeeds, but if attempt happens at 5 min mark then we fail. I'll rerun with verbose allocator logs to see why it isn't picking up replicas. |
So 5 minutes is |
How about reducing |
|
Write up an issue? |
Forgot to reference it here. But here it is #99064 |
roachtest.replicagc-changed-peers/restart=false failed with artifacts on release-22.2 @ 73df0dc428ff1e516c5b11a7ca1930e3339da118:
Parameters:
ROACHTEST_cloud=gce
,ROACHTEST_cpu=4
,ROACHTEST_encrypted=false
,ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
This test on roachdash | Improve this report!
Jira issue: CRDB-25629
The text was updated successfully, but these errors were encountered: