scatter: downreplication takes a long time #17000
Comments
Was this fixed by your
Try not to look too hard at the leaseholders per store graph.
Assuming I'm squinting properly at that graph, it looks like it took around 7 minutes, so I guess it's still not fixed.
Yeah, sorry, it's ~10m. How long would you expect it to take?
About the same amount of time as up-replicating. The problem, though, is that that's the wrong graph to be looking at: it includes not just the down-replication time but also the replica GC time. The metric behind the replicas-per-store graph is only decremented when a replica gets GC'ed, not when it gets removed from the consensus group. Usually those events are close together, but they aren't guaranteed to be. The "Range Operations" graph would be more useful. I'll check it out sometime, but it doesn't seem urgent.
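To make the metric lag concrete, here is a toy Go sketch of the behavior described in the comment above. It is not CockroachDB code: the `store` type and its methods are hypothetical names invented for illustration. It only assumes what the comment states, namely that the replica count is decremented at GC time rather than when the replica leaves the consensus group.

```go
package main

import "fmt"

// store is a hypothetical toy model, not CockroachDB's Store type.
type store struct {
	members map[int]bool // range IDs for which this store is still in the consensus group
	onDisk  map[int]bool // range IDs whose replica has not been GC'ed yet
}

// newStore creates a store holding a live replica for each given range ID.
func newStore(rangeIDs ...int) *store {
	s := &store{members: map[int]bool{}, onDisk: map[int]bool{}}
	for _, id := range rangeIDs {
		s.members[id] = true
		s.onDisk[id] = true
	}
	return s
}

// removeFromGroup models down-replication: the replica leaves the
// consensus group, but its data stays on disk until GC runs.
func (s *store) removeFromGroup(id int) {
	delete(s.members, id)
}

// gc models replica GC: it drops on-disk replicas that are no longer
// members and returns how many were collected.
func (s *store) gc() int {
	collected := 0
	for id := range s.onDisk {
		if !s.members[id] {
			delete(s.onDisk, id)
			collected++
		}
	}
	return collected
}

// replicaCountMetric is what a replicas-per-store graph would plot in
// this model: it only drops when gc() runs.
func (s *store) replicaCountMetric() int { return len(s.onDisk) }

// memberCount is the number of ranges this store still participates in.
func (s *store) memberCount() int { return len(s.members) }

func main() {
	s := newStore(1, 2, 3)
	s.removeFromGroup(1)
	s.removeFromGroup(2)
	// Down-replication is done, but the metric has not moved yet.
	fmt.Println(s.memberCount(), s.replicaCountMetric())
	s.gc()
	// Only after GC does the metric catch up.
	fmt.Println(s.memberCount(), s.replicaCountMetric())
}
```

In this model the gap between the two printed lines is the replica GC delay, which is why a graph built on the replica count can overstate how long down-replication itself takes.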
That is incredibly useful to know. Yeah, no rush at all; I may look into it too.
I wouldn't be surprised if this were fixed now. Don't the decommissioning tests verify something similar?
Scatter no longer scatters replicas, and when we bring that aspect of it back (#26438) it won't add all replicas before removing any. I think it's safe to close this.
Discovered while implementing a proper scatter in #16249.
AdminScatter is able to very quickly upreplicate (e.g., in a few minutes), but the replicate queue can take ten minutes to downreplicate. Figure out why; @a-robinson claims the downreplication should take approximately as long as the upreplication.
This behavior is easy to reproduce with a four-node local cluster with preexisting data: https://github.com/benesch/crdb-playground-scatter