-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
storage: Reinstate coalesced heartbeats #6107
Comments
For reference, see #3528, which removed multiraft. |
@spencerkimball brought up the interesting idea of measuring liveness based on Gossip as a first approximation. That is, a node would locally respond to heartbeats for all of its Raft groups as long as the recipient node had a signal in gossip (maybe the store status, or something smaller that can be gossiped more frequently). Historically we've had trouble with coalesced heartbeats and the semantics they carry in Raft, especially during asymmetrical partitions. However we reinstate them, we should be clear on what we're guaranteeing. |
BTW, take a look at @d4l3k's recent PR #7399. Effectively this adds a measure of node-to-node connection liveliness to Pulling that thread further, you could imagine not ticking Raft groups for which regular outside heartbeats exist, and ticking them only in the absence of heartbeats (so that elections would ensue), to address the large number of raft groups that would otherwise have to be touched by every tick (might be an unrealistic idea, but just throwing it out there). |
@tschottdorf I had a similar idea yesterday. It seems possible to leverage the |
Closed by #9380. |
🎉 |
Coalesced heartbeats were removed when the
multiraft
package was folded intostorage
. We should bring them back. This may help with issues seen in #5970, but the available evidence suggests this is lower priority than #6106.The text was updated successfully, but these errors were encountered: