Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ForceReplication during failovers to reduce errors in clustered scenarios #1563

Merged
merged 1 commit into from
Oct 24, 2020

Conversation

aravindyeduvaka
Copy link
Contributor

Multiple people had previously complained that whenever there is a cluster failover, they experience Connection Exceptions for at least close to a minute.
During investigation, @joroda found that this is due to the fact that config check is done roughly every 60 seconds by default and during failover it has to be done almost immediately to eliminate the errors.

This fix shows significant improvement from the tests we did. it reduces errors to around 2 seconds for the 90th percentile case compared to around 56 seconds of errors for 90th percentile right now.

@aravindyeduvaka
Copy link
Contributor Author

@NickCraver, @mgravell ping!

Copy link
Collaborator

@NickCraver NickCraver left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looking good, thanks for the tweaks!

@NickCraver NickCraver merged commit b634be8 into StackExchange:main Oct 24, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants