-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kvserver: make config change failures less scary #72546
Comments
FYI: I run into this with my 6 Node Cluster. We recently moved to different servers (Dedicated Hardware instead of Cloud VMs) and the performance on the hardware was dramatically bad. The only thing I can find in the logs that look suspicious is those log lines similar to the one above.
We now moved back to the cloud servers and it looks like they are gone there too, which makes it even scarier. Even the hard is ways faster and should work out of the box extremely well, but for a reason, we can not find it is a really worse performance with SQL Latency up to 10 Seconds. |
I assure you that that log line has nothing to do with your bad performance. |
Consider looking at the |
Thanks for you hint, the network latency was around 0.5 ms between those nodes (they are plugged into the same switch), the rest of the dashboards looked even better than with the cloud instances. The only log entries I can find that look "suspicious" are the ones I posted above. Unfortunately, the metrics are no longer visible in the dashboards because the nodes were decommissioned because of the significant performance issues. |
We have marked this issue as stale because it has been inactive for |
Is your feature request related to a problem? Please describe.
Concurrency during replication changes can cause failure. These errors are benign. We should mark them as such and avoid the scary logging.
Consider:
and
cockroach/pkg/kv/kvserver/store_rebalancer.go
Line 350 in e5bc3e7
Describe the solution you'd like
We can keep logging these errors if we'd like, but at worst, with a warning, and better yet, with info.
Additional context
Relates to #41392.
Jira issue: CRDB-11200
The text was updated successfully, but these errors were encountered: