-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Need an indication on how the LoadBalancingExporter works with a static list of hosts when a host is down #31209
Comments
Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
Same question for DNS and K8s, to be fair. It feels like the k8s one will cope but I'm unsure. would the DNS one need the "A" record to be changed if there was a host down? |
This was discussed via Slack, here: https://cloud-native.slack.com/archives/C01N6P7KR6W/p1707838013838759 Here's a long version of what happens behind the scenes, and I appreciate if this could be summarized and added as part of the readme: The load balancer exporter will create one exporter per endpoint, no matter the resolver (static, k8s, DNS). These exporters can be fine-tuned with options related to the sending queue and retry mechanisms. This means that if a network hiccup occurs and a data point cannot be delivered, the exporter will attempt to deliver it again periodically and might eventually fail. The load-balancing exporter will NOT attempt to re-route to a healthy endpoint. Concretely:
|
…o explain how topology changes can influence decisions around retry configuration, and how they can result in data loss.
Removing @alexchowle Would you be interested in posting a PR with README updates that explain this functionality? |
It's already been merged in |
Thank you! Sorry, missed the PR: #31271 |
Component(s)
exporter/loadbalancing
Describe the issue you're reporting
With a static configuration of hosts in the LoadBalancingExporter configuration:
If
host2
has its Collector stopped, it seems like all Spans that would normally be load-balanced to it would just be dropped instead of re-routing tohost1
. If that is the case then the README.md should make it clear.The text was updated successfully, but these errors were encountered: