-
Notifications
You must be signed in to change notification settings - Fork 522
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gossip ring memberlist - cross-connect within namespace #4273
Comments
While exploring memberlist module , came across a comment from @joe-elliott in this thread for LOKI #2766 (comment) . it says I believe this happens due to IP reuse in k8s. After a node in memberlist disappears the cluster will still reach out to it for a certain timeout period. If a Tempo cluster has a pod shutdown and a Loki cluster has a pod start up and claim its IP within the timeout then the two memberlist clusters can join together. We generally saw this occurring when rolling out two clusters at the same time. Seems the case is same here . Good to know this behaviour . |
Thanks for reporting this and the solution. I'm considering submitting a PR to the helm charts repo that sets the https://github.com/grafana/helm-charts/blob/main/charts/tempo-distributed/values.yaml#L22 Do you think that would help? As long as people named their Tempo clusters differently it would prevent this and it would definitely prevent it from occurring between Tempo/Mimir or Tempo/Loki |
Thank You. just like in this case of manual fix I may be wrong, but wouldn't this be a simpler approach than Values.fullnameOverride. |
I think you have the right idea. I honestly don't know much about the helm chart so I was just guessing. Will put up a PR if you could give it a review 🙏 |
Looks like someone already did: https://github.com/grafana/helm-charts/pull/3058/files I'm going to clean this person's PRs up and get them mergeable. Obviously we missed this. EDIT: PRs updated. if you get a chance to review it would be appreciated! |
Thats great :) . |
Tested and working good as expected.
|
Describe the bug
While it may not be ideal, i have two distributed instances of Tempo running in the same Namespace
Both are separated clearly by its own labels and its own gossip-ring service .
While everything was running fine for almost an year , suddenly i noticed Traces were getting created in both stores.
Even it was found in S3. Metrics were showing cross-tenant traces as well.
Memberlist was expected to working fine with this. However it didn't.
Based on some events thrown by the members, A workaround was attempted to include this param
This helped to fix the issue .
Am wondering , why headless service didnt work as expected. Tried to explore the memberlist module , but haven't got a clarity yet.
To Reproduce
Deploy with in namespace , Check the member list via API query .
Expected behavior
Ideally only own headless service associated members only expected
Environment:
k8s. helm chart - tempo-distributed-1.7.3
Image: tempo:2.3.1-amd64
Additional Context
The text was updated successfully, but these errors were encountered: