-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove/propose different "no metrics known for pod" log #349
Comments
/help |
@serathius: Please ensure the request meets the requirements listed here. If this request no longer meets these requirements, the label can be removed In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
There is a same metric-server problem in my cluster: unable to fetch pod metrics for pod kube-system/POD_NAME:no metrics known for pod. |
I had the same problem before。 resources: |
Hey @zhangyu84848245, |
/assign |
Possible solutions (in order of implementation complexity):
Where I propose to remove error logs, as those error logs can be caused by:
Other ways we can improve visibility in metric availability delay:
Both option 2 & 3 try to add logic to guess health of metrics pipeline. They complicate code without providing any additional benefits. Measuring health of pipeline should be done via defining proper metrics and defining externally monitored SLOs. /cc @s-urbaniak |
/cc @kawych |
Thank you for this @serathius , I was concerned about the issue but noticed it would stop logging after some time, everything is so far looking good 🙇 |
ping @s-urbaniak |
I agree with just going forward with option 1. I think options 2. and 3. should be solved via a higher level alerting system. |
Looks like work was done |
There are a lot of issues where users are seeing metrics-server reporting error "no metrics known for pod" and asking for help.
To my understanding this error is expected to occur in normal healthy metrics-server.
Metrics Server periodically scrapes all nodes to gather metrics and populate it's internal cache. When there is a request to Metrics API, metrics-server reaches to this cache and looks for existing value for pod. If there is no value for existing pod in k8s, metrics server reports error "no metrics known for pod". This means this error can happen in situation when:
Providing better information to users would greatly reduce throughput of tickets.
The text was updated successfully, but these errors were encountered: