-
Notifications
You must be signed in to change notification settings - Fork 637
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Prometheus metrics endpoint #259
Comments
@jpds I just shared a proposal about exporting node problems as logs and metrics. I think we could translate today's conditions & events into metrics. For example, for a permanent problem (node condition), it looks like this today: And for a temporary problem (event), it looks like this: I plan to translate them into counter metrics, basically counting how many times have they occurred on this node, like this: Would that be the kind of Prometheus metrics you had in mind? And if you could take a look of the doc and share your opinion, that'd also be very helpful. Thanks! |
Hi @frittentheke @jpds, the implementation of NPD metrics mode is mainly tracked at #284. It is almost complete. The only lacking thing is #315. I expect to get it finished within a few days. It'd be good if you can help verify if these changes suits your use cases. If not, we can make improvements further along the way (0.7.x). |
The changes mentioned above has been released in v0.7.0. I think we can close this issue now, right? |
I'm using v0.7.0, however, neither NPD or its metrics are showing up in Prometheus despite NPD successfully starting its prometheus exporter:
|
Hi @amagura , From your logs And if you can see something output, then NPD is working as intended. You might need to configure your Prometheus to scrap metrics from NPD's metrics endpoint. |
That must be what the issue is: running curl printed out a some metrics so NPD must be is working. Thanks for getting back to me so quickly. |
Could a
/metrics
endpoint be added to n-p-d so that tools like Prometheus can gather these and then create alerts based on those?The text was updated successfully, but these errors were encountered: