-
Notifications
You must be signed in to change notification settings - Fork 634
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to hook up third-party daemons? #35
Comments
/subscribed |
@jfilak Cool, we'd really like to integrate with third party daemons! NPD was introduced in K8s 1.3 for several reasons:
In the first version, we architecturally separated out the "problem daemon" and defined the "problem report interface", but kernel monitor (the first "problem daemon") is still in-process integrated because at that time it's the only daemon. We have plan to support inter-process integration and now it seems to be the time. :) @dchen1107 |
@jfilak Thanks for interest integrating new problem detector with k8s's generic NPD! By design, NPD should be easy to plug-in / swap with a different problem detector containers, and aggregate / report all problems to the upstream layers / the users. Do you want to give a demo on one of our sig-node meeting? |
Yes, I do. Thank you for the offer. However, I need some time to get familiar with kubernetes and to polish the image. I've been testing the image only on a bare metal with Docker so far. |
@jfilak Thanks! |
/subscribed |
@jfilak I am working on porting Nagios Plugin Interface to NPD which uses If then, how about we use a localhost http api to listen for third party daemon checker. |
Issues go stale after 90d of inactivity. Prevent issues from auto-closing with an If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Rotten issues close after 30d of inactivity. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
I'm an ABRT developer and I would love to create a problem daemon reporting problems detected by ABRT to node-problem-detector.
ABRT's architecture is similar to node-problem-detector's - there are agents reporting detected problems to abrtd. An ABRT agent is either a tiny daemon watching logs (or systemd-journal) or a language error handler (Python sys.excepthook, Ruby at_exit callback, /proc/sys/kernel/core_pattern, Node.js uncaughtException event handler, Java JNI agent).
I've created a docker image that is capable to detect Kernel oopses, vmcores and core files on a host:
https://github.com/jfilak/docker-abrt/tree/atomic_minimal
(It should be possible to detect uncaught [Python, Ruby, Java] exceptions in the future)
ABRT provides several ways of reporting the detected problems to users - e-mail, FTP|SCP upload, D-Bus signal, Bugzilla bug, micro-Report, systemd-journal catalog message - and it is trivial to add another report destination.
The Design Doc defines "Problem Report Interface" but I've failed to find out how to register a new problem deamon to node-problem-detector or how to use the "Problem Report Interface" from a third party daemon.
The text was updated successfully, but these errors were encountered: