Sysdig Trigger #45

jlangy · 2021-11-05T18:24:16Z

jlangy
Nov 5, 2021
Maintainer

Starting a thread for a curious Sysdig alert that came up. In Sysdig we have an alert on the number of ready Patroni pods, using the formula sum(avg(kubernetes.pod.status.ready)). We have 3 pods, and a Low severity alert fires if that value drops below 3.

When one of the pods spiked in CPU, it caused the Sysdig trigger to go off (the value dropped to 2.98 for a bit). The CPU spike is below:

The strange thing is that no events were logged in OpenShift, even though Sysdig showed the drop:

I would think that if one pod had lost its ready status, the Kubernetes API would log an event.

Wondering if I am misinterpreting something here, or maybe a pod can lose its ready status temporarily without logging an event?

NB: As a side note, I lowered the alert threshold so it fires when the avg is < 2.5. It might make more sense to use sum(min(kubernetes.pod.status.ready)) instead, though.
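
To illustrate how a value like 2.98 could come out of the formula, here is a minimal sketch. It assumes the inner avg is taken over each pod's readiness samples within the alert window and the outer sum is taken across pods. The sample count (50 per pod) and the single not-ready sample are made up for illustration, and the helper names are hypothetical; none of this comes from the actual Sysdig data.

```python
from statistics import mean

def sum_of_avg(samples_per_pod):
    """Average each pod's samples over the window, then sum across pods."""
    return sum(mean(samples) for samples in samples_per_pod)

def sum_of_min(samples_per_pod):
    """Take each pod's minimum over the window, then sum across pods."""
    return sum(min(samples) for samples in samples_per_pod)

# Hypothetical window: 1 = ready, 0 = not ready.
pod_a = [1] * 50
pod_b = [1] * 50
pod_c = [1] * 49 + [0]  # one not-ready sample, e.g. during the CPU spike

window = [pod_a, pod_b, pod_c]

print(round(sum_of_avg(window), 2))  # 2.98
print(sum_of_min(window))            # 2
```

Under these assumptions, a single not-ready sample pulls sum(avg(...)) slightly below 3, which trips a "< 3" alert but not a "< 2.5" one, whereas sum(min(...)) drops a whole unit for any not-ready sample in the window.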

Replies: 0 comments
