-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Display Action Group in Alert Details page #82275
Comments
Pinging @elastic/kibana-alerting-services (Team:Alerting Services) |
A few questions:
|
My guess would be the former where we do track in some way while not executing the actions. From what I recall, this is the primary difference between muting and disabling (we still log and track what group the alert would be part of yet don't execute the actions). Though I'm sure @pmuellr has put more thought on this than I have.
Not at this time, we have safety-guards in place to prevent that here: https://github.com/elastic/kibana/blob/master/x-pack/plugins/alerts/server/alert_instance/alert_instance.ts#L69. |
I agree. Would be super nice to know what would be getting triggered, during a mute. No idea how we might make this happen tho. I think an |
Ya, it does look like maybe it's in the alert state today. This is from an alert with one active instance: $ curl -k $KBN_URLBASE/api/alerts/alert/fedea4aa-dbed-4274-a92e-2b5af3f5053f/state | json
{
"alertInstances": {
"host-1": {
"state": {},
"meta": {
"lastScheduledActions": {
"group": "threshold met",
"date": "2020-11-03T18:17:27.360Z"
}
}
}
},
"previousStartedAt": "2020-11-03T18:17:27.256Z"
} I'm think But of course, we aren't scheduling an action, so this field likely isn't set when muted or throttled, but ???. We might want to add a property as a peer of |
I like the idea of adding a |
Ya, we'll have to take a closer look at |
hahahahaha - issue #57173 (but different naming concern) took a quick look at the code, looks like the name is correct in that it only captures info on instances with scheduled actions, not muted or throttled ones: kibana/x-pack/plugins/alerts/server/task_runner/task_runner.ts Lines 237 to 250 in eb43158
If you look at |
Another thought on the implementation - while a "peer" object of
|
I was thinking we should try to get the action group id into the event log when |
Do you think it would be clearer to have a
|
Ah, hmmm. Maybe ... So, then the next question is, do we write It seems like we should, if we aren't. So ... yeah, I think any instance that shows up on this page will be from one of the There's an advantage to keeping this stuff in the alert state itself, as we don't have to query and process the event log to get the info. But since we're already doing that on this page, shouldn't impact performance / latency from the current situation. And we could add it to the alert state later, if we wanted "instant" access to it. |
Let's see if we can plumb the data we need just through the event log for now, will be the least disruptive change I think. And we def want the action group in the event log docs anyway. If we do need to put something in the state, we'll probably want to evaluate a couple of different shapes ... |
Another thought, but probably another issue/PR. We probably don't show muted state per instance in the alert details right now, but should. Longer term, we talked about throttling instances, and also "snoozing" (probably alerts and instances, where snooze is like a timed mute, different than throttle as throttle will "disengage" when the alert (/instance) status changes or the action group it's scheduling changes). |
I was incorrect when I said this. We do have the information for what action group would have been run for muted alerts and muted active alerts are already being written to the event log. |
I just caught up with the thread, I'm +1 on exploring the event log path to store the information. My gut feeling is telling me once we have event log UIs, it could be useful history information to display to users. We could develop a bar chart of the active instances in the alert details page and group the count by action groups (2hrs ago; 3 severe, 1 warning; 1hr ago 1 severe, 2 warning, etc) I think this piece was part of some original designs #56280 (comment). |
part of #64077
Addresses:
see Meta issue for details
The text was updated successfully, but these errors were encountered: