[core] Log about individual status changes that occur after startup #116718

spalger · 2021-10-28T21:57:03Z

We just spent a ton of time tracking down a plugin which was updating it's status from degraded to available very slowly. The vast majority of the time was spent just trying to understand how statuses were changing and would have been pretty trivial if we had seen a log line like:

Status of plugin "alerting" is now available (was degraded for 5 minutes)

I think it would be very useful to log these types of messages at an info level when they occur after some initial delay (to avoid a massive flood of messages about status changes as things startup).

The text was updated successfully, but these errors were encountered:

elasticmachine · 2021-10-28T21:57:05Z

Pinging @elastic/kibana-core (Team:Core)

pgayvallet · 2021-11-22T09:50:45Z

I agree that having status changes of individual services/plugins logged could make sense, and would probably help for support too.

I'm still a little concerned about the potential verbosity of the thing. As most plugins don't register custom status handlers, their status is computed depending on the statuses of their dependencies.

In practice, that can lead to a lot of consecutive status changes. For example, if the home (data is another example) plugins changes from available to degraded, this will cause most of the plugins to get to the same state, as almost every plugin's dependency tree has either of those somewhere at the bottom of the tree, causing ~= 100 lines outputted.

We could potentially try to group these status changes by trying to identify the 'root' plugin/service that caused the change in the tree (something like 34 plugins changed from available to degraded due to a change of status coming from 'alerting'), but to be honest, this may not be easy, given the observable hell the status plugin currently is.

So, are we fine just outputting a line in the log for each individual status change, even if in most scenario, we'll have dozen of such lines in every status change batch, or do we want more than that?

spalger · 2021-12-06T18:09:32Z

I think individual log lines for any delayed status update (one that doesn't happening "immediately" after creation or something) sounds reasonable. They're all useful information that we should be making people aware of I think.

## Summary New attempt at fixing #116718 Inspired on #126320 Here's what the newly logged `[status]` information looks like on a fresh startup: <img width="1834" alt="image" src="https://github.com/elastic/kibana/assets/25349407/d78d7f88-139f-4daf-9dc0-c4e6724ea412"> The first 2 entries are logs from Core services 🆕 . The next 5 entries are emitted due to `taskManager` plugin emitting a degraded status right at startup. I have created an issue to tackle that one: #168237 --------- Co-authored-by: kibanamachine <[email protected]>

spalger added Team:Core Core services & architecture: plugins, logging, config, saved objects, http, ES client, i18n, etc enhancement New value added to drive a business result labels Oct 28, 2021

exalate-issue-sync bot added impact:low Addressing this issue will have a low level of impact on the quality/strength of our product. loe:small Small Level of Effort loe:medium Medium Level of Effort and removed loe:small Small Level of Effort labels Oct 29, 2021

pgayvallet self-assigned this Feb 23, 2022

pgayvallet mentioned this issue Feb 24, 2022

[Status service] log plugin status changes #126320

Merged

1 task

pgayvallet closed this as completed in #126320 Mar 1, 2022

exalate-issue-sync bot reopened this Mar 15, 2022

exalate-issue-sync bot closed this as completed Mar 15, 2022

gsoldevila mentioned this issue Oct 6, 2023

Add smart logic to log information about plugin status changes #168207

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] Log about individual status changes that occur after startup #116718

[core] Log about individual status changes that occur after startup #116718

spalger commented Oct 28, 2021

elasticmachine commented Oct 28, 2021

pgayvallet commented Nov 22, 2021

spalger commented Dec 6, 2021 •

edited

Loading

[core] Log about individual status changes that occur after startup #116718

[core] Log about individual status changes that occur after startup #116718

Comments

spalger commented Oct 28, 2021

elasticmachine commented Oct 28, 2021

pgayvallet commented Nov 22, 2021

spalger commented Dec 6, 2021 • edited Loading

spalger commented Dec 6, 2021 •

edited

Loading