-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Fleet] Prioritize showing agents as offline #140477
Comments
Pinging @elastic/fleet (Team:Fleet) |
Related to #122206 |
@nchaulet Putting this high priority bug on your list. |
Before I start working on that @paul-tavares @kevinlog I know endpoint is using Fleet status somehow does this change make sense to you? |
@nchaulet - apologies, I missed this this ping earlier. Thank you for the heads up. We are using the Fleet status in some places, but just to show users what it is. We don't rely on the status for any logic. I think it's OK to change the priority of statuses in Fleet to make more sense for customers. We have recently added features to bubble up Endpoint specific errors in the Fleet Agent details UI, but this, again, isn't dependent on Agent showing that it is "Unhealthy". We detect these errors from the Endpoint policy response itself. So changing the way we show Offline is fine. |
@nchaulet - @kevinlog is correct - I don't think we are impacted by this change. We use a few pieces of data from the Agent in enriching our endpoint metadata API here: Lines 278 to 300 in 368eba6
Specifically to the statuses, we use the Lines 11 to 23 in 6efef04
|
In our agent status calculation code, we have some prioritization to prioritize signals of one status over others. This is done here:
kibana/x-pack/plugins/fleet/common/services/agent_status.ts
Lines 16 to 51 in 8016007
Since the offline status is second to last, this can result in a few common situations where an agent really is not able to check in at all and offline showing up in other statuses:
These situations may give the false impression that there's something that the user can do take action on this agent from the UI, when in fact there is not because the Agent is not able to check in with Fleet Server at all.
One simple solution could be to always show these agents as offline in the agent table view if they haven't checked in the required window but still show the other status information on the agent detail page.
The text was updated successfully, but these errors were encountered: