Update a child's reachable state as soon as possible #10143

nilmerg · 2024-08-30T11:19:00Z

Given a three-level dependency hierarchy, the child on the lowest level is not marked unreachable when a parent on the first level goes down, unless a check result arrives afterwards.

graph LR;
    ChildHost-->pa["ParentHostA (Group 1)"];
    ChildHost-->pb["ParentHostB (Group 1)"];
    pa-->ga["GrandParentA (Group 2)"];
    pa-->gb["GrandParentB (Group 2)"];
    pb-->ga["GrandParentA (Group 2)"];
    pb-->gb["GrandParentB (Group 2)"];

Here, ChildHost must become unreachable once both GrandParentA and GrandParentB are down. Sooner or later.
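To make the expectation concrete, here is a minimal sketch of the hierarchy above. It is not Icinga 2's actual implementation: the `Host` class and `is_reachable` function are hypothetical, and it assumes that a host is reachable when every redundancy group contains at least one parent that is both up and itself reachable.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical model of a host; not an Icinga 2 type."""
    name: str
    up: bool = True  # state from the last check result
    # redundancy group name -> parents in that group
    parents: dict[str, list["Host"]] = field(default_factory=dict)


def is_reachable(host: Host) -> bool:
    """Assumed rule: each redundancy group needs at least one parent
    that is up and itself reachable."""
    return all(
        any(p.up and is_reachable(p) for p in group)
        for group in host.parents.values()
    )


ga = Host("GrandParentA")
gb = Host("GrandParentB")
pa = Host("ParentHostA", parents={"Group 2": [ga, gb]})
pb = Host("ParentHostB", parents={"Group 2": [ga, gb]})
child = Host("ChildHost", parents={"Group 1": [pa, pb]})

ga.up = False
one_down = is_reachable(child)   # GrandParentB still covers Group 2

gb.up = False
both_down = is_reachable(child)  # no available parent left in Group 2
```

Note that ParentHostA and ParentHostB still carry `up=True` from their last check result. Evaluated on demand, ChildHost is unreachable as soon as both grandparents are down; the question in this issue is when the stored reachable state catches up.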

Expected behavior

I'm unsure whether we should update every child's reachable state in such a case. Since it will be updated anyway once a check is performed, how quickly that happens depends heavily on the check interval:

If the object is currently UP/OK, there's no real need to update it, since everything "is fine" and no one should worry about it. But what if the object already has a problem? If the check interval is relatively long, the reachable state is not going to update soon enough. Of course, the reason it already has a problem is not necessarily related to a parent. So, maybe update such an object only if its state is UNKNOWN?

A very different case, though, is when the dependency configuration mandates that checks get disabled once the parent goes down. Then the child will never be checked again, and the reachable state won't be updated at all without an explicit check issued by a user. Here I expect the reachable state to be updated before checks are disabled.
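A rough sketch of what such a selective update could look like. Everything here is an assumption for illustration: the `Node` class, the `disable_checks_on_failure` flag, and `on_parent_down` are hypothetical names, all parents are treated as one redundancy group, and "has a problem" is simplified to "not OK" rather than UNKNOWN specifically.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical checkable object; not an Icinga 2 type."""
    name: str
    state_ok: bool = True        # last known check state
    reachable: bool = True       # stored flag, normally refreshed by check results
    checks_enabled: bool = True
    disable_checks_on_failure: bool = False  # stand-in for "disable checks" in the dependency
    parents: list["Node"] = field(default_factory=list, repr=False)
    children: list["Node"] = field(default_factory=list, repr=False)


def link(child: Node, parent: Node) -> None:
    child.parents.append(parent)
    parent.children.append(child)


def compute_reachable(node: Node) -> bool:
    # Assumption: one redundancy group, so a single available parent suffices.
    if not node.parents:
        return True
    return any(p.state_ok and compute_reachable(p) for p in node.parents)


def on_parent_down(parent: Node) -> list[str]:
    """Walk all descendants breadth-first; refresh only those that need it."""
    parent.state_ok = False
    updated, seen = [], set()
    queue = deque(parent.children)
    while queue:
        node = queue.popleft()
        if id(node) in seen:
            continue
        seen.add(id(node))
        needs_update = node.disable_checks_on_failure or not node.state_ok
        if needs_update and not compute_reachable(node):
            node.reachable = False           # update the stored state first ...
            if node.disable_checks_on_failure:
                node.checks_enabled = False  # ... then disable checks
            updated.append(node.name)
        queue.extend(node.children)
    return updated


ga, gb = Node("GrandParentA"), Node("GrandParentB")
pa, pb = Node("ParentHostA"), Node("ParentHostB")
child = Node("ChildHost", disable_checks_on_failure=True)
for p in (pa, pb):
    link(p, ga)
    link(p, gb)
    link(child, p)

first = on_parent_down(ga)   # GrandParentB still up, nothing to update
second = on_parent_down(gb)  # ChildHost is updated before its checks stop
```

In this sketch ParentHostA and ParentHostB keep their stale `reachable` flag because they are still OK and their checks keep running, which mirrors the "no real need to update it" argument above. The walk still visits every descendant, so it does not answer whether traversing the whole hierarchy is worth it.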

--

So, I'm certain what to expect in cases where checks will be disabled. But if that's not the case, is it really required to traverse the entire hierarchy to every child related in some way to the parent in question?

nilmerg mentioned this issue Sep 17, 2024

Track effect of an object on dependent children #10158

Open

Al2Klimov added the area/runtime Downtimes, comments, dependencies, events label Sep 25, 2024
