ApplicationScope: add `health` core scope #133

hongchaodeng · 2019-09-09T22:20:57Z

No description provided.

hongchaodeng · 2019-09-09T22:21:48Z

@technosophos @mikkelhegn
Please help review it :)

resouer · 2019-09-10T00:08:49Z

4.application_scopes.md

@@ -133,31 +133,43 @@ The health scope on its own does not take any action based on health status. It
 - Application upgrade traits can monitor the aggregate health of a health scope and decide when to initiate an automatic rollback.
 - Monitoring applications can monitor the aggregate health of a health scope to issue alerts.

-#### Example


I think this is still an example?

My impression goes that Core Scope should be defined explicitly, not as an example.
We will then add examples on how to use Health scope once https://github.com/microsoft/hydra-spec/issues/130 is addressed.

I concur that this should be defined explicitly. But should we add some way for extension？

Users can set properties beyond what was defined. The Hydra model itself is extensible by design.

mikkelhegn · 2019-09-10T09:40:43Z

The idea with the health scope is to evaluate health across multiple components, to use in e.g. an upgrade scenario to monitor whether upgrading an individual component is triggering failures on dependent components. This is needed to be able to trigger roll-backs based on a set of defined thresholds. This concept exists in Service Fabric, and we would like to carry it over: https://docs.microsoft.com/en-us/azure/service-fabric/service-fabric-application-upgrade#health-checks-during-upgrades.

I agree this should be explicit, as it's a core scope, but I think the current example is closer to what we need. We need a set of parameters for the health scope to know when to report error, warning or good, and it would be great to have explicit rules as well as average thresholds. E.g. overall % of components healthy threshold, and an option to deem specific components required to be healthy for the scope to be health etc.

hongchaodeng · 2019-09-10T21:08:51Z

@mikkelhegn Thanks for the suggestion. The docs you shared looks a great learning resource!

I have added the an option to let user provide a list of components that should be healthy to consider the scope healthy. This is a very useful parameter since

I have added back the healthThresholdPercentage. TBH, we haven't really seen any use cases like this. If a Deployment only needs partial to be healthy to proceed upgrade, we just pass it to the workload controller to determine itself wether to report healthy in the Component instance (e.g. Deployment). I will let you define this since you have more experience on it.

Health scope requires some way to report health status. We define the "some way" here as the probe. The existing log query is one type of probe, but seems limited. That's why we try to make it more extensible and define the probe. We need to support probing status subresource, http endpoints, prom queries, etc. The probe-method/endpoint/timeout/interval are ubiquitous to support all of these.

Let me know what you think. Thanks very much!

suhuruli · 2019-09-13T03:37:51Z

I will wait for @technosophos to review this before I merge this. We have some folks travelling/on leave this week. This will get looked at next Monday PST :)

ApplicationScope: add health core scope

bdad754

hongchaodeng requested review from mikkelhegn, technosophos and vturecek as code owners September 9, 2019 22:20

hongchaodeng mentioned this pull request Sep 9, 2019

Issue tracker of sprint work #131

Closed

11 tasks

resouer reviewed Sep 10, 2019

View reviewed changes

address comments

532cb7e

hongchaodeng force-pushed the scope_health branch from 1926c24 to 532cb7e Compare September 10, 2019 20:56

minor

2da16c3

mikkelhegn approved these changes Sep 12, 2019

View reviewed changes

suhuruli approved these changes Sep 13, 2019

View reviewed changes

mikkelhegn merged commit 5060434 into oam-dev:master Sep 13, 2019

hongchaodeng deleted the scope_health branch September 13, 2019 14:46

This was referenced Oct 9, 2019

WIP add application scopes framework and healthscope oam-dev/rudr#160

Closed

Feature: add application scopes framework and imp health scope oam-dev/rudr#367

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ApplicationScope: add `health` core scope #133

ApplicationScope: add `health` core scope #133

hongchaodeng commented Sep 9, 2019

hongchaodeng commented Sep 9, 2019

resouer Sep 10, 2019

hongchaodeng Sep 10, 2019

wonderflow Sep 10, 2019 •

edited

Loading

hongchaodeng Sep 10, 2019

mikkelhegn commented Sep 10, 2019

hongchaodeng commented Sep 10, 2019

suhuruli commented Sep 13, 2019

ApplicationScope: add health core scope #133

ApplicationScope: add health core scope #133

Conversation

hongchaodeng commented Sep 9, 2019

hongchaodeng commented Sep 9, 2019

resouer Sep 10, 2019

Choose a reason for hiding this comment

hongchaodeng Sep 10, 2019

Choose a reason for hiding this comment

wonderflow Sep 10, 2019 • edited Loading

Choose a reason for hiding this comment

hongchaodeng Sep 10, 2019

Choose a reason for hiding this comment

mikkelhegn commented Sep 10, 2019

hongchaodeng commented Sep 10, 2019

suhuruli commented Sep 13, 2019

ApplicationScope: add `health` core scope #133

ApplicationScope: add `health` core scope #133

wonderflow Sep 10, 2019 •

edited

Loading