With #28 we are able to make the Compass Manager transparent and simplify our operational life by establishing smart metrics and alerting rules.
The goal of this task is to identify which metrics / KPIs are business relevant and what their critical thresholds are. We also have to define an action plan for the case that such a threshold is reached, which triggers the action required to bring our business back on track. Finally, alerting rules have to be configured that inform us as soon as one of the thresholds is reached.
AC:
Identify technical and business-critical metrics / KPIs that give a clear indication of the quality and health of the Compass Manager. Get in touch with the SREs to identify missing alerts and critical metrics.
For each metric, define why it is relevant and what it represents.
Define the thresholds (min, max, etc.) that indicate a service degradation or health issue of the Compass Manager. If a metric has no threshold, verify whether it is still helpful for us to measure this value.
Specify the action that has to be taken when a threshold is reached to bring the Compass Manager back into a productive and healthy state.
Present the results to the team to collect feedback from colleagues.
Implement the identified business metrics in the Compass Manager.
Configure alerting rules that inform the team as soon as one of the thresholds is reached.
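The criteria above pair every metric with a reason, a threshold and a recovery action. A minimal sketch of how such a catalogue could be captured and evaluated is shown below, independent of any monitoring stack. All metric names, threshold values and action texts are hypothetical placeholders, not values agreed for the Compass Manager, and treating a threshold as reached when the value meets or passes it is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class MetricRule:
    """One catalogue entry: what is measured, why, its limits, and the reaction."""
    name: str                          # hypothetical metric name
    reason: str                        # why the metric is relevant
    action: str                        # what to do when a threshold is reached
    min_value: Optional[float] = None  # lower threshold, None if not applicable
    max_value: Optional[float] = None  # upper threshold, None if not applicable

    def has_threshold(self) -> bool:
        return self.min_value is not None or self.max_value is not None

    def is_breached(self, value: float) -> bool:
        # Assumption: a threshold is "reached" when the value meets or passes it.
        if self.min_value is not None and value <= self.min_value:
            return True
        if self.max_value is not None and value >= self.max_value:
            return True
        return False


def evaluate(rules: List[MetricRule], samples: Dict[str, float]) -> List[Tuple[str, str]]:
    """Return (metric name, action) for every rule whose threshold is reached."""
    alerts = []
    for rule in rules:
        value = samples.get(rule.name)
        if value is not None and rule.is_breached(value):
            alerts.append((rule.name, rule.action))
    return alerts


# Hypothetical catalogue; names and numbers are placeholders only.
RULES = [
    MetricRule(
        name="failed_operations_ratio",
        reason="Share of failed operations, indicates degraded service quality",
        action="Check dependencies and retry the failed operations",
        max_value=0.05,
    ),
    MetricRule(
        name="reconciliations_per_minute",
        reason="Shows whether the manager is still processing work",
        action="Inspect the logs and restart the manager if it is stuck",
        min_value=1.0,
    ),
    MetricRule(
        name="managed_runtimes_total",
        reason="Informational count without a threshold",
        action="None defined",
    ),
]
```

Calling `evaluate(RULES, samples)` with a dictionary of current values returns the actions that are due. Entries for which `has_threshold()` is false are the candidates to review for whether measuring them is still helpful.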
Reasons
Improve operational quality and simplify on-call shifts by establishing proper metrics/KPI measurement and alerting.
Attachments
tobiscr changed the title from "Setup business critical metrics and alerting" to "Setup business critical metrics, define an action plan and configure alerting rules" on Dec 27, 2023
tobiscr changed the title from "Setup business critical metrics, define an action plan and configure alerting rules" to "Identify and implement business critical metrics / KPIs, define an action plan and configure alerting rules" on Dec 27, 2023