Build out health monitoring #24691

Closed
3 tasks
Tracked by #24660
Johennes opened this issue Feb 28, 2023 · 2 comments
Labels
A-Developer-Experience T-Epic Issue is at Epic level T-Task Tasks for the team like planning

Comments

@Johennes
Contributor

Johennes commented Feb 28, 2023

Why?

We don’t have a good overview of CI health beyond individual PRs. Some job failures are posted into rooms, and some failure rates are available via external tools such as Cypress. There is, however, no way to get a holistic view or to monitor how metrics evolve over time, and things like job execution time are not tracked at all. This limits our ability to identify recurring issues and decreases confidence in the stability of our CI setup.

A related monitoring problem is that our Sentry instance is currently not very helpful, at times even called unusable. This prevents us from actually getting value out of this seemingly useful tool.

What?

We’ll build a dashboard to monitor relevant CI health metrics such as failure rates and execution times. We’ll also make Sentry usable.
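As a rough illustration of the two metrics named above, here is a minimal sketch that computes a failure rate and a mean execution time from a list of CI run records. The record shape (`conclusion`, `run_started_at`, `updated_at`) is an assumption, loosely modelled on workflow-run data, and `summarize_runs` and `SAMPLE_RUNS` are hypothetical names with made-up data; none of this is taken from the project's actual setup.

```python
from datetime import datetime
from statistics import mean


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp, accepting a trailing 'Z' for UTC."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize_runs(runs):
    """Return failure rate and mean duration for runs that finished.

    Only runs that concluded with 'success' or 'failure' are counted;
    cancelled or skipped runs are ignored (an assumption of this sketch).
    """
    finished = [r for r in runs if r.get("conclusion") in ("success", "failure")]
    if not finished:
        return {"runs": 0, "failure_rate": None, "mean_duration_s": None}

    failures = sum(1 for r in finished if r["conclusion"] == "failure")
    durations = [
        (parse_ts(r["updated_at"]) - parse_ts(r["run_started_at"])).total_seconds()
        for r in finished
    ]
    return {
        "runs": len(finished),
        "failure_rate": failures / len(finished),
        "mean_duration_s": mean(durations),
    }


# Hypothetical sample data, for illustration only.
SAMPLE_RUNS = [
    {"conclusion": "success", "run_started_at": "2023-02-28T10:00:00Z", "updated_at": "2023-02-28T10:10:00Z"},
    {"conclusion": "failure", "run_started_at": "2023-02-28T11:00:00Z", "updated_at": "2023-02-28T11:20:00Z"},
    {"conclusion": "cancelled", "run_started_at": "2023-02-28T11:30:00Z", "updated_at": "2023-02-28T11:31:00Z"},
    {"conclusion": "success", "run_started_at": "2023-02-28T12:00:00Z", "updated_at": "2023-02-28T12:05:00Z"},
    {"conclusion": "success", "run_started_at": "2023-02-28T13:00:00Z", "updated_at": "2023-02-28T13:15:00Z"},
]
```

Calling `summarize_runs(SAMPLE_RUNS)` gives one data point; running it on a schedule and storing the results would be one way to track how these numbers evolve over time.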

Plan

CI health monitoring

Fix Sentry

Internal references

myhours: https://app.myhours.com/#/projects/2028386/overview

@Johennes Johennes added T-Task Tasks for the team like planning Z-Infra T-Epic Issue is at Epic level labels Feb 28, 2023
@Johennes Johennes changed the title Build out health monitoring ⭐️ Build out health monitoring Mar 2, 2023
@paboum

paboum commented Mar 9, 2023

https://en.wikipedia.org/wiki/Worse_is_better

It seems you don't have time for this research. Just pick one at random, such as https://github.com/otto-de/gitactionboard, and start using it ASAP. Monitor just one thing, but monitor it from today. That way it provides value from day one, and you can use the saved capacity to continue the integration or to switch to another tool if this one really doesn't work out.

@Johennes Johennes changed the title ⭐️ Build out health monitoring Build out health monitoring Oct 23, 2023
@Johennes
Contributor Author

@Johennes Johennes closed this as not planned Won't fix, can't repro, duplicate, stale Oct 23, 2023
2 participants