Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

giantswarm / prometheus-rules Public

generated from giantswarm/template-app

Notifications You must be signed in to change notification settings
Fork 3
Star 19

Code
Issues 1
Pull requests 6
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Releases: giantswarm/prometheus-rules

Releases · giantswarm/prometheus-rules

v4.30.0

10 Dec 13:24

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.30.0 Latest

Latest

Added

Add alerts for karpenter issues.

Assets 2

Loading

All reactions

v4.29.0

09 Dec 13:26

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.29.0

Changed

Increase time to trigger PromtailRequestsErrors alert from 15 to 25m.

Assets 2

Loading

All reactions

v4.28.0

02 Dec 08:31

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.28.0

Added

Add alert to monitor the KubeadmConfig CRs having trouble generating bootstrap data.

Changed

Ignore HelmReleases in e2e test organization namespaces for cabbage FluxHelmReleaseFailed (cilium, network-policies, coredns)

Assets 2

Loading

All reactions

v4.27.0

27 Nov 15:47

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.27.0

Added

KongProductionDeploymentNotSatisfied to alert on clusters starting with p.
KongNonProdDeploymentNotSatisfied to alert on clusters not starting with p.

Removed

Split KongDeploymentNotSatisfied into KongProductionDeploymentNotSatisfied and KongNonProdDeploymentNotSatisfied to be able to control alerting in- and outside business hours.

Assets 2

Loading

All reactions

v4.26.2

27 Nov 10:52

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.26.2

Changed

Remove label_replace from app_operator_app_info based alerts and use the cluster_id from the metric on CAPI.

Added

Add cloud-provider-controller.rules to monitor the cloud-provider-controller components across providers.
Add alerts to monitor the HelmReleases for cilium and coredns.
Add alert to monitor the HelmRelease for the vertical-pod-autoscaler-crd app.
Add alert to monitor Shield pods restarts.
Add MimirRulerTooManyFailedQueries alert to detect when Mimir ruler is failing to evaluate rules

Fixed

Fix dashboard link for MimirContinuousTestFailing alert
Fix tests so they fail if some helm template fails to render

Assets 2

Loading

All reactions

v4.26.1

19 Nov 12:59

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.26.1

Changed

MimirObjectStorageLowRate and LokiObjectStorageLowRate only check management cluster apps
MimirObjectStorageLowRate and LokiObjectStorageLowRate are less sensitive

Assets 2

Loading

All reactions

v4.26.0

19 Nov 08:56

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.26.0

Changed

Bump alloy-rules app version to 0.7.0
- Upgrades alloy to 1.4.2 to 1.5.0

Added

new MimirObjectStorageLowRate alert
new LokiObjectStorageLowRate alert

Assets 2

Loading

All reactions

v4.25.0

18 Nov 08:42

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.25.0

Changed

Mimir compactor alert: better failure detection

Added

Add new mimir continuous test alerts:
- MimirContinuousTestFailingOnWrites
- MimirContinuousTestFailingOnReads
- MimirContinuousTestMissing
- MimirContinuousTestFailing

Removed

Remove the mimir.enabled property to replace it with the MC flavor as all CAPI MCs now run Mimir.

Assets 2

Loading

All reactions

v4.24.1

12 Nov 09:46

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.24.1

Fixed

Fix MonitoringAgentDown to page when both prometheus-agent and alloy-metrics jobs are missing.

Assets 2

Loading

All reactions

v4.24.0

12 Nov 07:54

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v4.24.0

Added

Add a set of sensible alerts to monitor alloy.
- AlloySlowComponentEvaluations and AlloyUnhealthyComponents to report about alloy component state.
- LoggingAgentDown to be alerted when the logging agent is down.
- LogForwardingErrors to be alerted when the loki.write component is failing.
- LogReceivingErrors to be alerted when the loki.source.api components of the gateway is failing.
- MonitoringAgentDown to be alerted when the monitoring agent is down.
- MonitoringAgentShardsNotSatisfied to be alerted when the monitoring agent is missing any number of desired shards.

Changed

Update DeploymentNotSatisfiedAtlas to take into account the following components:
- observability-operator
- alloy-rules
- observability-gateway
Move all grafana-cloud related alerts to their own file.
Move all alloy related alerts to the alloy alert file.
Rename and move the following alerts as they are not specific to Prometheus:
- PrometheusCriticalJobScrapingFailure => CriticalJobScrapingFailure
- PrometheusJobScrapingFailure => JobScrapingFailure
- PrometheusFailsToCommunicateWithRemoteStorageAPI => MetricForwardingErrors

Assets 2

Loading

All reactions

Previous 1 2 3 4 5 … 40 41 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.