Skip to content

Releases: giantswarm/prometheus-rules

v3.14.1

15 May 12:04
970d481
Compare
Choose a tag to compare

Fixed

  • Fix resource estimation recording rules for clusters that have more than 1 prometheus.

v3.14.0

15 May 09:18
d691d5e
Compare
Choose a tag to compare

Added

  • Add recording rules to show prometheus scraping job memory usage.
  • Add cluster_control_plane_unhealthy inhibition.
  • Add inhibitions expressions for CAPI clusters.
  • Add ops-recipe for KeyPairStorageAlmostFull alert
  • Add missing opsrecipe for Mimir alerts.
  • Add opsrecipe to CoreDNSMaxHPAReplicasReached
  • make targets for pint linter

Changed

  • Replace cancel_if_apiserver_down with cancel_if_cluster_control_plane_unhealthy

Fixed

  • Fix shield alert labels for Mimir.
  • Fix cabbage alert labels for Mimir.
  • Fix honeybadger alert labels for Mimir.
  • Fix cert-manager alert labels for Mimir.
  • Fix operatorkit alert labels for Mimir.
  • Fix all mixins according to pint recommendations.
  • Fix etcd alert labels for Mimir.
  • Fix apiserver alert labels for Mimir.

Removed

  • Removed apiserver_down inhibition dummy trigger.
  • Remove cilium entry from KAAS SLOs.
  • Remove elasticsearch and tempo related alerts and recording rules.

v3.13.1

30 Apr 12:31
e8bb1be
Compare
Choose a tag to compare

Removed

  • Removed alerts for absent crsync deployments. They cause false alerts because the rules apply to all prometheus instances on the MC.

v3.13.0

30 Apr 08:07
303e45d
Compare
Choose a tag to compare

Added

  • Added alerts for absent crsync deployments.

Changed

  • Update LokiRingUnhealthy query to avoid false positive when a new pod is starting.
  • Changed DeploymentNotSatisfiedBigMac alert to work for teleport related deployments only on CAPI flavored clusters

v3.12.2

25 Apr 18:00
3ea83a0
Compare
Choose a tag to compare

Fixed

  • Removed check for Teleport operators in DeploymentNotSatisfiedBigMac alert as it is not valid on vintage

v3.12.1

25 Apr 16:21
5934356
Compare
Choose a tag to compare

Fixed

  • Fix alerting rules for crsync.

v3.12.0

19 Apr 12:04
cfa5fd0
Compare
Choose a tag to compare

Changed

  • Update ops-recipe link for promtail alerts.
  • Remove Linkerd form Service SLO alerts.
  • Include all Linkerd Namespaces in LinkerdDeploymentNotSatisfied alert.
  • Make LinkerdDeploymentNotSatisfied alert business hours only.

v3.11.2

18 Apr 09:29
8286293
Compare
Choose a tag to compare

Added

  • Add ops recipe for flux being suspended for too long alert.

v3.11.1

17 Apr 08:59
d3806b6
Compare
Choose a tag to compare

Added

  • Add CAPI and CAPA dashbaord to the coresponding alerts.

Fixed

  • link to PrometheusMissingGrafanaCloud opsrecipe

v3.11.0

15 Apr 11:10
8db31bb
Compare
Choose a tag to compare

Added

  • Add CiliumFailedNetworkPolicy alert.