Removed changes from the examples package
Updated the description of the alerts
Added for: 1m to the alerts so that they wait for one auto-restart loop before firing

Signed-off-by: lhunyady <[email protected]>
Laszlo committed Jul 8, 2024
1 parent eab4a77 commit 8de09c0
Showing 3 changed files with 6 additions and 16 deletions.
2 changes: 2 additions & 0 deletions CHANGELOG.md
@@ -1,5 +1,7 @@
 # CHANGELOG
 
+* Added alerts for Connectors/Tasks in failed state.
+
 ## 0.41.0
 
 * Add support for Apache Kafka 3.6.2
14 changes: 0 additions & 14 deletions examples/metrics/prometheus-install/prometheus-rules.yaml
@@ -159,20 +159,6 @@ spec:
       annotations:
         summary: 'All Kafka Connect containers down or in CrashLookBackOff status'
         description: 'All Kafka Connect containers have been down or in CrashLookBackOff status for 3 minutes'
-    - alert: ConnectFailedConnector
-      expr: sum(kafka_connect_connector_status{status="failed"}) > 0
-      labels:
-        severity: major
-      annotations:
-        summary: 'Kafka Connect Connector Failure'
-        description: 'Some connectors are failing, this can not be automatically recovered.'
-    - alert: ConnectFailedTask
-      expr: sum(kafka_connect_worker_connector_failed_task_count) > 0
-      labels:
-        severity: major
-      annotations:
-        summary: 'Kafka Connect Task Failure'
-        description: 'Some tasks are failing, this can not be automatically recovered.'
   - name: bridge
     rules:
     - alert: BridgeContainersDown
@@ -161,18 +161,20 @@ spec:
         description: 'All Kafka Connect containers have been down or in CrashLookBackOff status for 3 minutes'
     - alert: ConnectFailedConnector
       expr: sum(kafka_connect_connector_status{status="failed"}) > 0
+      for: 1m
       labels:
         severity: major
       annotations:
         summary: 'Kafka Connect Connector Failure'
-        description: 'Some connectors are failing, this can not be automatically recovered.'
+        description: 'One or more connectors have been in failed state for 1 minute,'
     - alert: ConnectFailedTask
       expr: sum(kafka_connect_worker_connector_failed_task_count) > 0
+      for: 1m
       labels:
         severity: major
       annotations:
         summary: 'Kafka Connect Task Failure'
-        description: 'Some tasks are failing, this can not be automatically recovered.'
+        description: 'One or more tasks have been in failed state for 1 minute.'
   - name: bridge
     rules:
     - alert: BridgeContainersDown
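
For readers unfamiliar with the `for` clause this commit adds, here is a minimal sketch of the resulting task rule written as a standalone Prometheus rule file. The top-level `groups:` layout, the group name `connect`, and the indentation are assumptions for illustration; the repository file wraps its groups under `spec:`. Only the rule fields are taken from the diff above.

```yaml
# Sketch only: the file layout and group name are assumed, not copied from the repository.
groups:
- name: connect
  rules:
  - alert: ConnectFailedTask
    expr: sum(kafka_connect_worker_connector_failed_task_count) > 0
    # Without `for`, the alert fires on the first evaluation where expr is true.
    # With `for: 1m`, the alert is held in the pending state and fires only after
    # expr has stayed true for one minute, so a task that recovers within that
    # minute (for example through an automatic restart) does not raise an alert.
    for: 1m
    labels:
      severity: major
    annotations:
      summary: 'Kafka Connect Task Failure'
      description: 'One or more tasks have been in failed state for 1 minute.'
```

If the expression turns false again before the minute is up, the pending alert is dropped and the timer starts over the next time the expression becomes true.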