Fluentd content in prom cat io #220

Merged 2 commits on May 26, 2022
16 changes: 8 additions & 8 deletions resources/fluentd/ALERTS.md
@@ -2,14 +2,14 @@
 ## No Input From Container
 No Input From Container.

-## Increased Error Ratio
-Increased Error Ratio.
+## High Error Ratio
+High Error Ratio.

-## Increased Retry Ratio
-Increased Retry Ratio.
+## High Retry Ratio
+High Retry Ratio.

-## Increased Retry Wait
-Increased Retry Wait.
+## High Retry Wait
+High Retry Wait.

 ## Low Buffer Available Space
 Low Buffer Available Space.
@@ -20,8 +20,8 @@ Buffer Queue Length Increasing.
 ## Buffer Total Bytes Increasing
 Buffer Total Bytes Increasing.

-## Increased Slow Flush Ratio
-Increased Slow Flush Ratio.
+## High Slow Flush Ratio
+High Slow Flush Ratio.

 ## No Output Records From Plugin
 No Output Records From Plugin.
32 changes: 16 additions & 16 deletions resources/fluentd/alerts.yaml
@@ -13,36 +13,36 @@ configurations:
 rules:
 - alert: '[Fluentd] No Input From Container'
 expr: |
-sum by (input_namespace, input_container)(rate(fluentd_input_status_num_records_total[1m])) == 0
+sum by (input_namespace, input_container)(rate(fluentd_input_status_num_records_total[5m])) == 0
 for: 5m
 labels:
 severity: warning
 annotations:
 description: No Input From Container.
-- alert: '[Fluentd] Increased Error Ratio'
+- alert: '[Fluentd] High Error Ratio'
 expr: |
-sum by (type, plugin_id)(rate(fluentd_output_status_num_errors[1m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[1m]))> 0.05
+sum by (type, plugin_id)(rate(fluentd_output_status_num_errors[5m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[5m]))> 0.05
 for: 5m
 labels:
 severity: critical
 annotations:
-description: Increased Error Ratio.
-- alert: '[Fluentd] Increased Retry Ratio'
+description: High Error Ratio.
+- alert: '[Fluentd] High Retry Ratio'
 expr: |
-sum by (type, plugin_id)(rate(fluentd_output_status_retry_count[1m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[1m]))> 0.05
+sum by (type, plugin_id)(rate(fluentd_output_status_retry_count[5m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[5m]))> 0.05
 for: 5m
 labels:
 severity: critical
 annotations:
-description: Increased Retry Ratio.
-- alert: '[Fluentd] Increased Retry Wait'
+description: High Retry Ratio.
+- alert: '[Fluentd] High Retry Wait'
 expr: |
-sum by (type, plugin_id)(max_over_time(fluentd_output_status_retry_wait[1m])) > 60
+sum by (type, plugin_id)(max_over_time(fluentd_output_status_retry_wait[5m])) > 60
 for: 5m
 labels:
 severity: critical
 annotations:
-description: Increased Retry Wait.
+description: High Retry Wait.
 - alert: '[Fluentd] Low Buffer Available Space'
 expr: |
 fluentd_output_status_buffer_available_space_ratio < 10
@@ -56,7 +56,7 @@ configurations:
 avg_over_time(fluentd_output_status_buffer_queue_length[5m]) - avg_over_time(fluentd_output_status_buffer_queue_length[5m] offset 5m)> 0
 for: 5m
 labels:
-severity: critical
+severity: warning
 annotations:
 description: Buffer Queue Length Increasing.
 - alert: '[Fluentd] Buffer Total Bytes Increasing'
@@ -67,19 +67,19 @@ configurations:
 severity: warning
 annotations:
 description: Buffer Total Bytes Increasing.
-- alert: '[Fluentd] Increased Slow Flush Ratio'
+- alert: '[Fluentd] High Slow Flush Ratio'
 expr: |
-sum by (type, plugin_id)(rate(fluentd_output_status_slow_flush_count[1m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[1m]))> 0.05
+sum by (type, plugin_id)(rate(fluentd_output_status_slow_flush_count[5m])) /sum by (type, plugin_id)(rate(fluentd_output_status_emit_count[5m]))> 0.05
 for: 5m
 labels:
 severity: warning
 annotations:
-description: Increased Slow Flush Ratio.
+description: High Slow Flush Ratio.
 - alert: '[Fluentd] No Output Records From Plugin'
 expr: |
-rate(fluentd_output_status_emit_records[1m]) == 0
+rate(fluentd_output_status_emit_records[5m]) == 0
 for: 5m
 labels:
 severity: warning
 annotations:
-description: No Output Records From Plugin.
+description: No Output Records From Plugin.
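
The three ratio alerts in this diff (error, retry, slow flush) share one shape: a counter's `rate(...)` divided by the rate of `fluentd_output_status_emit_count` over the same window, compared against `0.05`. The sketch below is only an illustration of that condition in Python, not code from this repository. The function name `ratio_alert_fires` and the sample numbers are hypothetical, and it assumes that, because both rates use the same window, the ratio of rates is approximately the ratio of counter increases.

```python
import math


def ratio_alert_fires(numerator_increase: float,
                      emit_increase: float,
                      threshold: float = 0.05) -> bool:
    """Approximate `rate(num[w]) / rate(emit[w]) > threshold` for one
    (type, plugin_id) group, given counter increases over the window w.

    Assumption: both rates share the same window, so the window length
    cancels and only the two increases matter.
    """
    if emit_increase == 0:
        # Mirror float division in PromQL: 0/0 is NaN (never above the
        # threshold), while x/0 for x > 0 is +Inf (always above it).
        ratio = math.nan if numerator_increase == 0 else math.inf
    else:
        ratio = numerator_increase / emit_increase
    return ratio > threshold


# Hypothetical increases observed within a single window.
print(ratio_alert_fires(6, 100))   # 6% of emits errored
print(ratio_alert_fires(5, 100))   # exactly 5%, the comparison is strict
print(ratio_alert_fires(0, 0))     # idle plugin
```

In the rules themselves the expression must additionally stay true for the whole `for: 5m` period before the alert fires; this sketch covers only a single evaluation.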