Only page when there are less than 500MB left on kubelet volume
fiunchinho committed Oct 29, 2024
1 parent 78fa24f commit ce482af
Showing 2 changed files with 6 additions and 2 deletions.
4 changes: 4 additions & 0 deletions CHANGELOG.md
@@ -7,6 +7,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## [Unreleased]

+### Changed
+
+- Change `KubeletVolumeSpaceTooLow` to only page when there are 500MB or less of space left, letting the node-problem-detector handle the rest.
+
## [4.21.1] - 2024-10-25

### Fixed
@@ -44,8 +44,8 @@ spec:
# When this happens, the problem_gauge metric has value 1, so we do a multiply join on that metric - 1 to get 0 when the metric is present and active, and keep the series values that are > 0.
# The right hand side of the or is necessary because we need to be alerted in clusters without the node-problem-detector.
# Note that we add 1 to the disk free space so we still get alerted when the free bytes are 0.
-# We are also alerted if the free space is less than 2GB for 30 minutes.
-expr: (( node_filesystem_free_bytes{cluster_type="workload_cluster",mountpoint=~"(/rootfs)?/var/lib/kubelet"} +1 < (2 * 1024 * 1024 * 1024)) * on (node, cluster_type, cluster_id, installation, organization, pipeline, region, customer) (1 - problem_gauge{reason="KubeletDiskIsFull"}) or sum (node_filesystem_free_bytes{cluster_type="workload_cluster",mountpoint=~"(/rootfs)?/var/lib/kubelet"} +1 < (2 * 1024 * 1024 * 1024)) by (node, cluster_type, cluster_id, installation, organization, pipeline, region, customer)) > 0
+# We are also alerted if the free space is less than 500MB for 30 minutes.
+expr: (( node_filesystem_free_bytes{cluster_type="workload_cluster",mountpoint=~"(/rootfs)?/var/lib/kubelet"} +1 < (500 * 1024 * 1024)) * on (node, cluster_type, cluster_id, installation, organization, pipeline, region, customer) (1 - problem_gauge{reason="KubeletDiskIsFull"}) or sum (node_filesystem_free_bytes{cluster_type="workload_cluster",mountpoint=~"(/rootfs)?/var/lib/kubelet"} +1 < (500 * 1024 * 1024)) by (node, cluster_type, cluster_id, installation, organization, pipeline, region, customer)) > 0
for: 60m
labels:
area: kaas
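
The rule's expression is dense, so here is a rough Python sketch of how it appears to behave for a single node. It is a simplified model and not how Prometheus evaluates the rule: it assumes one filesystem series per node, it ignores the `for:` duration, and the names `alert_value`, `free_bytes` and `problem_gauge` (with `None` meaning the node-problem-detector metric is absent) are hypothetical and chosen only for illustration.

```python
from typing import Optional

MIB = 1024 * 1024
OLD_THRESHOLD = 2 * 1024 * MIB  # 2 * 1024 * 1024 * 1024 bytes, as in the removed expr
NEW_THRESHOLD = 500 * MIB       # 500 * 1024 * 1024 bytes, as in the added expr


def alert_value(
    free_bytes: int,
    problem_gauge: Optional[int],
    threshold: int = NEW_THRESHOLD,
) -> Optional[int]:
    """Return the sample value the rule would yield for one node, or None if no alert."""
    # "+1" keeps the value non-zero when the volume is completely full.
    shifted = free_bytes + 1
    # The "<" comparison acts as a filter: series at or above the threshold are dropped.
    if not shifted < threshold:
        return None
    if problem_gauge is not None:
        # Left side of "or": multiply by (1 - problem_gauge), which is 0 when
        # the node-problem-detector already reports KubeletDiskIsFull.
        value = shifted * (1 - problem_gauge)
    else:
        # Right side of "or": used when there is no problem_gauge series.
        value = shifted
    # The trailing "> 0" removes the series zeroed out by the multiplication.
    return value if value > 0 else None


if __name__ == "__main__":
    print(alert_value(0, None))           # full disk, no node-problem-detector
    print(alert_value(0, 1))              # full disk, detector already handling it
    print(alert_value(1024 * MIB, None))  # 1 GiB free, above the new threshold
```

Under this model, a node with 1 GiB free would have matched the old 2 GiB threshold but no longer matches the 500 MiB one, which is the change this commit makes.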