Daemonset stuck in progressing #16951
Labels
bug
component:application-controller
component:health-check
version:EOL
Describe the bug
ArgoCD reports the daemonset as stuck in the Progressing state, even though its pods are running on both nodes (2) in the spot node pool.
Clicking on the daemonset shows the following message in the health details:
Waiting for daemon set "opentelemetry-collector-agent" rollout to finish: 0 of 3 updated pods are available...
The status field on the live manifest is:
The output of
kubectl -n opentelemetry rollout status daemonset/opentelemetry-collector-agent
is
daemon set "opentelemetry-collector-agent" successfully rolled out
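For context, here is a minimal sketch of the check that I assume sits behind both messages. It follows the order of comparisons that `kubectl rollout status` applies to the daemonset status counters; whether ArgoCD's health check runs exactly this logic is an assumption. The `dsStatus` struct and `rolloutState` function are hypothetical stand-ins rather than real API types, and the counter values in `main` are illustrative, chosen only to reproduce the two messages above.

```go
package main

import "fmt"

// dsStatus is a hypothetical local stand-in for the DaemonSet fields
// that the rollout check reads (metadata.generation plus status counters).
type dsStatus struct {
	Generation             int64
	ObservedGeneration     int64
	DesiredNumberScheduled int32
	UpdatedNumberScheduled int32
	NumberAvailable        int32
}

// rolloutState returns a health state and a message, applying the
// comparisons in the same order as `kubectl rollout status`.
func rolloutState(name string, s dsStatus) (string, string) {
	// The controller has not yet observed the latest spec.
	if s.ObservedGeneration < s.Generation {
		return "Progressing", "Waiting for daemon set spec update to be observed..."
	}
	// Not every desired pod has been updated to the current spec.
	if s.UpdatedNumberScheduled < s.DesiredNumberScheduled {
		return "Progressing", fmt.Sprintf(
			"Waiting for daemon set %q rollout to finish: %d out of %d new pods have been updated...",
			name, s.UpdatedNumberScheduled, s.DesiredNumberScheduled)
	}
	// Updated pods exist, but fewer are available than desired.
	if s.NumberAvailable < s.DesiredNumberScheduled {
		return "Progressing", fmt.Sprintf(
			"Waiting for daemon set %q rollout to finish: %d of %d updated pods are available...",
			name, s.NumberAvailable, s.DesiredNumberScheduled)
	}
	return "Healthy", fmt.Sprintf("daemon set %q successfully rolled out", name)
}

func main() {
	name := "opentelemetry-collector-agent"

	// Illustrative counters: 3 desired, 0 available (matches the ArgoCD message).
	state, msg := rolloutState(name, dsStatus{
		Generation: 1, ObservedGeneration: 1,
		DesiredNumberScheduled: 3, UpdatedNumberScheduled: 3, NumberAvailable: 0,
	})
	fmt.Println(state, "-", msg)

	// Illustrative counters: 2 desired, 2 available (matches the kubectl output).
	state, msg = rolloutState(name, dsStatus{
		Generation: 1, ObservedGeneration: 1,
		DesiredNumberScheduled: 2, UpdatedNumberScheduled: 2, NumberAvailable: 2,
	})
	fmt.Println(state, "-", msg)
}
```

If this assumption holds, the two tools can only disagree when they evaluate different counter values, so comparing the live manifest's status field with what `kubectl` sees would be a reasonable next step.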
To Reproduce
Deploy the OpenTelemetry Collector as an application with two node pools on Azure:
Expected behavior
The daemonset should be marked as healthy because it is running two pods, one on each of the spot nodes. The tolerations do not allow the daemonset to run on the system node pool, as the pod does not have the critical addons toleration. Since daemonsets are a built-in Kubernetes resource, I would expect this to work without having to implement a custom health check.
Screenshots
Version