MCM does not reset .status.failedMachines of MachineDeployment
#456
Comments
Duplicate of #476. |
/close |
#476 is about the failed machine metric counter, whereas this issue is about resetting the |
@rfranzke currently the machine-controller-manager/pkg/controller/deployment_machineset_util.go Lines 128 to 135 in e21b931
Otherwise for some of the fail scenarios it can get difficult to catch the failed operations and reasons because of very quick create and delete operations. As a quick experiment, return a random failure here:
In the observation reported in the issue, once the machine joins and the machine deployment becomes healthy (all machines join the cluster), the
Please let me know if any issue has been caused by the current behaviour. We can take corrective measures accordingly. |
I still think this should be improved, i.e., the |
What happened:
MCM does not update the .status.failedMachines of the MachineDeployment after the .status.lastOperation of the Machine changes (e.g., from Failed -> Processing, after the credentials have been fixed). Compare the timestamps:
What you expected to happen:
The .status.failedMachines is properly updated when the .status.lastOperation of a Machine object changes.
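To make the expected behaviour concrete, here is a minimal sketch of recomputing the failed-machine list from the current Machine objects on every status sync, rather than keeping stale entries around. The types `Machine`, `LastOperation`, and `MachineSummary` and the function `failedMachineSummaries` are simplified, hypothetical stand-ins written for this illustration; they are not the actual MCM types or the code in deployment_machineset_util.go, and the state strings are assumed to match the values mentioned above.

```go
package main

import "fmt"

// LastOperation is a hypothetical, trimmed-down stand-in for a machine's
// .status.lastOperation.
type LastOperation struct {
	State       string // assumed values: "Failed", "Processing", "Successful"
	Description string
}

// Machine is a hypothetical, simplified Machine object.
type Machine struct {
	Name          string
	LastOperation LastOperation
}

// MachineSummary is a hypothetical entry of .status.failedMachines.
type MachineSummary struct {
	Name          string
	LastOperation LastOperation
}

// failedMachineSummaries rebuilds the list from scratch on every call, so a
// machine whose last operation is no longer "Failed" drops out automatically.
func failedMachineSummaries(machines []Machine) []MachineSummary {
	failed := []MachineSummary{}
	for _, m := range machines {
		if m.LastOperation.State == "Failed" {
			failed = append(failed, MachineSummary{Name: m.Name, LastOperation: m.LastOperation})
		}
	}
	return failed
}

func main() {
	machines := []Machine{
		{Name: "machine-a", LastOperation: LastOperation{State: "Failed", Description: "invalid credentials"}},
		{Name: "machine-b", LastOperation: LastOperation{State: "Successful"}},
	}
	fmt.Println(len(failedMachineSummaries(machines))) // prints 1

	// Credentials get fixed and the machine moves from Failed to Processing.
	machines[0].LastOperation = LastOperation{State: "Processing"}
	fmt.Println(len(failedMachineSummaries(machines))) // prints 0
}
```

Deriving the list purely from current state loses the history of past failures, which is the debugging concern raised in the comments; keeping that history somewhere other than .status.failedMachines (events or logs, for example) would be one way to reconcile the two goals.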