-
Notifications
You must be signed in to change notification settings - Fork 460
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Regression: PR 2253 causes all-at-once tenant update #2332
Comments
There is no such thing as rolling updates @mmulvanny in our operator. We always employ in-place updates of the container binary and then subsequently statefulset rolls the changes. However the cluster itself must be online way before this, please share the operator logs and let us make the right assessment on what happened. Thanks |
For upgrading to version 6 we have documentation since there are breaking changes: |
closing, please reopen if the docs don't help. |
We performed an upgrade of another instance today and ran into the same issue. We use Flux to manage our Helm releases, but our upgrade steps were equivalent to the Helm upgrade steps on the page @cesnietor linked. We upgraded the tenant's and the operator's Helm charts from 5.0.15 to 6.0.3 simultaneously. Our operator log is here: minio-operator-6.0.3-upgrade.log @harshavardhana we always see exactly the behavior you described where the controller updates the Statefulset and then the pods restart one-by-one. That started to happen in this case too, but then the operator deleted the pods and forced them to start up together. I tried modifying the environment variables of the tenant StatefulSet in our test environment to see if I could get the 6.0.3 operator to delete pods and wasn't able to. Was there a particular condition that would invoke the code path of PR 2253 that I failed to reproduce by doing that? |
@mmulvanny Unfortunately, Kubernetes doesn't allow an STS to restart all pods at once (it can do this for a deployment using the The operator will force all pods to terminate (and thus restart all at once) when it detects that there was a change to any of the environment variables (starting with |
Expected Behavior
Updating a tenant should cause pods to update in a rolling fashion, and the MinIO service should remain available at all times.
Current Behavior
Updating to 6.0.3 causes the operator to delete all tenant pods at once, causing a MinIO outage.
Possible Solution
Steps to Reproduce (for bugs)
Context
We upgraded the MinIO operator from 5.0.15 to 6.0.3.
Regression
This was caused by the combination of PR 2221 (which moved environment configuration to a sidecar) and PR 2253 (which deleted pods on configuration changes). Was PR 2253 intended to remove rolling updates?
Your Environment
minio-operator
): This occurred immediately after an upgrade to 6.0.3.The text was updated successfully, but these errors were encountered: