Add missing requeue to Datacenter decommission #322

burmanm · 2022-04-19T06:45:28Z

What this PR does:
Decommissioning datacenter is missing a requeue after scaling down is done, but the pods haven't been removed yet. This can cause the operator to remove PVCs before Cassandra has properly shutdown (it has 30s wait after decommission).

Which issue(s) this PR fixes:
Fixes #323

Checklist

Changes manually tested
Automated Tests added/updated
Documentation added/updated
CHANGELOG.md updated (not required for documentation PRs)
CLA Signed: DataStax CLA

jsanda · 2022-04-20T03:08:57Z

@burmanm can you please create an issue for this?

jsanda · 2022-04-20T03:14:20Z

pkg/reconciliation/reconcile_datacenter.go

@@ -70,6 +70,8 @@ func (rc *ReconciliationContext) ProcessDeletion() result.ReconcileResult {
 				// Exiting to let other parts of the process take care of the decommission
 				return result.Continue()
 			}
+			// How could we have pods if we've decommissioned everything?
+			return result.RequeueSoon(5)


Are you adding the requeue here to handle the scenario where len(dcs) == 1?

Why would we requeue in that case? If len(dcs) == 1, we go and delete the DC.

It's not obvious to me why the need for the requeue here. Can you explain?

* Add requeue if we still have pods although decommission has succeeded * CHANGELOG (cherry picked from commit addd528)

burmanm added 3 commits April 19, 2022 09:45

Add logging to catch the flakiness of decommission_dc test

beb3baa

Add requeue if we still have pods although decommission has succeeded

bb656cb

Revert logging changes

e105c6e

burmanm changed the title ~~Add logging to catch the flakiness of decommission_dc test~~ Add missing requeue to Datacenter decommission Apr 19, 2022

CHANGELOG

ac692bc

burmanm marked this pull request as ready for review April 19, 2022 12:32

burmanm requested a review from a team as a code owner April 19, 2022 12:32

jsanda reviewed Apr 20, 2022

View reviewed changes

jsanda approved these changes Apr 21, 2022

View reviewed changes

burmanm merged commit addd528 into k8ssandra:master Apr 21, 2022

burmanm added a commit that referenced this pull request May 12, 2022

Add missing requeue to Datacenter decommission (#322)

5f43ae0

* Add requeue if we still have pods although decommission has succeeded * CHANGELOG (cherry picked from commit addd528)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add missing requeue to Datacenter decommission #322

Add missing requeue to Datacenter decommission #322

burmanm commented Apr 19, 2022 •

edited

Loading

jsanda commented Apr 20, 2022

jsanda Apr 20, 2022

burmanm Apr 20, 2022

jsanda Apr 20, 2022

Add missing requeue to Datacenter decommission #322

Add missing requeue to Datacenter decommission #322

Conversation

burmanm commented Apr 19, 2022 • edited Loading

jsanda commented Apr 20, 2022

jsanda Apr 20, 2022

Choose a reason for hiding this comment

burmanm Apr 20, 2022

Choose a reason for hiding this comment

jsanda Apr 20, 2022

Choose a reason for hiding this comment

burmanm commented Apr 19, 2022 •

edited

Loading