✨ Add support for KCP remediation during cluster provisioning #7963
Force-pushed from 18c798f to d461439
controlplane/kubeadm/api/v1beta1/kubeadm_control_plane_types.go
First set of comments
test/infrastructure/docker/internal/controllers/dockermachine_controller.go
Sorry for the review spam :)
Now reviewed everything except the remediation_test.go (anticipating some changes there based on prod-code changes)
Great work on the e2e test!!
Wondering if it makes sense to move the e2e test into a separate PR that we can merge through independently.
I guess most of the test should work against main as well (of course not remediating the first machine).
Might just be nice to keep both PRs and reviews more focused and to have some sort of "remediation worked before the change and after" confirmation.
Force-pushed from 1d2d1a3 to 17e2676
@vincepri @sbueringer Please note that:
/retest
/test pull-cluster-api-e2e-full-main
internal/controllers/machinehealthcheck/machinehealthcheck_targets.go
Force-pushed from c71c4f3 to 980dde5
/test pull-cluster-api-e2e-full-main
@fabriziopandini I'm pretty sure the e2e test doesn't work on Linux. I'm working on a fix that works at least on my machine. I'll open a PR to verify whether it works in Prow as well.
PR is open: #8075. Let's see if that works. I also improved the token creation.
first part of next round of review
(everything except internal/controllers)
@fabriziopandini Opened a PR in your repo (fabriziopandini#55) which merges #8075 into the current PR
just fyi, I'll address my own and other findings and then merge the fixes into this PR via #8075
Co-authored-by: sbueringer <[email protected]>
Force-pushed from 980dde5 to e98e2de
@sbueringer @vincepri @jackfrancis this is ready for a final pass
/lgtm
LGTM label has been added. Git tree hash: 18d61cd0fc14f15b5e8f38bfd4e1c26871e678d8
starting lazy consensus from community meeting, 1 week from today (target Feb 22)
/test pull-cluster-api-e2e-full-main
Unrelated flake
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: sbueringer
/retest
What this PR does / why we need it:
This PR amends KCP remediation so that failures occurring while the control plane (CP) is being provisioned can be remediated. This covers both the first CP machine and additional CP machines created while current replicas < desired replicas. It also introduces opt-in control of remediation retry behavior.
#7855 is the amendment to the KCP proposal describing those changes
Which issue(s) this PR fixes:
Fixes #7496