UPSTREAM: 62152: Keep node.kubeconfig correct during rotation #19857

Merged
merged 1 commit into from
May 30, 2018


smarterclayton
Contributor

The node.kubeconfig created by bootstrap will not work after the first
rotation - write bootstrap credentials into the cert store in order to
guarantee that rotation preserves that config.

Without this, SDN, the sync pod, and a few other node-level actions will stop working when the pod gets rotated.

@liggitt

@openshift-ci-robot openshift-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label May 27, 2018
@smarterclayton smarterclayton added this to the v3.10 milestone May 27, 2018
@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 27, 2018
@liggitt
Contributor

liggitt commented May 27, 2018

Do we test actual rotation in any of our CI tests?

@smarterclayton
Contributor Author

All the GCP tests run with rotation enabled at a 15-minute interval, which means they rotate about 2 times per run on average. If rotation is broken, GCP e2e should break ~10-30% of the time. We don't have a rotation e2e, but we need something upstream that forces and verifies rotation.

@smarterclayton
Contributor Author

/retest

@smarterclayton
Contributor Author

I think this also fixes a "kubelet bypasses bootstrapping when the last bootstrap cert expires" bug, because we only ever read the bootstrap cert from the store (it will be nil if the cert has expired), and if it's nil we ask for a new cert. I hit this on api.ci tonight when I accidentally killed the autoapprover for > 15m, which caused a node's bootstrap cert to expire. I had to clear the certs in order to proceed. With this fix we should have cleared it ourselves.

@smarterclayton
Contributor Author

smarterclayton commented May 29, 2018 via email

@liggitt without this we can’t have rotation

@liggitt
Contributor

liggitt commented May 30, 2018

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label May 30, 2018
@openshift-ci-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: liggitt, smarterclayton


@openshift-merge-robot openshift-merge-robot merged commit c1a18ca into openshift:master May 30, 2018