
Automate updates for k8s.io redirector service #176

Closed
ixdy opened this issue Feb 6, 2019 · 28 comments
Labels
area/infra Infrastructure management, infrastructure design, code in infra/
priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete.

Comments

@ixdy
Member

ixdy commented Feb 6, 2019

Currently, changes to the k8s.io redirector service (i.e. any changes to the configs under the k8s.io subdirectory) require @ixdy or @thockin to manually update the cluster, roughly following this process:

cd k8s.io/
kubectl -n k8s-io-canary apply -f configmap-nginx.yaml
# pick up new configs by forcing nginx to restart
kubectl -n k8s-io-canary scale deployment k8s-io --replicas=0
kubectl -n k8s-io-canary scale deployment k8s-io --replicas=1
TARGET_IP=[canary namespace service IP] make test

# if tests pass, deploy to production
kubectl -n k8s-io-prod apply -f configmap-nginx.yaml
# pick up new configs by forcing nginx to restart
kubectl -n k8s-io-prod scale deployment k8s-io --replicas=0
# note we scale back to 2, not 1
kubectl -n k8s-io-prod scale deployment k8s-io --replicas=2
# verify everything on prod
make test

There are lots of steps to automate here:

  • restarting nginx manually
  • testing on the canary namespace before updating the prod namespace
    • ideally we'd test even before merge, which is currently a manual process
  • requiring a human to deploy changes (rather than automatically updating on merge)
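
As a rough illustration, here is a minimal sketch of the same canary-then-prod flow scripted end to end, assuming kubectl 1.15+ (for kubectl rollout restart) and assuming the canary Service is also named k8s-io (both are assumptions, not confirmed by the manifests):

cd k8s.io/

# canary first; a rolling restart replaces the scale-to-0/scale-up dance
kubectl -n k8s-io-canary apply -f configmap-nginx.yaml
kubectl -n k8s-io-canary rollout restart deployment/k8s-io
kubectl -n k8s-io-canary rollout status deployment/k8s-io
TARGET_IP="$(kubectl -n k8s-io-canary get svc k8s-io -o jsonpath='{.spec.clusterIP}')" make test

# then production; the rolling restart preserves the 2 replicas
kubectl -n k8s-io-prod apply -f configmap-nginx.yaml
kubectl -n k8s-io-prod rollout restart deployment/k8s-io
kubectl -n k8s-io-prod rollout status deployment/k8s-io
make test
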
@bartsmykla
Contributor

Hi @ixdy, what do you think about me helping with that? Can you give me some hints, or point me to people who could give me more information? :-)

@ixdy
Member Author

ixdy commented Feb 26, 2019

Take one of the steps I listed above and figure out some solution? e.g. figure out how to automatically update nginx when pushing a new configmap. (There are several approaches with different tradeoffs, and nobody's taken the time to figure out what makes the most sense here.)

@bartsmykla
Contributor

bartsmykla commented Feb 27, 2019

In one of my past projects we used the approach of creating a sidecar container that watched for changes under specified paths; when they appeared, it read the ConfigMap, parsed it, and sent it to a specified endpoint. Here it would be a bit different, though.

One of the approaches would be to use a feature that is currently in beta: Share Process Namespace between Containers in a Pod. We would create a sidecar container that watches for ConfigMap changes and, when they appear, sends a HUP signal to nginx.

It's good because the logic for watching changes and sending the appropriate signal lives in a separate place, and we don't touch the base nginx image. A problem appears if we don't want to rely on a beta feature, or if our Kubernetes cluster is older than 1.13.
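
A rough sketch of what the watcher loop in such a sidecar could look like, assuming the pod sets shareProcessNamespace: true, the ConfigMap is mounted at /etc/nginx/conf.d, and inotify-tools is available in the sidecar image (illustrative only, not an actual implementation):

#!/bin/sh
# Watch the mounted ConfigMap directory; ConfigMap updates arrive as a
# symlink swap, so create/move events in the directory are the useful trigger.
while inotifywait -e create -e modify -e moved_to -e delete /etc/nginx/conf.d; do
  sleep 1  # crude debounce: one ConfigMap update produces several events
  # With a shared process namespace the nginx master is visible here;
  # -o picks the oldest matching process, i.e. the master.
  pkill -HUP -o nginx || echo "nginx master process not found" >&2
done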

Another approach would be to run two binaries/scripts inside one container: one for nginx and a second for our watcher, which would send a HUP signal to the nginx process when changes appear.
Cons:

I actually succeeded in creating a solution using https://github.com/ochinchina/supervisord and https://github.com/fsnotify/fsnotify, so I can clean it up and push it somewhere to test.

If it were possible, I think I would choose the first option, but I don't have experience with that feature, so I may be missing some downsides.

@bartsmykla
Contributor

@ixdy I have created a simple Go app using https://kubernetes.io/docs/tasks/configure-pod-container/share-process-namespace/ which can be added as a sidecar container. I still have to figure out how to test it, but we can play with it here: https://github.com/bartsmykla/nginx-reloader
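
If it helps with testing, one way to try it out in the canary namespace could be a strategic-merge patch that turns on the shared process namespace and adds the reloader as an extra container; the image and the volume name below are placeholders and would have to match the real manifest:

# sketch only: image tag and volume name are hypothetical
kubectl -n k8s-io-canary patch deployment k8s-io --patch '
spec:
  template:
    spec:
      shareProcessNamespace: true   # lets the sidecar signal the nginx process
      containers:
      - name: nginx-reloader        # appended by name via strategic merge
        image: example.invalid/nginx-reloader:dev   # placeholder image
        volumeMounts:
        - name: nginx-config        # placeholder: must match the existing ConfigMap volume
          mountPath: /etc/nginx/conf.d
          readOnly: true
'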

@bartsmykla
Contributor

Hi @ixdy, have you maybe had some time to look at my suggestions?

@dims
Member

dims commented Mar 12, 2019

@ixdy WDYT? ^

@ixdy
Member Author

ixdy commented Mar 15, 2019

@bartsmykla I also think I prefer the sidecar option, since that seems like a cleaner, more generalizable pattern, though I don't think GKE supports kubernetes 1.13 yet.

Do you have access to a kubernetes cluster for testing? You could try updating the manifests in the k8s.io directory of this repo to try out your approach.

@ixdy
Member Author

ixdy commented Mar 22, 2019

Though, a counterpoint: it'd be nice to take advantage of the fact that we're using a Deployment right now, so we should perform a rolling update of the config (in case a bad config causes nginx to crash).

I've seen patterns where the ConfigMap containing the nginx config is somehow munged to work with a Deployment rolling update - maybe something like this?

(I think I've seen similar but different patterns elsewhere, but I'm not immediately finding them now.)
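
For concreteness, one widely used variant of that pattern (popularized by Helm charts) is to hash the config and stamp it onto the pod template as an annotation, so that a config change rolls the Deployment through its normal update strategy; the annotation key here is illustrative:

# compute a hash of the config and record it on the pod template
CONFIG_SHA="$(sha256sum configmap-nginx.yaml | cut -d' ' -f1)"
kubectl -n k8s-io-canary apply -f configmap-nginx.yaml
kubectl -n k8s-io-canary patch deployment k8s-io --patch \
  "{\"spec\":{\"template\":{\"metadata\":{\"annotations\":{\"checksum/nginx-config\":\"${CONFIG_SHA}\"}}}}}"
# the changed annotation triggers a rolling update, so a broken config surfaces
# as new pods failing rather than every replica restarting at once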

@bartsmykla
Contributor

Let me dig into it a little bit more

@spiffxp spiffxp added this to the migrate-low-risk milestone Apr 30, 2019
@spiffxp spiffxp added the area/infra Infrastructure management, infrastructure design, code in infra/ label May 1, 2019
@thockin thockin added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Jul 8, 2019
@dims
Member

dims commented Jul 24, 2019

pending getting a cluster up

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 22, 2019
@stp-ip
Member

stp-ip commented Oct 22, 2019

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 22, 2019
@stp-ip
Member

stp-ip commented Jan 8, 2020

/assign

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 7, 2020
@stp-ip
Member

stp-ip commented Apr 8, 2020

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 8, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 7, 2020
@stp-ip
Member

stp-ip commented Jul 7, 2020

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 7, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 5, 2020
@stp-ip
Member

stp-ip commented Oct 5, 2020

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 5, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 3, 2021
@spiffxp spiffxp removed this from the migrate-low-risk milestone Jan 13, 2021
@spiffxp
Member

spiffxp commented Jan 22, 2021

/remove-priority backlog
/priority important-longterm
/milestone v1.21

Since we're going to want to make changes to dl.k8s.io as part of #1569, now might be the right time to re-examine how this is deployed/managed

@k8s-ci-robot k8s-ci-robot added priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. and removed priority/backlog Higher priority than priority/awaiting-more-evidence. labels Jan 22, 2021
@k8s-ci-robot k8s-ci-robot added this to the v1.21 milestone Jan 22, 2021
@spiffxp
Member

spiffxp commented Jan 22, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 22, 2021
@spiffxp
Member

spiffxp commented Mar 16, 2021

FYI @ameukam @nikhita

@spiffxp
Member

spiffxp commented Apr 15, 2021

/milestone v1.22

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.21, v1.22 Apr 15, 2021
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 14, 2021
@spiffxp
Member

spiffxp commented Jul 15, 2021

/remove-lifecycle stale
Still relevant. We are close with the deploy.sh script, but it's not yet as simple as running it once with no parameters. It's not that far off from something an appropriately privileged prowjob could run, though.

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 15, 2021
@spiffxp
Member

spiffxp commented Jul 30, 2021

/close
This was accomplished by kubernetes/test-infra#22970 as part of #2151

@k8s-ci-robot
Contributor

@spiffxp: Closing this issue.

In response to this:

/close
This was accomplished by kubernetes/test-infra#22970 as part of #2151

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
