[ws-manager] Wait for workspace pod to be ready #8258

Closed · wants to merge 4 commits

Conversation

@sagor999 (Contributor) commented on Feb 16, 2022

Description

Make sure to wait for the container to start up, in case of out-of-memory or other errors.
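
For readers skimming the PR, here is a minimal, hedged sketch of the general pattern (polling the workspace pod with client-go and k8s.io/apimachinery's wait package until it reports Ready). This is not the PR's actual code; the function name, namespace, and pod name are illustrative assumptions.

```go
// Illustrative sketch only, not the ws-manager implementation: poll the
// workspace pod with an exponential backoff until it reports Ready, so that
// transient startup failures (e.g. an OOM-killed container) turn into
// retries instead of an immediate error.
package example

import (
	"context"
	"time"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/wait"
	"k8s.io/client-go/kubernetes"
)

// waitForPodReady is a hypothetical helper; namespace and podName are placeholders.
func waitForPodReady(ctx context.Context, client kubernetes.Interface, namespace, podName string) error {
	backoff := wait.Backoff{
		Steps:    10,
		Duration: 100 * time.Millisecond,
		Factor:   2.5,
		Jitter:   0.1,
	}
	return wait.ExponentialBackoff(backoff, func() (bool, error) {
		pod, err := client.CoreV1().Pods(namespace).Get(ctx, podName, metav1.GetOptions{})
		if err != nil {
			// Treat transient API errors as "not ready yet" and keep retrying.
			return false, nil
		}
		for _, cond := range pod.Status.Conditions {
			if cond.Type == corev1.PodReady && cond.Status == corev1.ConditionTrue {
				return true, nil
			}
		}
		return false, nil
	})
}
```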

Related Issue(s)

Fixes #8253

How to test

Spin up a new cluster in the workspace preview environment:
./new-vm.sh -v aledbf-wait.10 -z us-west1-c
then try to start workspaces in it.

Release Notes

Improve handling of "Out of Memory" error when starting up workspaces

Documentation

@sagor999 (Contributor, Author) commented on Feb 17, 2022

/werft with-clean-slate-deployment

👎 unknown command: with-clean-slate-deployment
Use /werft help to list the available commands

@sagor999 (Contributor, Author) commented on Feb 17, 2022

/werft help

👍 You can interact with werft using: /werft command <args>.
Available commands are:

  • /werft run [annotation=value] which starts a new werft job from this context.
    You can optionally pass multiple whitespace-separated annotations.
  • /werft help displays this help

@sagor999 (Contributor, Author) commented on Feb 17, 2022

/werft run

👍 started the job as gitpod-build-aledbf-wait.10

sagor999 marked this pull request as ready for review on February 17, 2022 01:53
sagor999 requested a review from a team on February 17, 2022 01:53
The github-actions bot added the label team: workspace (Issue belongs to the Workspace team) on Feb 17, 2022
@csweichel (Contributor) commented:

This would help with the problem we're trying to solve, but it would most likely degrade the feedback mechanism towards the user by all but removing the Pending phase. I.e. users would now see a very long "preparing workspace" phase until they finally get redirected.

How much of a concern that is, we'd see in practice.
Certainly this is another bit of motivation to move towards a CRD-based solution.

@sagor999 (Contributor, Author) commented:

I reduced the exponential backoff factor, as otherwise, in the worst-case scenario, it would have been waiting there for hours to retry.

@@ -199,7 +199,7 @@ func (m *Manager) StartWorkspace(ctx context.Context, req *api.StartWorkspaceReq
     backoff := wait.Backoff{
         Steps:    10,
         Duration: 100 * time.Millisecond,
-        Factor:   5.0,
+        Factor:   2.5,
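
For a sense of the worst case mentioned above, here is a small stand-alone sketch (a back-of-the-envelope assumption: each retry delay is simply the previous one multiplied by Factor, with no jitter and no cap). Under that assumption, with Factor 5.0 the ten steps add up to roughly 68 hours of waiting, while 2.5 brings the total down to around ten and a half minutes.

```go
// Rough worst-case arithmetic only; assumes each delay is the previous one
// multiplied by Factor, ignoring jitter and any cap.
package main

import (
	"fmt"
	"time"
)

func totalWait(steps int, initial time.Duration, factor float64) time.Duration {
	total, d := time.Duration(0), float64(initial)
	for i := 0; i < steps; i++ {
		total += time.Duration(d)
		d *= factor
	}
	return total
}

func main() {
	fmt.Println("Factor 5.0:", totalWait(10, 100*time.Millisecond, 5.0)) // ≈ 67h49m
	fmt.Println("Factor 2.5:", totalWait(10, 100*time.Millisecond, 2.5)) // ≈ 10m36s
}
```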
A reviewer (Contributor) commented:

Would it make sense to leverage the Cap property for backoff? I believe its intent is to ensure the duration doesn't exceed a certain time limit.

@sagor999 (Contributor, Author) replied:

Yeah, we could do that as well. What would be a reasonable cap? 5 min?

The reviewer (Contributor) replied:

I think 5 minutes will do.

We can always bring it down if we need to, but this way it's a predictable experience.

Without accounting for sleep (and jitter), that would get users to iteration 8 (but not beyond) here: https://go.dev/play/p/dtUxS6TwTrw
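
As a rough sketch of that suggestion (not the eventual implementation; the 5-minute figure is the one discussed above), setting Cap on the same wait.Backoff bounds how large any single delay can grow. The loop below just prints the successive delays Step() would produce:

```go
// Sketch only: the backoff from the diff above, with a 5-minute Cap added.
// Step() returns the next delay; Cap keeps any single delay from exceeding
// 5 minutes.
package main

import (
	"fmt"
	"time"

	"k8s.io/apimachinery/pkg/util/wait"
)

func main() {
	backoff := wait.Backoff{
		Steps:    10,
		Duration: 100 * time.Millisecond,
		Factor:   2.5,
		Cap:      5 * time.Minute,
	}
	total := time.Duration(0)
	for i := 1; backoff.Steps > 0; i++ {
		d := backoff.Step()
		total += d
		fmt.Printf("iteration %d: delay %v (cumulative %v)\n", i, d, total)
	}
}
```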

@sagor999 (Contributor, Author) commented:

Continuing the work in a new PR (due to werft issues in this one): #8289

sagor999 closed this on Feb 17, 2022
aledbf deleted the aledbf/wait branch on August 13, 2022 13:03
Labels: release-note, size/M, team: workspace (Issue belongs to the Workspace team)
Development: successfully merging this pull request may close the following issue:
Starting new workspace causes it to fail with OOM error from kubelet
7 participants