Unable to ignore FailedScheduling event #1046

AObuchow · 2023-02-16T21:25:29Z

Description

#978 added an extra check for the PodUnschedulable condition, to cover cases where pod events could not be relied on to report problems with the workspace deployment. That fix was made obsolete by 255d699.

However, if you add the FailedScheduling event to the list of ignoredUnrecoverableEvents in the DWOC, the PodUnschedulable condition is still caught and the workspace fails.
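
To make the mismatch concrete, here is a minimal Go sketch of the two code paths as described above. It is not the operator's actual code: `podSignal`, `failsObserved`, and `failsExpected` are hypothetical names, and the model assumes every listed event reason is unrecoverable unless ignored.

```go
package main

import "fmt"

// podSignal is a hypothetical, simplified view of a workspace pod.
type podSignal struct {
	EventReasons  []string // reasons of events seen for the pod, e.g. "FailedScheduling"
	Unschedulable bool     // assumed: the pod carries the unschedulable condition
}

// contains reports whether s is present in list.
func contains(list []string, s string) bool {
	for _, item := range list {
		if item == s {
			return true
		}
	}
	return false
}

// hasUnignoredEvent reports whether any event reason is missing from the ignore list.
// Assumption: all reasons passed in are otherwise treated as unrecoverable.
func hasUnignoredEvent(sig podSignal, ignored []string) bool {
	for _, reason := range sig.EventReasons {
		if !contains(ignored, reason) {
			return true
		}
	}
	return false
}

// failsObserved models the reported behavior: the condition check
// never consults the ignore list, so the workspace fails regardless.
func failsObserved(sig podSignal, ignored []string) bool {
	if sig.Unschedulable {
		return true
	}
	return hasUnignoredEvent(sig, ignored)
}

// failsExpected models the expected behavior: only events that are
// not ignored fail the workspace; otherwise startup runs until timeout.
func failsExpected(sig podSignal, ignored []string) bool {
	return hasUnignoredEvent(sig, ignored)
}

func main() {
	sig := podSignal{EventReasons: []string{"FailedScheduling"}, Unschedulable: true}
	ignored := []string{"FailedScheduling"}
	fmt.Println("observed fails:", failsObserved(sig, ignored))
	fmt.Println("expected fails:", failsExpected(sig, ignored))
}
```

In this model, ignoring FailedScheduling only changes the outcome once the separate condition check is removed or made to honor the same ignore list.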

How To Reproduce

Add FailedScheduling to the ignoredUnrecoverableEvents in the DWOC:

apiVersion: controller.devfile.io/v1alpha1
config:
  routing:
    clusterHostSuffix: 192.168.39.246.nip.io
    defaultRoutingClass: basic
  workspace:
+    ignoredUnrecoverableEvents:
+    - FailedScheduling
    imagePullPolicy: Always

Start a workspace that causes a FailedScheduling event, e.g. by requesting more CPU than the cluster can provide:

kind: DevWorkspace
apiVersion: workspace.devfile.io/v1alpha2
metadata:
  name: theia-next-high-cpu
spec:
  started: true
  template:
    projects:
      - name: web-nodejs-sample
        git:
          remotes:
            origin: "https://github.com/che-samples/web-nodejs-sample.git"
    components:
      - name: theia
        plugin:
          uri: https://che-plugin-registry-main.surge.sh/v3/plugins/eclipse/che-theia/latest/devfile.yaml
          components:
            - name: theia-ide
              container:
                env:
                  - name: THEIA_HOST
                    value: 0.0.0.0
                memoryRequest: 2Gi
                memoryLimit: 16Gi
                cpuRequest: 4000m
                cpuLimit: 8000m
    commands:
      - id: say-hello
        exec:
          component: theia-ide
          commandLine: echo "Hello from $(pwd)"
          workingDir: ${PROJECTS_ROOT}/project/app

See that the workspace still fails, stating the pod is unschedulable:

$ kubectl get dw -n $NAMESPACE -w
NAME                  DEVWORKSPACE ID             PHASE      INFO
theia-next-high-cpu   workspace544c789045e040d0   Failed   Pod is unschedulable: 0/1 nodes are available: 1 Insufficient cpu. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod.

Expected behavior

The workspace should not fail immediately; it should time out instead.

AObuchow self-assigned this Feb 16, 2023

AObuchow mentioned this issue Feb 16, 2023

Workspace pod FailedScheduling event not being properly caught #977

Closed

AObuchow added a commit to AObuchow/devworkspace-operator that referenced this issue Feb 17, 2023

Revert "fix: check pods for unschedulable condition"

f5fada3

This reverts commit 0cad9a0. Fix devfile#1046

AObuchow mentioned this issue Feb 17, 2023

Revert "fix: check pods for unschedulable condition" (#978) #1047

Merged

3 tasks

AObuchow added a commit to AObuchow/devworkspace-operator that referenced this issue Feb 17, 2023

Revert "fix: check pods for unschedulable condition"

74b0bf8

This reverts commit 0cad9a0. Fix devfile#1046 Signed-off-by: Andrew Obuchowicz <[email protected]>

AObuchow closed this as completed in 4aa1755 Feb 21, 2023

amisevsk mentioned this issue Feb 22, 2023

Backport fixes for #1049 and #1046 to 0.19.x branch #1052

Merged

3 tasks

amisevsk pushed a commit that referenced this issue Feb 22, 2023

Revert "fix: check pods for unschedulable condition"

874a328

This reverts commit 0cad9a0. Fix #1046 Signed-off-by: Andrew Obuchowicz <[email protected]> (cherry picked from commit 4aa1755)

amisevsk added this to the v0.20.x milestone Apr 21, 2023
