Describe the bug
In an unplanned node-down plus node-removal scenario (with possible destructive force-deletion of pods and their metadata), all mount points related to the PVCs on that node, including the k8s device global-path (staging) mount points, are also cleaned up. After the node is powered on again and re-added to the k8s cluster, when OBS pods with PVCs are initialized, kubelet on this node directly issues a NodePublishVolume CSI call that is inconsistent with the volumes' real status, skipping the required successful NodeStageVolume call that mounts the real device path onto the volume's k8s device global path. As a result, the CSI volumes become stuck in Failed status. This behavior violates the CSI spec requirement: https://github.com/container-storage-interface/spec/blob/master/spec.md#nodestagevolume
However, until the k8s community fixes this kubelet issue, I think we need to consider changing code on the CSI side to provide a workaround for the case caused by the kubelet issue.
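To illustrate one possible CSI-side workaround, here is a minimal sketch (not the actual csi-baremetal implementation; the `nodeService` receiver and the `stageVolume` helper are hypothetical). Before publishing, the node plugin verifies that the staging (global) path is really a mount point; if it is not, it can either fail with FAILED_PRECONDITION so kubelet is forced back through NodeStageVolume, or re-run the staging logic itself:

```go
package node

import (
	"context"

	"github.com/container-storage-interface/spec/lib/go/csi"
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
	mount "k8s.io/mount-utils"
)

func (s *nodeService) NodePublishVolume(ctx context.Context, req *csi.NodePublishVolumeRequest) (*csi.NodePublishVolumeResponse, error) {
	stagingPath := req.GetStagingTargetPath()

	// Check whether the global (staging) path is actually a mount point before
	// proceeding. After an unplanned node reset the directory may still exist
	// while the device mount is gone, which is exactly the situation above.
	mounter := mount.New("")
	notMnt, err := mounter.IsLikelyNotMountPoint(stagingPath)
	if err != nil {
		return nil, status.Errorf(codes.Internal, "failed to inspect staging path %s: %v", stagingPath, err)
	}
	if notMnt {
		// Option A (spec-compliant): report the inconsistency so kubelet must
		// repeat NodeStageVolume before publishing.
		return nil, status.Errorf(codes.FailedPrecondition,
			"volume %s is not staged at %s; NodeStageVolume must succeed first", req.GetVolumeId(), stagingPath)

		// Option B (workaround): re-run the staging logic here before publishing,
		// e.g. s.stageVolume(ctx, req.GetVolumeId(), stagingPath) -- hypothetical helper.
	}

	// ... continue with the normal bind-mount of stagingPath onto req.GetTargetPath() ...
	return &csi.NodePublishVolumeResponse{}, nil
}
```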
Environment (please complete the following information):
RKE2
To Reproduce
Power off the node unexpectedly, remove the node from the cluster with forceful deletion of its pods and their metadata, then power the node back on and re-add it to the cluster.
Expected behavior
When pods with PVCs are initialized, for each volume kubelet should issue the NodeStageVolume CSI call and, only after NodeStageVolume completes successfully, issue the NodePublishVolume CSI call.
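A minimal sketch of that spec-required ordering from the CO's (kubelet's) point of view, using the CSI Go bindings; the socket path, volume ID, filesystem type, and mount paths below are placeholders, not values from this cluster:

```go
package main

import (
	"context"

	"github.com/container-storage-interface/spec/lib/go/csi"
	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"
)

func main() {
	conn, err := grpc.Dial("unix:///var/lib/kubelet/plugins/<driver>/csi.sock",
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		panic(err)
	}
	defer conn.Close()
	client := csi.NewNodeClient(conn)
	ctx := context.Background()

	stagingPath := "/var/lib/kubelet/plugins/kubernetes.io/csi/<driver>/<hash>/globalmount"
	volCap := &csi.VolumeCapability{
		AccessType: &csi.VolumeCapability_Mount{Mount: &csi.VolumeCapability_MountVolume{FsType: "ext4"}},
		AccessMode: &csi.VolumeCapability_AccessMode{Mode: csi.VolumeCapability_AccessMode_SINGLE_NODE_WRITER},
	}

	// 1. Stage: mount the real device onto the global (staging) path.
	if _, err := client.NodeStageVolume(ctx, &csi.NodeStageVolumeRequest{
		VolumeId:          "pvc-123",
		StagingTargetPath: stagingPath,
		VolumeCapability:  volCap,
	}); err != nil {
		panic(err) // publish must not be attempted if staging failed
	}

	// 2. Publish: bind-mount the staged path into the pod's target path.
	if _, err := client.NodePublishVolume(ctx, &csi.NodePublishVolumeRequest{
		VolumeId:          "pvc-123",
		StagingTargetPath: stagingPath,
		TargetPath:        "/var/lib/kubelet/pods/<pod-uid>/volumes/kubernetes.io~csi/pvc-123/mount",
		VolumeCapability:  volCap,
	}); err != nil {
		panic(err)
	}
}
```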
Screenshots
If applicable, add screenshots to help explain your problem.
Additional context
Add any other context about the problem here.