Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894

adrianreber · 2023-05-08T15:07:10Z

What happened?

This is mainly for tracking.

Although restoring a container in Kubernetes with cgroup v2 works the container will be immediately killed by Kubernetes as CRIU will restore the container in the old cgroup.

Outside of Kubernetes the behaviour cannot be seen as it seems only Kubernetes kills processes in unknown cgroups.

CRIU stores information about the cgroup during checkpointing and restores that information. I am not sure why this error cannot be seen with cgroup v1.

Fortunately runc has a fix for this problem in the merged PR opencontainers/runc#3546.

Unfortunately this fix has not made its way into one of the existing runc releases, yet.

@kolyshkin any ideas when the ignore setting for cgroups will appear in a runc release?

The text was updated successfully, but these errors were encountered:

github-actions · 2023-06-08T00:02:25Z

A friendly reminder that this issue had no activity for 30 days.

github-actions · 2023-09-06T00:02:32Z

Closing this issue since it had no activity in the past 90 days.

WhaleSpring · 2023-10-07T07:20:24Z

Hi! Adrianreber! @adrianreber
I meet the same problem just like MaxFuhrich @MaxFuhrich , the process that creates the problem is the same and I alse refer to the blog https://martinheinz.dev/blog/85 .The process that creates the problem is the same.
The difference is that I use centos7 and cgroupv1 .

As a result , it doesn't seem to be problem of cgroupv1.
And I can restore a pod at one of my nodes but others not to restore with problem as follows just like MaxFuhrich:

And I create a issue #7349 to describe my problem before see this one.

adrianreber added the kind/bug Categorizes issue or PR as related to a bug. label May 8, 2023

github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 8, 2023

adrianreber mentioned this issue Jun 27, 2023

Error on restoring customized mysql application via Kubernetes C/R #6972

Closed

github-actions bot added the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Sep 6, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894

Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894

adrianreber commented May 8, 2023 •

edited

Loading

github-actions bot commented Jun 8, 2023

github-actions bot commented Sep 6, 2023

WhaleSpring commented Oct 7, 2023 •

edited

Loading

Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894

Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894

Comments

adrianreber commented May 8, 2023 • edited Loading

What happened?

github-actions bot commented Jun 8, 2023

github-actions bot commented Sep 6, 2023

WhaleSpring commented Oct 7, 2023 • edited Loading

adrianreber commented May 8, 2023 •

edited

Loading

WhaleSpring commented Oct 7, 2023 •

edited

Loading