Checkpoint/Restore not working with cgroup v2 and Kubernetes #6894
Labels
kind/bug
Categorizes issue or PR as related to a bug.
lifecycle/rotten
Denotes an issue or PR that has aged beyond stale and will be auto-closed.
lifecycle/stale
Denotes an issue or PR has remained open with no activity and has become stale.
What happened?
This is mainly for tracking.
Although restoring a container in Kubernetes with cgroup v2 works the container will be immediately killed by Kubernetes as CRIU will restore the container in the old cgroup.
Outside of Kubernetes the behaviour cannot be seen as it seems only Kubernetes kills processes in unknown cgroups.
CRIU stores information about the cgroup during checkpointing and restores that information. I am not sure why this error cannot be seen with cgroup v1.
Fortunately runc has a fix for this problem in the merged PR opencontainers/runc#3546.
Unfortunately this fix has not made its way into one of the existing runc releases, yet.
@kolyshkin any ideas when the
ignore
setting for cgroups will appear in a runc release?The text was updated successfully, but these errors were encountered: