Snapshot data movement restore does not fully work with StorageClass with binding mode WaitForFirstConsumer #7561
Comments
This is by design.
@Elias-elastisys Also, Velero snapshot data movement has not been tested against the case of backing up volumes without a pod, so that case is not officially supported.
Alright thanks, unfortunate for my test case but it makes sense.
The goal is to truly back up all data in a cluster. If you run CronJobs or Jobs with PVs, a backup may run while the Job is not running, and in that case regular Velero backups will not capture that data. As I said, CSI data movement seems able to back up data without Pods, even with WaitForFirstConsumer, since the PV already exists. The only issue is that the restore requires manual intervention, which I guess is not the biggest problem, since it succeeds if you apply the Pod manually. But it would of course be greatly appreciated if this edge case could be handled as well.
We had a discussion about this issue, here is the conclusion:
Add some test results here.
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 14 days. If a Velero team member has requested log or more information, please provide the output of the shared commands.
#8044 has been opened as an enhancement request for this case.
What steps did you take and what happened:
I've been trying out Volume Snapshot Data Movement for backups and restores to S3, specifically in order to back up PVs that have no running Pod currently associated with them, as this is not possible with regular Velero backups.
When restoring a successful backup in a cluster whose StorageClass has binding mode WaitForFirstConsumer, the restore times out: Velero waits until a PV is provisioned before it creates the helper pod that facilitates the restore, but the storage provider waits until a Pod uses the PVC before it provisions storage. The restore therefore essentially deadlocks.
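The circular wait described above can be illustrated with a toy model. This is only a sketch of the behaviour as reported in this issue, not Velero's or any CSI driver's actual code; `ClusterState`, `step`, and `restore_completes` are hypothetical names invented for the illustration.

```python
from dataclasses import dataclass


@dataclass
class ClusterState:
    # "Immediate" or "WaitForFirstConsumer" (the StorageClass volumeBindingMode)
    binding_mode: str
    consumer_pod_exists: bool = False   # a Pod that mounts the restored PVC
    pv_provisioned: bool = False
    helper_pod_created: bool = False    # stands in for the restore helper pod


def step(state: ClusterState) -> bool:
    """Advance the toy model one round; return True if anything changed."""
    changed = False
    # Provisioner: acts at once in Immediate mode, otherwise only when a
    # consumer Pod exists.
    if not state.pv_provisioned and (
        state.binding_mode == "Immediate" or state.consumer_pod_exists
    ):
        state.pv_provisioned = True
        changed = True
    # Restore side, as described in this report: the helper pod is created
    # only after the PV has been provisioned.
    if state.pv_provisioned and not state.helper_pod_created:
        state.helper_pod_created = True
        changed = True
    return changed


def restore_completes(state: ClusterState, max_rounds: int = 10) -> bool:
    """Run rounds until nothing changes; report whether the helper pod ran."""
    for _ in range(max_rounds):
        if not step(state):
            break
    return state.helper_pod_created
```

In this model, `restore_completes(ClusterState("WaitForFirstConsumer"))` makes no progress, while setting `consumer_pod_exists=True` lets it finish, which matches the manual workaround.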
If I manually create a Pod that attaches to the restored PVC the restore eventually succeeds.
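For reference, a minimal sketch of such a Pod, generated as a JSON manifest. The PVC name, namespace, pod name, mount path, and the choice of the `pause` image are all assumptions for illustration; substitute the values of the restored PVC.

```python
import json


def consumer_pod_manifest(pvc_name, namespace, pod_name="pvc-consumer"):
    """Build a Pod manifest whose only job is to mount the given PVC.

    All names here are placeholders, not values taken from the backup.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": pod_name, "namespace": namespace},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": "pause",
                    # Assumed image: any container that stays running would do.
                    "image": "registry.k8s.io/pause:3.9",
                    "volumeMounts": [{"name": "data", "mountPath": "/data"}],
                }
            ],
            "volumes": [
                {
                    "name": "data",
                    "persistentVolumeClaim": {"claimName": pvc_name},
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical PVC name and namespace.
    print(json.dumps(consumer_pod_manifest("restored-pvc", "default"), indent=2))
```

The output can be piped to `kubectl apply -f -`, since kubectl accepts JSON manifests. Once the restore has finished, the Pod can be deleted.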
What did you expect to happen:
To be able to restore a backup successfully, even if the backup contains PV/PVCs without any attached Pod.
The following information will help us better understand what's going on:
If you are using velero v1.7.0+:
Please use
velero debug --backup <backupname> --restore <restorename>
to generate the support bundle and attach it to this issue. For more options, please refer to velero debug --help
Anything else you would like to add:
I found this old "prioritized" issue, #2971, which describes the same problem but for Restic, whereas volume snapshot data movement uses Kopia. It does not seem to have been updated or to have made any progress in over 2 years. Any updates?
Environment:
Velero version (use velero version): v1.13.1
Velero features (use velero client config get features): None
Kubernetes version (use kubectl version): v1.28.6
volumeBindingMode: WaitForFirstConsumer
OS (e.g. from /etc/os-release):
Vote on this issue!
This is an invitation to the Velero community to vote on issues, you can see the project's top voted issues listed here.
Use the "reaction smiley face" up to the right of this comment to vote.