velero restic backup timeout #6450
Comments
The current timeout setting applies to the whole backup, i.e. if a pod volume backup has not completed within 4 hours of the backup starting, it times out (which means subsequent ones will time out too). The intent of this timeout is "choose a time that no backup should take longer than". If you had 100 volumes and each was taking over an hour, your backup would take days. There may be some value in an additional "time out an individual volume if this volume takes longer than x minutes" setting, but that should probably be an enhancement added on top of the current timeout, not a replacement for it.
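To make the difference between the two timeout styles concrete, here is a minimal Python sketch. It is not Velero's code: the function `simulate_backup`, its parameters, and the assumption that volumes are processed strictly one at a time are all hypothetical simplifications for illustration.

```python
from typing import List, Optional


def simulate_backup(
    durations_min: List[float],
    backup_timeout_min: float,
    per_volume_timeout_min: Optional[float] = None,
) -> List[str]:
    """Return one status per volume.

    Hypothetical model: volumes are backed up sequentially, and a single
    deadline covers the whole backup. An optional per-volume limit is
    layered on top of that deadline, not used instead of it.
    """
    statuses = []
    elapsed = 0.0
    for duration in durations_min:
        remaining = backup_timeout_min - elapsed
        if remaining <= 0:
            # The whole-backup deadline has already passed.
            statuses.append("timed_out")
            continue
        budget = remaining
        if per_volume_timeout_min is not None:
            budget = min(budget, per_volume_timeout_min)
        if duration <= budget:
            elapsed += duration
            statuses.append("completed")
        else:
            # Give up on this volume once its budget is spent.
            elapsed += budget
            statuses.append("timed_out")
    return statuses


# A hung volume (infinite duration) between two 10-minute volumes,
# with a 4-hour (240-minute) whole-backup timeout.
hung = float("inf")
print(simulate_backup([10, hung, 10], 240))
# ['completed', 'timed_out', 'timed_out']
print(simulate_backup([10, hung, 10], 240, per_volume_timeout_min=30))
# ['completed', 'timed_out', 'completed']
```

In this model, a single hung volume consumes the rest of the whole-backup budget, so every later volume times out as well. With the extra per-volume limit, only the hung volume is lost.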
Hmm yes that makes sense @sseago; thank you for the explanation.
@ugur99
Thanks for the support @Lyndon-Li. Due to confidentiality, I'm afraid I can't share the full Velero debug output, but I'd be happy to share specific logs or manifests, whatever you need to troubleshoot. To be honest, I've already shared the related logs and descriptions in the issue; there is no useful info even in debug mode. From time to time we observe that restic gets stuck when backing up volumes like configMap or emptyDir, and we have no idea what is going on.
@ugur99
Therefore, the best practice is to exclude them from backup. Let's see how we can filter the volumes by their type, as a future enhancement of Velero's filter system.
FYI, I've opened a separate issue #6482 for the filter enhancement.
Thank you @Lyndon-Li!
Closing as there is no further request for this issue.
What steps did you take and what happened:
Velero Restic PodVolumeBackups have started getting stuck on the first attempt, and the backup cannot proceed to the other volumes.
What did you expect to happen:
Even if it hangs on some PodVolumeBackups, I believe that after a per-volume timeout it should be able to continue with the next volume.
The following information will help us better understand what's going on:
Anything else you would like to add:
I know that it is best practice to ignore some dummy volumes to make Velero Restic Backup work more efficiently, but since we cannot do this by volume type, we are currently looking for an easy way to do this for the entire cluster. I mean, if there is a way to tell Restic to only back up pvc-type volumes, that would be great.
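Until filtering by volume type exists, one workaround is to compute Velero's per-pod volume annotations from each pod's spec. The sketch below is a hedged illustration: the helper `volume_annotations` and the sample pod are hypothetical, while the two annotation keys are the ones Velero documents for opting volumes in to or out of file system backup.

```python
from typing import Dict

# Annotation keys documented by Velero for pod volume (file system) backup.
INCLUDE_ANNOTATION = "backup.velero.io/backup-volumes"
EXCLUDE_ANNOTATION = "backup.velero.io/backup-volumes-excludes"


def volume_annotations(pod: dict) -> Dict[str, str]:
    """Split a pod's volumes into PVC-backed and everything else.

    `pod` is a pod manifest as a plain dict (e.g. parsed from
    `kubectl get pod -o json`). Hypothetical helper, not a Velero API.
    """
    volumes = pod.get("spec", {}).get("volumes", [])
    pvc_backed = [v["name"] for v in volumes if "persistentVolumeClaim" in v]
    others = [v["name"] for v in volumes if "persistentVolumeClaim" not in v]
    return {
        INCLUDE_ANNOTATION: ",".join(pvc_backed),
        EXCLUDE_ANNOTATION: ",".join(others),
    }


# Made-up pod for illustration only.
sample_pod = {
    "spec": {
        "volumes": [
            {"name": "data", "persistentVolumeClaim": {"claimName": "data-pvc"}},
            {"name": "config", "configMap": {"name": "app-config"}},
            {"name": "scratch", "emptyDir": {}},
        ]
    }
}

for key, value in volume_annotations(sample_pod).items():
    print(f"{key}={value}")
# backup.velero.io/backup-volumes=data
# backup.velero.io/backup-volumes-excludes=config,scratch
```

Only one of the two annotations would normally be applied to a pod: the include list when volumes are opted in individually, or the exclude list when all pod volumes are backed up by default.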
Environment:
Velero version (use velero version):
Velero features (use velero client config get features):
Kubernetes version (use kubectl version):