Epic: Ensure durability for user workspace files #7901

Closed
75 of 77 tasks
kylos101 opened this issue Jan 28, 2022 · 13 comments
Labels: feature: pvc / snapshots, meta: stale, team: workspace, type: epic

Comments

@kylos101
Contributor

kylos101 commented Jan 28, 2022

Summary

Better protect user data

Context

Sometimes a workspace, node, or workspace cluster fails and the user data cannot be backed up to cloud storage, resulting in data loss. There is a related incident for a global outage, and a related RFC where we are discussing solutions.

Value

If we handle user data better, users will trust that they will not lose data even if the Gitpod service becomes unavailable: once it is back online, their data will still be there.

Acceptance criteria

User data is persisted in such a way that even if there is a workspace, node, or cluster failure, the data is accessible to be backed up at a later time.

Tasks

Ops:

Design:

Product changes:

Tests:

Bug

Should solve:

Day 2:


@kylos101 kylos101 added team: workspace Issue belongs to the Workspace team type: epic labels Jan 28, 2022
@atduarte atduarte changed the title Epic: better handle user data Epic: Ensure durability and availability of user workspace files Jan 31, 2022
@atduarte
Contributor

atduarte commented Jan 31, 2022

@kylos101 A few questions related to "users must be able to access their most recent backup for a workspace regardless of workspace status":
1. During the stopping state, would the system be able to distinguish a backup that was done as a result of it from a previous one?
2. From what I understand/recall we store the last 4 backups. Would we be able to provide the WebApp with the links and corresponding timestamps of all of them?

@kylos101 kylos101 changed the title Epic: Ensure durability and availability of user workspace files Epic: Ensure durability user workspace files Jan 31, 2022
@kylos101 kylos101 changed the title Epic: Ensure durability user workspace files Epic: Ensure durability for user workspace files Feb 2, 2022
@kylos101 kylos101 moved this to Scheduled in 🌌 Workspace Team Mar 9, 2022
@aledbf
Member

aledbf commented Mar 28, 2022

automate deployment of GCP storageClasses as part of cluster creation operation (specify discard mount option)

this is not required for XFS.
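
As a rough illustration of the task quoted above, the following sketch builds a Kubernetes StorageClass manifest with the `discard` mount option and prints it as JSON, which `kubectl apply -f -` accepts. The class name, the `pd-ssd` disk type, the binding mode, the reclaim policy, and the use of the GKE persistent disk CSI provisioner are all assumptions made for the example, not values taken from this thread. Omitting `discard` for XFS is one reading of the reply above.

```python
import json


def build_storage_class(name: str, fs_type: str = "ext4") -> dict:
    """Build a StorageClass manifest for GCP persistent disks.

    All concrete values here (provisioner, disk type, policies) are
    assumptions for illustration only.
    """
    manifest = {
        "apiVersion": "storage.k8s.io/v1",
        "kind": "StorageClass",
        "metadata": {"name": name},
        # Assumed: the GKE persistent disk CSI driver.
        "provisioner": "pd.csi.storage.gke.io",
        "parameters": {
            "type": "pd-ssd",  # assumed disk type
            "csi.storage.k8s.io/fstype": fs_type,
        },
        "volumeBindingMode": "WaitForFirstConsumer",
        "reclaimPolicy": "Delete",
    }
    # Per the reply above, the discard mount option is not required for XFS,
    # so this sketch only sets it for other filesystems.
    if fs_type != "xfs":
        manifest["mountOptions"] = ["discard"]
    return manifest


if __name__ == "__main__":
    # "gitpod-workspace" is a hypothetical class name.
    print(json.dumps(build_storage_class("gitpod-workspace"), indent=2))
```

Generating the manifest from a function makes it easier to fold into a cluster creation script, which is what the task asks for.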

@aledbf
Member

aledbf commented Mar 28, 2022

installer: allow to specify storageClass in gitpod.yaml

this can be optional for the first iteration

@kylos101
Contributor Author

kylos101 commented Apr 6, 2022

@sagor999 as a heads up, I added a few observability tasks. One of the first ones we'll need (if it doesn't already exist) is the ability to inspect backups and restores now being done with TAR. For example, this way we can measure duration for both.
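
To make the duration measurement concrete, here is a minimal sketch that times a TAR backup and a TAR restore using only the Python standard library. It is not how Gitpod's workspace components do it. The function names, the temporary directory layout, and the sample file are hypothetical, and a real implementation would presumably export the timings as metrics rather than return them.

```python
import tarfile
import tempfile
import time
from pathlib import Path


def timed_backup(src_dir: Path, archive: Path) -> float:
    """Write src_dir into a tar archive and return the elapsed seconds."""
    start = time.perf_counter()
    with tarfile.open(archive, "w") as tar:
        tar.add(src_dir, arcname=".")
    return time.perf_counter() - start


def timed_restore(archive: Path, dest_dir: Path) -> float:
    """Extract the tar archive into dest_dir and return the elapsed seconds."""
    start = time.perf_counter()
    with tarfile.open(archive, "r") as tar:
        tar.extractall(dest_dir)
    return time.perf_counter() - start


def demo() -> dict:
    """Back up and restore a tiny hypothetical workspace, timing both steps."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        workspace = root / "workspace"
        workspace.mkdir()
        (workspace / "main.go").write_text("package main\n")

        archive = root / "backup.tar"
        restored = root / "restored"
        restored.mkdir()

        backup_seconds = timed_backup(workspace, archive)
        restore_seconds = timed_restore(archive, restored)
        restored_ok = (restored / "main.go").read_text() == "package main\n"
        return {
            "backup_seconds": backup_seconds,
            "restore_seconds": restore_seconds,
            "restored_ok": restored_ok,
        }


if __name__ == "__main__":
    print(demo())
```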

@kylos101
Contributor Author

kylos101 commented Jun 1, 2022

@sagor999 @jenting are there any more integration tests that need to be added for the new code we've written? In other words, I see you've fixed existing tests, but I wanted to double-check for new test needs. For example, one test I can think of would kill a pod, rely on a process to back up the orphaned PVC, and then assert that the PVC is gone (because it was snapshotted).
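
The shape of the test described above can be sketched against an in-memory stand-in. Everything here is hypothetical: `FakeCluster`, `backup_orphaned_pvcs`, and the pod, PVC, and snapshot names are invented for illustration. A real integration test would run against an actual cluster and the real backup process.

```python
from dataclasses import dataclass, field


@dataclass
class FakeCluster:
    """Hypothetical in-memory stand-in for a workspace cluster."""

    pods: dict = field(default_factory=dict)  # pod name -> PVC name
    pvcs: set = field(default_factory=set)
    snapshots: set = field(default_factory=set)

    def start_workspace(self, pod: str, pvc: str) -> None:
        self.pods[pod] = pvc
        self.pvcs.add(pvc)

    def kill_pod(self, pod: str) -> None:
        # The pod disappears without a backup; its PVC is left behind.
        del self.pods[pod]

    def orphaned_pvcs(self) -> set:
        # A PVC is orphaned when no pod references it any more.
        return self.pvcs - set(self.pods.values())


def backup_orphaned_pvcs(cluster: FakeCluster) -> None:
    """Hypothetical process: snapshot each orphaned PVC, then delete it."""
    for pvc in sorted(cluster.orphaned_pvcs()):
        cluster.snapshots.add(f"snapshot-of-{pvc}")
        cluster.pvcs.discard(pvc)


def test_killed_pod_pvc_is_snapshotted_and_removed() -> None:
    cluster = FakeCluster()
    cluster.start_workspace("ws-pod", "ws-pvc")

    cluster.kill_pod("ws-pod")
    assert cluster.orphaned_pvcs() == {"ws-pvc"}

    backup_orphaned_pvcs(cluster)
    assert "ws-pvc" not in cluster.pvcs
    assert "snapshot-of-ws-pvc" in cluster.snapshots


if __name__ == "__main__":
    test_killed_pod_pvc_is_snapshotted_and_removed()
```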

@axonasif
Member

axonasif commented Nov 11, 2022

Question: How would someone who ran out of hours get their data back? (re: #14393)
Contact support? It'd be better if they could self-serve.

@SNWCreations

Question: Will this change prevent us from downloading a single file in the workspace? (Will the "Download..." button in the right-click menu of a file still be available?)
Sometimes I need to update my artifact on another server by downloading the artifact from the Gitpod server and uploading it to my server manually.

@svenefftinge
Member

Question: Will this change prevent us from downloading a single file in the workspace? (Will the "Download..." button in the right-click menu of a file still be available?)

No, this is about downloading the workspace content backup. You can still download individual files from your running workspace, depending on how you connect to it. For example, with VS Code, just drag and drop.

@6uliver

6uliver commented Nov 18, 2022

Maybe this issue should be part of this epic to not lose my workspace's content on a regular basis: #11183

@atduarte
Contributor

Update:
Blocking functional issues and significantly increased workspace startup times were found with the current technical design. 😞

After internal discussions, given that the backup success ratio is high and stable following adjacent improvements, and that implementing the new design will be considerably faster after #11416, we have decided to pause this effort until then.

PS: @6uliver I believe the root cause of that issue is different from the context of this one. I will follow up on that one there. 🙏

@atduarte atduarte removed the status in 🌌 Workspace Team Nov 24, 2022
@stale

stale bot commented Sep 16, 2023

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the meta: stale This issue/PR is stale and will be closed soon label Sep 16, 2023
@github-actions github-actions bot removed the meta: stale This issue/PR is stale and will be closed soon label May 23, 2024
Contributor

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the meta: stale This issue/PR is stale and will be closed soon label Aug 22, 2024
@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 2, 2024
8 participants