S3-based layer uploads are duplicated #331

Open
jay-dee7 opened this issue May 6, 2023 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@jay-dee7
Member

jay-dee7 commented May 6, 2023

Describe the bug
When we push a container image, it's logically split across multiple paths inside an S3 bucket. For example, when user johndoe pushes the johndoe/test image, this is how we store it in S3:

  • Image Layer Hash -> sha256:someuniquehashvalue
  • Layer UUID -> 093f2a0e-95b6-40ad-aae5-56df1beb546c - UUID V4 (always unique, no matter what)
  1. layers/093f2a0e-95b6-40ad-aae5-56df1beb546c
  2. johndoe/test/manifests/<tags>

The layers/ path is shared across users and works as a pool of layers. It was designed this way so that we can check whether we already have a layer and reuse it.
However, we're comparing the UUIDs of the upload sessions, which are always unique. Since the UUIDs never match, we end up uploading the container image layer every time. This defeats the optimization and wastes storage.
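To illustrate the difference, here is a minimal Go sketch that contrasts keying the pool by upload-session UUID with keying it by the layer's content hash. It is not the project's code: `layerPool`, `putBySessionUUID`, and `putByDigest` are hypothetical names, the pool is an in-memory map standing in for the S3 bucket, and the second UUID is made up for the example.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// layerPool is a hypothetical in-memory stand-in for the shared layers/ prefix in S3.
type layerPool struct {
	objects map[string][]byte // object key -> layer bytes
}

func newLayerPool() *layerPool {
	return &layerPool{objects: make(map[string][]byte)}
}

// digestOf returns the content hash in the "sha256:<hex>" form.
func digestOf(blob []byte) string {
	sum := sha256.Sum256(blob)
	return "sha256:" + hex.EncodeToString(sum[:])
}

// putBySessionUUID mirrors the behaviour described in this issue: the key comes
// from the upload session, so identical content never hits an existing key.
func (p *layerPool) putBySessionUUID(sessionUUID string, blob []byte) (key string, uploaded bool) {
	key = "layers/" + sessionUUID
	if _, ok := p.objects[key]; ok {
		return key, false
	}
	p.objects[key] = blob
	return key, true
}

// putByDigest derives the key from the content, so a second upload of the
// same bytes finds the existing object and is skipped.
func (p *layerPool) putByDigest(blob []byte) (key string, uploaded bool) {
	key = "layers/" + digestOf(blob)
	if _, ok := p.objects[key]; ok {
		return key, false
	}
	p.objects[key] = blob
	return key, true
}

func main() {
	layer := []byte("identical layer content")

	byUUID := newLayerPool()
	byUUID.putBySessionUUID("093f2a0e-95b6-40ad-aae5-56df1beb546c", layer)
	byUUID.putBySessionUUID("11111111-2222-4333-8444-555555555555", layer) // made-up second session

	byDigest := newLayerPool()
	byDigest.putByDigest(layer)
	byDigest.putByDigest(layer)

	fmt.Println("objects keyed by session UUID:", len(byUUID.objects)) // 2
	fmt.Println("objects keyed by digest:", len(byDigest.objects))     // 1
}
```

In a real fix the map lookup would presumably become an existence check against the bucket (or a metadata store), but the point is the same: only a content-derived key can ever match on a repeat upload.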

To Reproduce
Upload a container image with a tag, then re-tag it with a different tag value and upload it again without modifying the image itself. You'll now have twice as many layers, but there should be only one set of layers under the layers/ path.

Expected behavior
If a layer already exists, the API should return either a success response or mount the layer.
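As a rough sketch of that behaviour, the upload-initiation step could branch on whether the digest is already known. The status codes follow the OCI Distribution Specification (201 Created when a blob is mounted, 202 Accepted when a new upload session starts); everything else here, including `uploadDecision`, `decideUpload`, and the `known` map, is a hypothetical illustration rather than the registry's actual handler.

```go
package main

import (
	"fmt"
	"net/http"
)

// uploadDecision is a hypothetical summary of what the API would answer.
type uploadDecision struct {
	Status   int    // HTTP status code to return
	Location string // value for the Location header
}

// decideUpload returns 201 with the blob URL when the digest is already in the
// pool (layer reused or mounted), and 202 with a new session URL otherwise.
// known is an assumed lookup of digests already stored under layers/.
func decideUpload(known map[string]bool, repo, digest, sessionUUID string) uploadDecision {
	if digest != "" && known[digest] {
		return uploadDecision{
			Status:   http.StatusCreated,
			Location: "/v2/" + repo + "/blobs/" + digest,
		}
	}
	return uploadDecision{
		Status:   http.StatusAccepted,
		Location: "/v2/" + repo + "/blobs/uploads/" + sessionUUID,
	}
}

func main() {
	known := map[string]bool{"sha256:someuniquehashvalue": true}

	// Layer already in the pool: no new upload session is needed.
	fmt.Println(decideUpload(known, "johndoe/test", "sha256:someuniquehashvalue", "093f2a0e-95b6-40ad-aae5-56df1beb546c"))

	// Unknown layer: fall back to a regular upload session.
	fmt.Println(decideUpload(known, "johndoe/test", "sha256:anotherhashvalue", "093f2a0e-95b6-40ad-aae5-56df1beb546c"))
}
```

Clients that probe with a HEAD request on the blob before uploading would get the same benefit, as long as that lookup is also keyed by digest rather than by session UUID.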

Screenshots

The CIDs below clearly show that the data in these layers is the same, but we still have 4 copies.

image

Log Files
nil

Desktop (please complete the following information):

  • OS: macOS
  • Version 13.3.1 (a)


@jay-dee7 jay-dee7 added the bug Something isn't working label May 6, 2023
@jay-dee7 jay-dee7 self-assigned this May 6, 2023
@jay-dee7 jay-dee7 moved this from Todo to In Progress in Containerish Road Map 2023-24 May 8, 2023