Proposal for Quay as a Proxy Cache #13
This looks great, thanks for taking the time to put it down into words.
Left some small comments. I'll hold the approval until we discuss it with the rest of the team.
Great work! 🚀
> ### Goals
>
> * A Quay user can define and configure (credentials, staleness period), via config file/app, a repository in Quay that acts as a cache for a specific upstream registry.
Is this saying a Quay repo is mapped to an entire upstream registry, like docker.io? A couple bullets down it sounds like a Quay org is mapped to a registry.
From the epic's user stories:

> As an administrator I want to be able to select between caching an entire upstream registry (e.g. cache all of docker.io, i.e. docker.io/library/postgres:latest -> quay.corp/cache/library/postgres:latest) or just selected namespaces (e.g. just cache docker.io/library, i.e. docker.io/library/postgres:latest -> quay.corp/docker-cache/postgres:latest) so that I can constrain access to potentially untrusted upstream registries

Maybe we should add all user stories to the enhancement so there's no confusion.
Wdyt?
I think it should be the other way around: first we need an enhancement proposal that describes a feature the way it will look in the next release, then we discuss it and accept it, and then we can create stories and implement it.
The enhancement proposals should be detailed enough that they are useful to the documentation team, QA, support engineers, and early adopters who use nightly builds. If we change some of our decisions, we need to update the EPs to keep them useful.
Note: some of my comments contain questions that go beyond the level of detail expected of EPs. I'm trying to understand it more deeply and find problems that could make it unimplementable. Implementation details that are not visible to users can be omitted from EPs, but we may still need to discuss them.
@Sunandadadi This looks good, a few comments. I'm curious, are all of these changes backend? Is there any UI work expected?
> * If the upstream image and cached version of the image are different:
>   * The user is authenticated into the upstream registry and only the changed layers of `postgres:14` are pulled.
>   * The new layers are updated in cache and served to the user in parallel.
What will happen if the same uncached blob is requested by multiple clients at the same time?
I'm not sure. I suspect that the desired behaviour is that we don't end up with duplicate blobs. Do we care about how we arrive at that result? 🤔
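For discussion, here is a minimal sketch of one way to get that result, assuming a per-digest lock around the fetch. All names here (`cache`, `fetch_from_upstream`) are hypothetical, not Quay's actual internals:

```python
import threading

# Hypothetical de-duplication of concurrent pulls of the same uncached blob:
# the first request fetches from upstream, later ones wait and hit the cache.
_blob_locks: dict[str, threading.Lock] = {}
_locks_guard = threading.Lock()

def get_blob(digest, cache, fetch_from_upstream):
    if cache.contains(digest):              # fast path: already cached
        return cache.get(digest)
    with _locks_guard:                      # one lock object per digest
        lock = _blob_locks.setdefault(digest, threading.Lock())
    with lock:
        if cache.contains(digest):          # another client won the race
            return cache.get(digest)
        blob = fetch_from_upstream(digest)  # exactly one upstream pull
        cache.put(digest, blob)
        return blob
```

With this shape the question of "how" mostly reduces to where the lock lives (in-process vs. shared, e.g. in the database or Redis) when Quay runs multiple workers.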
> A user initiates a pull of an image (say a `postgres:14` image) from an upstream repository on Quay. The repository is checked to see if the image is present.
> 1. If the image does not exist, a fresh pull is initiated.
>    * The user is authenticated into the upstream registry and all the layers of `postgres:14` are pulled.
>    * The pulled layers are saved to cache and served to the user in parallel.
Will serving and caching reuse the same connection to the upstream registry or will they open two different connections?
I have not looked in depth at the technical feasibility of this, but my idea is to pull a layer from the upstream registry and then cache and stream it. So there should only be one pull against the upstream registry and we'd use the result of that to both cache and stream the layer to the user.
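A rough sketch of that idea, assuming a chunked HTTP response object (`iter_content` as in the `requests` library) and a hypothetical `cache_writer`; this is illustrative, not the planned implementation:

```python
# Read each chunk from the single upstream connection once, persist it to
# cache storage, and stream the same bytes to the client as we go.
def tee_layer(upstream_response, cache_writer, chunk_size=1024 * 1024):
    for chunk in upstream_response.iter_content(chunk_size):
        cache_writer.write(chunk)   # save to cache
        yield chunk                 # serve to the client in parallel
    cache_writer.commit()           # mark the cached layer as complete
```

One open question with this shape is what happens to the partially written cache entry if the client disconnects mid-stream.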
> ### Goals
>
> * A Quay user can define and configure (credentials, staleness period), via config file/app, a repository in Quay that acts as a cache for a specific upstream registry.
Can you describe what the configuration process will look like? Will it be a parameter in the configuration file, or will it be configured in the web console (or via the Quay API)? If the web console, will it be available to all users or only to the superuser?
Good point, we need to at least mention how the setup will be addressed here.
From the user stories in the epic, I understand that regular users should be able to set up proxy orgs:

> - As a Quay user I want to be able to define an organization in Quay that acts as a cache for a specific upstream registry
> - As a Quay user I want to be able to supply credentials to the upstream registry when defining a cache organization so that I can circumvent / extend possible pull-rate limits or access private repositories
Good point. I will look into this and update once we have a clearer understanding.
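To make the open question concrete, one purely hypothetical shape such a configuration could take is below. None of these keys are confirmed Quay settings, and whether this lives in a config file, the web console, or the API is exactly what's being discussed in this thread:

```python
# Hypothetical proxy cache org configuration, expressed as a plain dict.
proxy_cache_org = {
    "name": "docker-cache",
    "upstream_registry": "docker.io",
    "upstream_namespace": "library",   # optional: cache only one namespace
    "credentials": {
        "username": "corp-bot",
        # a reference rather than a plaintext secret, if stored server-side
        "password_ref": "vault://quay/docker-cache",
    },
    "staleness_period_s": 86400,       # how long a cached tag is trusted
}
```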
> * If the upstream image and cached version of the image are the same:
>   * No layers are pulled from the upstream repository and the cached image is served to the user.
>
> * If the upstream image and cached version of the image are different:
What if the upstream image is deleted? Or needs new credentials?
Depending on the staleness period, the client would either get the error propagated to them (image outside the staleness period) or get the version Quay has in cache (image within the staleness period).
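A sketch of that behaviour, with assumed helper names (`cache`, `upstream`, and `UpstreamError` are illustrative stand-ins, not Quay's actual code):

```python
import time

class UpstreamError(Exception):
    """Stand-in for any upstream failure: deleted image, bad credentials, etc."""

def resolve_tag(tag, cache, upstream, staleness_period_s):
    cached = cache.lookup(tag)
    try:
        manifest = upstream.get_manifest(tag)
    except UpstreamError:
        if cached and time.time() - cached.fetched_at < staleness_period_s:
            return cached.manifest    # within staleness period: serve cache
        raise                         # outside staleness period: propagate
    if cached and cached.digest == manifest.digest:
        return cached.manifest        # upstream unchanged, serve cache
    cache.store(tag, manifest)        # refresh cache with the new version
    return manifest
```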
> ![](https://user-images.githubusercontent.com/11522230/145871778-da01585a-7b1b-4c98-903f-809c214578da.png)
> Design credits: @fmissi
>
> 2. If the image exists in cache:
If the image is pulled by SHA, do we need to perform upstream checks?
Shouldn't need to, since digests are content-addressable.
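As a sketch (helper names are assumed), a by-digest pull that hits the cache can skip upstream entirely, while tag pulls go through the staleness logic:

```python
# Digests are content-addressable: upstream cannot serve a different object
# under the same digest, so a cache hit by digest is always valid.
def get_manifest(reference, cache, resolve_via_upstream):
    if reference.startswith("sha256:"):        # pull by digest
        cached = cache.lookup_by_digest(reference)
        if cached is not None:
            return cached                      # no upstream check needed
    return resolve_via_upstream(reference)     # tags need freshness checks
```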
> images from cache are evicted based on LRU.
> * A proxy cache organization will transparently cache and stream images to clients. The images in the proxy cache organization should support the default expected behaviour (like security scanner, time machine, etc.) as other images on Quay.
> * Given the sensitive nature of accessing a potentially untrusted upstream registry, all cache pulls need to be logged (audit log).
Pulls are already logged. This can probably be inferred based on the namespace and whether or not it's set up as a proxy cache. Or maybe add additional metadata to the existing pull audit log for these cases.
@kleesc do you mean we currently log pulls done by clients pulling from Quay?
I understand this as being about logging the pulls performed by Quay against the upstream registry, not necessarily the pulls performed by the client.
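If the existing pull audit log were reused, one possible shape for distinguishing the two kinds of pulls might look like the sketch below. This is hypothetical, not Quay's actual logging API:

```python
# Illustrative only: tag audit entries so that client pulls from the cache
# org and the upstream pulls Quay performs on their behalf are both visible.
def log_proxy_pull(audit_log, repo, upstream_ref, kind):
    audit_log.log_action(
        "pull_repo",
        repository=repo,
        metadata={
            "proxy_cache": True,
            "pull_kind": kind,          # "client" or "upstream"
            "upstream": upstream_ref,   # e.g. "docker.io/library/postgres:14"
        },
    )
```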
> * A proxy cache organization will transparently cache and stream images to clients. The images in the proxy cache organization should support the default expected behaviour (like security scanner, time machine, etc.) as other images on Quay.
> * Given the sensitive nature of accessing a potentially untrusted upstream registry, all cache pulls need to be logged (audit log).
> * A Quay user can flush the cache to eliminate excess storage consumption.
I don't think this should be handled manually by the user. It should be done automatically, based on some policy that we decide on, e.g. removing the oldest cached tags/repos.
The automatic removal will exist alongside the manual flush. Manual flush is specifically mentioned in the user stories on the epic:

> - As a Quay admin I want to be able to leverage the storage quota of an organization to limit the cache size so that backend storage consumption remains predictable by discarding images from the cache according to least recently used or pull frequency
> - [...]
> - As a user I want to be able to flush the cache so that I can eliminate excess storage consumption

If you think this shouldn't be the case then we need to bring it up with Daniel when he's back.
enhancements/proxy-cache.md (outdated):

> ### Constraints
>
> * If quota management is enabled with a proxy cache organization, and say an org is set to a max quota of 500mb and the image the user wants to pull is 700mb.
Same comment as above as to what quota means in this context.
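Whatever quota ends up meaning here, the 500mb/700mb case above implies a pre-pull check along these lines (names hypothetical): when a single image exceeds the org's entire quota, no amount of eviction can make room.

```python
class QuotaExceeded(Exception):
    pass

# 700 MB image vs. 500 MB org quota: no LRU eviction can make room, so the
# pull has to be rejected up front (or the quota treated as a soft limit).
def check_fits_quota(org_quota_bytes, image_size_bytes):
    if image_size_bytes > org_quota_bytes:
        raise QuotaExceeded("image larger than the org's entire cache quota")
```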
Great work, Sunanda! 🎉
Let's iterate on this as necessary with the rest of the team after holidays.
@HammerMeetNail Yes, there will be UI work to configure a proxy cache organization, where users can configure credentials, the upstream repo path, the staleness period, and such.
LGTM 👍
Can we back burner any UI work that's needed? There's a frontend rewrite planned for next year that this work might fall nicely into.