ROX-15980 set resource requests and limits to the egress-proxy #991

ludydoo · 2023-04-27T10:59:05Z

Sets the resource requests and limits for the egress-proxy.

The values were derived from metrics observed in Prometheus/Grafana. These defaults are generous compared with the observed usage, but I was not comfortable setting anything lower for a production deployment.

It also seems that the values.yaml file was being ignored for the tenant-resources chart. This PR changes that: the values in that file are now used as defaults, and overrides are applied on top of them.

The current cluster configuration deploys 1 replica of the egress-proxy, so I've changed the value to reflect that as well.
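
To illustrate the defaults-plus-overrides behaviour described above, here is a minimal sketch written as three separate YAML documents. The image and replica count come from this PR and the resource figures appear later in the review. The `resources` parent key, the override fragment, and the merged result are hypothetical. The merge is assumed to behave like Helm's usual deep merge of maps, where an override key wins and anything left unspecified falls back to the chart's values.yaml.

```yaml
# 1) Chart defaults (values.yaml). The nesting under "resources" is an
#    assumption about the file layout, not copied from the repository.
egressProxy:
  image: ubuntu/squid:5.2-22.04_beta
  replicas: 1
  resources:
    requests:
      cpu: 100m
      memory: 128Mi
    limits:
      cpu: 200m
      memory: 256Mi
---
# 2) Hypothetical override supplied at deploy time: only the changed key.
egressProxy:
  resources:
    limits:
      memory: 512Mi
---
# 3) Effective values after a deep merge: the override replaces one leaf,
#    every other key keeps its default.
egressProxy:
  image: ubuntu/squid:5.2-22.04_beta
  replicas: 1
  resources:
    requests:
      cpu: 100m
      memory: 128Mi
    limits:
      cpu: 200m
      memory: 512Mi
```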

kylape · 2023-04-27T13:25:22Z

fleetshard/pkg/central/charts/data/tenant-resources/values.yaml

+    limits:
+      cpu: 100m
+      memory: 128Mi
+    requests:


I think you got the request and limit values backwards.
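
For reference, Kubernetes rejects a container spec whose request exceeds its limit, so the smaller figure belongs under `requests`. A sketch of the conventional orientation, using the figures that appear later in this review; the enclosing `resources` key is an assumption about the file layout:

```yaml
resources:        # assumed parent key
  requests:       # amount the scheduler reserves for the container
    cpu: 100m
    memory: 128Mi
  limits:         # ceiling enforced at runtime; must be >= requests
    cpu: 200m
    memory: 256Mi
```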

ludydoo · 2023-04-27T15:29:30Z

/retest

kylape · 2023-04-27T17:07:17Z

fleetshard/pkg/central/charts/data/tenant-resources/values.yaml

 egressProxy:
  image: ubuntu/squid:5.2-22.04_beta
-  replicas: 2
-
+  replicas: 1


This doesn't need to be fixed in this PR, but it does make me wonder whether we should actually run two replicas in the cloud service, for all the usual reasons.

@kylape @porridge Perhaps with a podAntiAffinity against other egress-proxy pods (preferredDuringSchedulingIgnoredDuringExecution)?
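
A rough sketch of what the anti-affinity suggested above could look like in the pod spec. Kubernetes expresses this as `podAntiAffinity` with the soft rule `preferredDuringSchedulingIgnoredDuringExecution`. The `app: egress-proxy` label is hypothetical, since the labels actually used by the chart are not shown in this thread.

```yaml
# Pod spec fragment: prefer, but do not require, placing egress-proxy
# replicas on different nodes.
affinity:
  podAntiAffinity:
    preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          labelSelector:
            matchLabels:
              app: egress-proxy   # hypothetical label; match the chart's own
          topologyKey: kubernetes.io/hostname
```

Because the rule is only preferred, the scheduler can still place both replicas on one node when no other node fits, so it would not block rollouts on small clusters.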

kylape

lgtm!

porridge · 2023-04-28T10:16:08Z

fleetshard/pkg/central/charts/data/tenant-resources/values.yaml

+      cpu: 100m
+      memory: 128Mi
+    limits:
+      cpu: 200m
+      memory: 256Mi


How about we use requests == limits, at least until we have more replicas? We'll likely be setting resources for a bunch of pods in the next rollouts (see sibling PRs), so the risk of evictions will be higher than normal. With only one replica, I'm a bit worried this could cause service degradation for non-trivial central configurations.

I can add the replicas in this PR perhaps?

I wouldn't mind.
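
For context on the requests == limits suggestion: when every container in a pod has CPU and memory requests equal to its limits, Kubernetes places the pod in the Guaranteed QoS class, which makes it one of the last candidates for eviction under node resource pressure. A minimal sketch follows. The thread does not say which figures were finally chosen, so the numbers below simply reuse the limit values from the diff as an assumption.

```yaml
# Requests equal to limits for both CPU and memory gives Guaranteed QoS,
# provided this holds for every container in the pod.
resources:
  requests:
    cpu: 200m        # assumed figure, not confirmed by the thread
    memory: 256Mi    # assumed figure, not confirmed by the thread
  limits:
    cpu: 200m
    memory: 256Mi
```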

openshift-ci · 2023-05-03T13:26:05Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kurlov, kylape, ludydoo, porridge

Needs approval from an approver in each of these files:

~~OWNERS~~ [kurlov,porridge]

ROX-15980 set resource requests and limits to the egress-proxy

45ccd77

ludydoo temporarily deployed to development April 27, 2023 10:59 — with GitHub Actions Inactive

ludydoo requested review from porridge and kylape April 27, 2023 10:59

ROX-15980 ensure resources are quoted

641a635

ludydoo temporarily deployed to development April 27, 2023 11:02 — with GitHub Actions Inactive

ROX-15980 use default helm values when deploying tenant resources

2f1a527

ludydoo requested a review from kurlov April 27, 2023 12:49

ludydoo temporarily deployed to development April 27, 2023 12:50 — with GitHub Actions Inactive

ROX-15980 cleanup

f4b266d

ludydoo temporarily deployed to development April 27, 2023 12:53 — with GitHub Actions Inactive

ROX-15980 cleanup

b0dcb10

ludydoo temporarily deployed to development April 27, 2023 12:53 — with GitHub Actions Inactive

ludydoo temporarily deployed to development April 27, 2023 12:54 — with GitHub Actions Inactive

kylape reviewed Apr 27, 2023

Update values.yaml

158e62b

ludydoo temporarily deployed to development April 27, 2023 13:36 — with GitHub Actions Inactive

ludydoo requested a review from kylape April 27, 2023 13:43

ROX-15980 Change e2e test

64c5d5e

ludydoo temporarily deployed to development April 27, 2023 16:11 — with GitHub Actions Inactive

kylape reviewed Apr 27, 2023

kylape approved these changes Apr 27, 2023

openshift-ci bot assigned kylape Apr 27, 2023

openshift-ci bot added the lgtm label Apr 27, 2023

porridge approved these changes Apr 28, 2023

openshift-ci bot assigned porridge Apr 28, 2023

openshift-ci bot added the approved label Apr 28, 2023

kurlov approved these changes Apr 28, 2023

openshift-ci bot assigned kurlov Apr 28, 2023

ROX-15980 PR comments

7f588c1

openshift-ci bot removed the lgtm label May 2, 2023

ludydoo temporarily deployed to development May 2, 2023 09:30 — with GitHub Actions Inactive

ludydoo requested review from porridge and kylape May 2, 2023 09:30

ROX-15980 typo

35f5928

ludydoo temporarily deployed to development May 2, 2023 09:30 — with GitHub Actions Inactive

kylape approved these changes May 3, 2023

openshift-ci bot added the lgtm label May 3, 2023

ludydoo merged commit aad51d1 into main May 3, 2023

ludydoo deleted the ROX-15980-egress-proxy-resources-requests-and-limits branch May 3, 2023 13:26
