
Consider LimitRanges when calculating Workload usage #541

Closed · 1 of 3 tasks · Tracked by #360
alculquicondor opened this issue Feb 1, 2023 · 9 comments · Fixed by #600

alculquicondor (Contributor) commented Feb 1, 2023

What would you like to be added:

Administrators can set up LimitRanges per namespace to set default requests for Pods.

We should consider these defaults when creating a Workload.
It can be done in the Workload webhook so that it applies to any custom job, similar to #316.

Caveat: if a LimitRange is created after the Workload object is created, we will not have an accurate calculation of requests.
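
For concreteness, a minimal LimitRange of the kind described above might look like this (the name, namespace, and values are illustrative):

```yaml
apiVersion: v1
kind: LimitRange
metadata:
  name: default-requests   # illustrative name
  namespace: team-a        # illustrative namespace
spec:
  limits:
  - type: Container
    defaultRequest:        # applied when a container omits resource requests
      cpu: 500m
      memory: 256Mi
    default:               # applied when a container omits resource limits
      cpu: "1"
      memory: 512Mi
```

A Pod created in team-a without explicit requests gets cpu: 500m / memory: 256Mi, so the Workload's computed usage should account for those defaults rather than treating the requests as empty.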

This issue is a spinoff from #485.

Why is this needed:

LimitRanges are a common tool for admins to set defaults for a namespace.

Completion requirements:

This enhancement requires the following artifacts:

  • Design doc
  • API change
  • Docs update

The artifacts should be linked in subsequent comments.

alculquicondor added the kind/feature label Feb 1, 2023
trasc (Contributor) commented Feb 27, 2023

/assign

1 similar comment

mcariatm (Contributor) commented:

/assign

trasc (Contributor) commented Feb 28, 2023

@alculquicondor Maybe the workload webhook is not the best way to add this, since we'll need to adapt the fix for #590.
It would be a lot easier if:
a. the defaulting (both for "limits to requests" and LimitRanges) is done in the job webhook (see the sketch below), or
b. the job-to-workload equality check ignores the resource needs of the containers
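
To illustrate the "limits to requests" defaulting mentioned in (a): when a container specifies limits but omits requests, Kubernetes copies the limits into the requests. A minimal sketch (the Pod name and image are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: limits-only        # illustrative name
spec:
  containers:
  - name: main
    image: registry.k8s.io/pause:3.9   # illustrative image
    resources:
      limits:
        cpu: "2"
        memory: 1Gi
      # requests omitted: the API server defaults them to the limits,
      # i.e. cpu: "2", memory: 1Gi
```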

alculquicondor (Contributor, Author) commented

a. This means that we would have to do the same for every custom job API (MPI, Ray, etc.), which adds more complexity to integrations.
b. We cannot completely ignore the resources, because then we would not be calculating the most up-to-date quota usage.

You can work independently from #590. Whichever PR is ready later will have to rebase.

alculquicondor (Contributor, Author) commented

cc @kerthcet

After some offline discussion, we believe it's better to have a "totalRequests" field in the status. This can be calculated during admission and added in the same API call. This means we don't need to recreate or update the Workloads if LimitRanges or RuntimeClass changes.
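
As a rough sketch of that idea (the exact schema was not settled in this thread; everything below other than the totalRequests name is an assumption):

```yaml
# Hypothetical Workload status shape. Only a "totalRequests" field is
# proposed in this thread; the surrounding structure is assumed.
status:
  totalRequests:
  - name: main             # pod set name (assumed field)
    resources:
      cpu: "8"             # totals computed at admission time, after
      memory: 16Gi         # LimitRange / RuntimeClass defaulting
```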

kerthcet (Contributor) commented

So we'll only update the totalRequests in admission, and after admission, we'll skip the calculation to avoid querying the LimitRange, right? Then we can also solve the bug in #590.

alculquicondor (Contributor, Author) commented

That is correct, that would solve #590 too. After admission, we would use the calculated requests in the cache.

kerthcet (Contributor) commented

Is this solved? What about the totalRequests field? Or was this closed by mistake?

trasc (Contributor) commented Mar 18, 2023

It is solved.
However, it has two follow-ups, #611 and #612. In the PR for #611 we'll need to be able to record the requests at the time of admission, hence we will need an API change for that.
