-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integration with Kueue #12363
Comments
Also relevant kubernetes/kubernetes#121681 |
I read the above two issues and I'm not sure what the next step would be for Argo here. The current Workflow The |
Maybe we can define a layer suspend mechanism (between workflow and steps) and estimate the total resources for next layer. When our quota is reserved, we will perform a one-layer resume. |
I think the next step is to take a step back to understand the following:
Note that Kueue works best when there is a CRD that represents the unit of queueuing. |
I am thinking about this too. If there is no CRD that represents the unit of queueuing for every step, we may need to suspend the whole argo workflow, which is hard to estimate the resources needed. I prefer to let users to choose when to suspend the workflow by adding a suspend template as they do now. Maybe add a property to indicate required podsets to run when suspended is enough. In this way, when workflow is suspended, we can create a workload with the required podsets in workflow status and resume the workflow when workload is admitted. |
Proposal is available for review kubernetes-sigs/kueue#2976 |
Summary
Argo Workflows needs to implement necessary suspend mechanism to work with Kueue. See kubernetes-sigs/kueue#74 for more details.
Message from the maintainers:
Love this enhancement proposal? Give it a 👍. We prioritise the proposals with the most 👍.
The text was updated successfully, but these errors were encountered: