-
Notifications
You must be signed in to change notification settings - Fork 262
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
JobSet stays in suspend state if kueue is managing it #3349
Comments
Workload says its admitted;
But the jobset is suspended:
If I submit this jobSet without the kueue label, the workload runs without issue. |
Kueue manager logs are logging this error. |
We already fixed #3132 on 0.9. |
I think it might be a good idea indeed. The fix does not require API changes. We deferred due to possibly many conflicts but I think it is worth trying. Could you please try to prepare a minimal cherry - pick so that we can assess what it entails? |
/assign |
SGTM |
I think I'm ok with with that - no API / schema changes in the diffs, but the changes are big, so let me confirm with @tenzen-y . Actually, we discussed the cherry-picking before and the main argument was that we still have time before release of new CRDs, which is proven wrong by the issue. OTOH, we are just a week from releasing 0.9.0, and based on the comment #3349 (comment) @kannon92 could probably mitigate by using 0.9.0-rc.1 |
Yes, that's right. However, the discussion result was based on the already resolved RayJob issue. |
This could be an option indeed. If this is less changes I'm ok to also start with that |
@mbobrovskyi, Could you check if we can upgrade the JobSet module version with fewer changes? |
Ah, it's require to upgrade the Kubernetes version to v0.31.1. And there are a lot of changes :) |
in that case let's go with the fix for field dropping |
Thank you all! My hope was to test Kueue with released containers for Kubecon so using the rc isn’t ideal. Either way I think having this change for 0.8 will be useful as 0.9 requires 1.31 so this will be helpful. |
/close Due to fixed by #3358. |
@mbobrovskyi: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
FYI we are going to release 0.8.2 which will include the fix: #3371 |
What happened:
If I submit a simple JobSet with kueue, the workload stays in a suspend state.
What you expected to happen:
Kueue will unsuspend and the workload will run successfully.
How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
JobSet is 0.7.0.
Environment:
kubectl version
):git describe --tags --dirty --always
): 0.8.1cat /etc/os-release
):uname -a
):The text was updated successfully, but these errors were encountered: