Description
The submitted tasks have the following characteristics:
1. Pods in a Job are not identical; they may have different resource requirements, different node affinities, etc.
2. The Gang policy (JobReadyFn) is enabled.
In this case, all Pods in a Job share the same Predicate cache. When one Pod fails the Predicate on a node, that node is never considered again for the other Pods in the Job. This may cause the task to fail.
I don't think it is safe to assume that all Pods in a Job are the same, so they shouldn't share one cache. Could the Predicate cache be made configurable with a switch?
Note: I used a third-party platform to submit the Job, not Volcano Job.
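The failure mode described above can be sketched with a toy model. This is not Volcano's code: `Pod`, `Node`, `fits`, `feasible`, `byJob`, and `byPod` are all hypothetical names, and the predicate is reduced to a single label match. The sketch only assumes that predicate failures are remembered under some cache key and that a cached failure skips the node without re-running the predicate.

```go
package main

import "fmt"

// Pod is a hypothetical, simplified stand-in for one task of a Job.
type Pod struct {
	Name     string
	Job      string
	Selector string // the node label this pod requires
}

// Node is a hypothetical, simplified node with a single label.
type Node struct {
	Name  string
	Label string
}

// fits is a toy predicate: the pod fits only on nodes carrying its label.
func fits(p Pod, n Node) bool { return p.Selector == n.Label }

// feasible returns, per pod, the nodes that passed the predicate.
// keyFn decides which pods share cached predicate failures.
func feasible(pods []Pod, nodes []Node, keyFn func(Pod) string) map[string][]string {
	failed := map[string]bool{} // "cacheKey/nodeName" -> predicate failed earlier
	result := map[string][]string{}
	for _, p := range pods {
		for _, n := range nodes {
			k := keyFn(p) + "/" + n.Name
			if failed[k] {
				continue // cached failure: node is skipped without re-checking
			}
			if fits(p, n) {
				result[p.Name] = append(result[p.Name], n.Name)
			} else {
				failed[k] = true
			}
		}
	}
	return result
}

// byJob shares one cache entry per Job; byPod keeps pods separate.
func byJob(p Pod) string { return p.Job }
func byPod(p Pod) string { return p.Job + "/" + p.Name }

func main() {
	pods := []Pod{{"a", "job1", "gpu"}, {"b", "job1", "cpu"}}
	nodes := []Node{{"n1", "gpu"}, {"n2", "cpu"}}
	fmt.Println("shared per Job:", feasible(pods, nodes, byJob))
	fmt.Println("separate per Pod:", feasible(pods, nodes, byPod))
}
```

With the per-Job key, pod `a` fails on `n2`, so `n2` is never evaluated for pod `b` even though `b` would fit there. With the per-Pod key, each pod is evaluated on its own.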
Steps to reproduce the issue
Describe the results you received and expected
I don't think it is safe to assume that all Pods in a Job are the same, so they shouldn't share one cache. Could the Predicate cache be made configurable with a switch?
What version of Volcano are you using?
release v1.9.0
Any other relevant information
No response
Thanks for your feedback. There is already a PR #3649 to improve it.
Thanks, I see it. There are now separate scheduling caches per TaskRole. However, when developing custom scheduling plugins, there are scenarios where Pods with the same TaskRole pass the Predicate on different nodes. The current solution reduces the extensibility of the predicate extension points. Could you consider making the cache configurable, or some other approach?
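A minimal sketch of what such a switch could look like, under stated assumptions: `Task`, `PredicateFn`, `RoleCache`, its `Enabled` field, `pinned`, and `placeable` are hypothetical names for illustration and are not part of Volcano's API. The cache remembers failures per (TaskRole, node), and `pinned` stands in for a custom plugin whose result depends on the individual Pod rather than on its role.

```go
package main

import "fmt"

// Task is a hypothetical, simplified view of a pod: its TaskRole and name.
type Task struct {
	Role string
	Name string
}

// PredicateFn reports whether the task may run on the node.
type PredicateFn func(t Task, node string) bool

// RoleCache wraps a predicate and remembers failures per (TaskRole, node).
// Enabled is the hypothetical switch discussed in this thread.
type RoleCache struct {
	Enabled bool
	Inner   PredicateFn
	Calls   int             // how many times Inner actually ran
	failed  map[string]bool // "role/node" -> predicate failed earlier
}

// Check consults the cache only when Enabled; otherwise it always runs Inner.
func (c *RoleCache) Check(t Task, node string) bool {
	key := t.Role + "/" + node
	if c.Enabled && c.failed[key] {
		return false // cached failure for this role on this node
	}
	c.Calls++
	ok := c.Inner(t, node)
	if !ok && c.Enabled {
		if c.failed == nil {
			c.failed = map[string]bool{}
		}
		c.failed[key] = true
	}
	return ok
}

// pinned is a stand-in for a custom plugin: every pod is tied to the node
// named after it, so pods of the same role pass on different nodes.
func pinned(t Task, node string) bool { return "node-"+t.Name == node }

// placeable lists, per task, the nodes that passed Check.
func placeable(c *RoleCache, tasks []Task, nodes []string) map[string][]string {
	out := map[string][]string{}
	for _, t := range tasks {
		for _, n := range nodes {
			if c.Check(t, n) {
				out[t.Name] = append(out[t.Name], n)
			}
		}
	}
	return out
}

func main() {
	tasks := []Task{{"worker", "a"}, {"worker", "b"}}
	nodes := []string{"node-a", "node-b"}
	on := &RoleCache{Enabled: true, Inner: pinned}
	off := &RoleCache{Enabled: false, Inner: pinned}
	fmt.Println("cache on: ", placeable(on, tasks, nodes), "predicate calls:", on.Calls)
	fmt.Println("cache off:", placeable(off, tasks, nodes), "predicate calls:", off.Calls)
}
```

The trade-off in this sketch is that disabling the cache runs the predicate for every (Pod, node) pair, which costs more calls but lets a per-Pod plugin see every candidate node.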
@lowang-bh Can you consider this problem? I think this is an important bug fix. Thanks