-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
per-queue user limits #402
Comments
Thanks for opening this @ryanday36. To the best of my knowledge, I think flux-accounting at the moment is most capable of enforcing the max running jobs per-user in a given queue. It already enforces a max running jobs limit per-user across all of their jobs, so I think enforcing it per queue would be reasonable. (mostly thinking out loud here) This would entail:
This includes the assumption that the @grondo not sure if you have any suggestions on my thought process outlined above of how implementing this might work, or if I made any dumb mistakes above and forgot to include something, but any feedback/suggestions here would be welcome. :-) |
I could also be wrong here, but I believe enforcing a max nodes per-user limit, both per-queue and in general, would require some coordination between flux-accounting and other Flux components, as nicely summarized by @grondo in a comment in #349:
It has been a little while since we've discussed this, though, so perhaps we are better suited to tackle this now than before. |
we've had a request for more limits that we can set on specific queues. We can currently set limits on how many resources a specific job can use (max nodes, min nodes, wall time). We'd also like to set limits on how many resources a specific user's running jobs can use in a given queue. Specifically, we'd like to be able set the following for a queue:
max running jobs per user
max nodes per user (across all running jobs, similar to #349)
I've also been giving some thought to whether some sort of 'max committed node-hours per user' (total nodes*requested walltime of all of a users running jobs) would be useful. I'm not so sure about that though.
Tasks
The text was updated successfully, but these errors were encountered: