-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow more MDT flexibility in Rabbit lustre allocations #171
Comments
Some documentation is here https://nearnodeflash.github.io/dev/guides/directive-breakdown/readme/ |
jameshcorbett
added a commit
to jameshcorbett/flux-coral2
that referenced
this issue
Jul 15, 2024
Problem: as described in issue flux-framework#171, creating many MDTs is a bad for performance, and usually goes against what is explicitly required by directivebreakdown resources. However, there is not yet a good way to get Fluxion to handle MDT allocation. Bypass Fluxion allocation completely, and tell DWS to create exactly the number of allocations requested in the .constraints.count field (which is usually found on MDTs). Place the allocations on the rabbits which have the most compute nodes allocated to the job. This is intended to be only a temporary solution, since it adds a new potential problem, in that some rabbit storage is used which is not tracked by Fluxion. This could lead to overallocation of resources, causing jobs to fail with errors. However, this seems unlikely to occur in practice, since MDTs are small and Fluxion always gives jobs more storage than they asked for, so there should usually be some spare storage.
jameshcorbett
added a commit
to jameshcorbett/flux-coral2
that referenced
this issue
Jul 15, 2024
Problem: as described in issue flux-framework#171, creating many MDTs is a bad for performance, and usually goes against what is explicitly required by directivebreakdown resources. However, there is not yet a good way to get Fluxion to handle MDT allocation. Bypass Fluxion allocation completely, and tell DWS to create exactly the number of allocations requested in the .constraints.count field (which is usually found on MDTs). Place the allocations on the rabbits which have the most compute nodes allocated to the job. This is intended to be only a temporary solution, since it adds a new potential problem, in that some rabbit storage is used which is not tracked by Fluxion. This could lead to overallocation of resources, causing jobs to fail with errors. However, this seems unlikely to occur in practice, since MDTs are small and Fluxion always gives jobs more storage than they asked for, so there should usually be some spare storage.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Problem: by default, Flux creates one MDT per rabbit for lustre file systems. However,
directivebreakdown
resources list some info which may indicate that fewer (or more) MDTs should be created.@behlendorf said:
Further, creating one MDT per rabbit
Flux should look at directivebreakdowns to see if they offer hints on how many MDTs to create.
The text was updated successfully, but these errors were encountered: