Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sched.resource-status RPC hangs when Fluxion is scheduling large jobs #1039

Closed
grondo opened this issue Jun 26, 2023 · 1 comment
Closed

Comments

@grondo
Copy link
Contributor

grondo commented Jun 26, 2023

During the testing in #1009, it was noticed that flux resource list is slow to respond, or doesn't respond at all, while Fluxion is struggling to schedule jobs. This was extremely apparent in the larger job size tests where scheduling a single job takes 10-30s.

We'll require some mitigation for this, since not answering resource-status requests is not acceptable on a system instance, where these queries will be common.

@grondo
Copy link
Contributor Author

grondo commented Mar 25, 2024

Not an issue anymore after flux-framework/flux-core#5796 is merged.

@grondo grondo closed this as completed Mar 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant