-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sched fails to load hwloc data from KVS at scale #1361
Comments
The error message is coming out of here That's caused by this function returning an error code. But the trace isn't enough to determine which sub-functions within @trws: can you throw some I think "Success" error is produced, because |
I found it, and fixed it, the reader is hard-coded to the resources it will take, and sierra has a new one. WIll upload patch when time. |
Great! |
The sched module is logging with And yes, if the module's
Probably this should eventually be rethought, there isn't going to be enough detail available in an errno to be of any use, and flux-core should just log that the module exited with nonzero status, and let the module log a more detailed reason why. |
Pointer (or cut & paste) of patch would be appreciated so we can fix this one on master. |
It's right here over in sched. The "group" type was unhandled. If possible it would be really good to switch this over to something more generic like what resource.c uses, but this gets it done for sierra. |
It seems this should go into |
Closing - reopened as flux-framework/flux-sched#308 |
@trws. And yes, hwloc reader support should go into the |
With 500 brokers, the following error is produced:
I find it especially odd that the reported error is "Success." This is being reported here rather than there to tie to the tracking issue and because I'm not sure which end the problem is on.
The text was updated successfully, but these errors were encountered: