-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
job-manager: implement state synchronization (many job) #2094
Comments
Does event-based notification imply only instance owners will have access to bulk state synchronization?
Instead of another |
It depends on whether we set the privacy bit on the event. If there is not a good reason to specify that these events are private, I guess I would advocate for making the interface accessible to all. Not much is exposed here that would be sensitive.
Excellent thought! I don't see why we couldn't do that. |
Ok, great! I had forgotten about the ability to disable privacy of event messages. |
One point brought up by @SteVwonder for e.g. the simulator use case:
could be subject to a race if 1) is "fire and forget", e.g. the subscrption is not in place until some later time, resulting in missed job state changes if the workload is immediately started. I double checked and Here I should reference #1557 which notes that this API call should actually return a future since it hides an RPC, and also that if someone digs down to fix this, they will find there is a "connector operation" implementing it rather than a simple RPC. The connector operation is no longer justified and should probably be replaced with a more straightforward approach. Anyway, the point above was a good one to consider, but it should not be a problem with the current implementation of event subscriptions. |
Not sure, but does the current implementation of |
This issue has been automatically marked as stale because it has not had activity for 365 days. It will be closed if no further activity occurs within 14 days. Thank you for your contributions. |
I think this issue has been overtaken by other discussions on workflow management, etc.. so closing. |
We need a scalable method for entities like workflow managers, the simulator, and other tools, to be notiifed in bulk of job state transtions. This is distinct from the single job synchronization interface proposed in #1665. The driver for a separate bulk interface is scalability.
One approach would be to publish events like we did in the old system. This has some appeal due to its simplicity, and lack of impact (or at least uniform impact) on the job-manager. To improve scalability we could batch state transitions to reduce the number of event messages, e.g. publish a JSON dictionary mapping states to arrays of jobids entering that state, slightly delayed. We could put an API in front of the event messages, to arrange for a callback for each job, e.g.
Some other questions:
The text was updated successfully, but these errors were encountered: