-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rc1.d/01-enclosing-instance: write URIs as annotations instead of into kvs? #3103
Comments
Neat idea! From today's mini hackathan with the COVID user, I felt it will go a long way if we have a high level command that can walk the entire instance hierarchy and print out job information hierarchically. Maybe this trick can be extended to make such hierarchical walk automated... |
I wonder if there should be a different command that lists job information hierarchically? If an instance hierarchy is required, then presumably there are too many jobs for a single instance, so recursive job listing should not be encouraged. If there is a small enough number of jobs that recursive listing will be scalable (and listing all jobs is a requirement), then perhaps a hierarchy of instances wasn't required. For the cases where a hierarchy is required to handle a large number of jobs, perhaps a more appropriate tool would display aggregate job information instead of listing jobs recursively? (e.g. count of running, pending, completed, failed..). |
Yes, for large scale cases, we want this in a different command .
Yes. But I was thinking this mostly as usability concerns. Currently From my recent mini-hackathan, it was clear that the following was what the user wanted to see right after we did the first level ƒ(s=4,d=0,builddir) grondo@asp:~$ flux mini batch -n2 --wrap 'flux mini submit sleep 120; flux mini submit sleep 120; flux mini submit sleep 100; flux queue drain'ƒ(s=4,d=0,builddir) grondo@asp:~$ FLUX_URI=$(flux jobs -no {user.local_uri} ƒPpRR8kWB) flux jobs
JOBID USER NAME ST NTASKS NNODES RUNTIME RANKS
ƒXPkH51 grondo sleep PD 1 - - -
ƒRSKAQf grondo sleep R 1 1 48.81s 0
ƒKNx6cw grondo sleep R 1 1 49.05s 0 The new syntax to get this is definitely far better than what we did in our recent mini hackathan! From the perspective of regularly batch users with no CS background, though, I was thinking along the line of Of course, this won't work if the JOBID isn't a flux instance. So Related: Do we want to mark a Flux instance job as a special case to flux listing tools? There seem to be certain additions things like recursive queries you want to be able to do on such jobs? One can argue users can easily find that by querying the name field though. Maybe a wrapper command that take the JOBID and tells if it is a Flux instance or not can make scripting easier. |
This is not a bad idea! In combination with a solution to #2298,
Note also that |
This can be done currently by checking |
I believe this issue has been resolved or superseded by #3986, #4004 and #3999. The broker now notifies parent of instance URIs via the
As of #4004, this now works $ flux proxy JOBID flux jobs and since the $ flux jobs
JOBID USER NAME ST NTASKS NNODES RUNTIME NODELIST
ƒ2VMtahtjZ grondo flux R 4 1 15.05s asp
$ flux proxy ƒ2VMtahtjZ flux jobs
JOBID USER NAME ST NTASKS NNODES RUNTIME NODELIST
ƒre6xGf grondo flux R 1 1 27.12s asp
ƒrccxzL grondo flux R 1 1 27.13s asp
ƒrccxzK grondo flux R 1 1 27.13s asp
$ flux proxy ƒ2VMtahtjZ/ƒrccxzL flux jobs
JOBID USER NAME ST NTASKS NNODES RUNTIME NODELIST
ƒ29yGZLU grondo sleep PD 1 - - -
ƒ29zkYco grondo sleep PD 1 - - -
ƒ29zkYcp grondo sleep PD 1 - - - We can use this groundwork to build a recursive job query tool, but that is a separate issue, so closing this one. |
On slack, a question was asked about the easiest way to run
flux jobs
against a subinstance.We already have
01-enclosing-instance
which writes alocal_uri
andremote_uri
to the parent's kvs namespace for the job. However, these values are not accessible fromflux jobs
, so instead they must be retrieved viaflux job info
.Another idea would be to record these values as job annotations, now that we have this feature. That would make the URIs immediately accessible via
flux jobs
. Here's an example:The text was updated successfully, but these errors were encountered: