Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

testsuite: start/stop all queues with --all option #992

Merged
merged 1 commit into from
Dec 21, 2022

Conversation

chu11
Copy link
Member

@chu11 chu11 commented Dec 8, 2022

Problem: flux-core PR 4776 added the ability for queues to be individually started and stopped. As an update, starting/stopping all queues with the flux queue command now requires the --all option to be specified. This is not specified in several flux-sched tests, leading to test failures.

Update all callers to specify --all when multiple queues are started or stopped.

This should not be merged until flux-framework/flux-core#4776 is merged

@garlick
Copy link
Member

garlick commented Dec 9, 2022

I think I'm contradicting myself from a couple of days ago, but should we just allow flux queue start and flux queue stop to apply to all queues as before and avoid this breakage? If queues are configured and --all is unspecified, we could print a message like "warning: applying this command to all queues"?

It would be nice not to break an external interface, at least not without a transition period. What do you think?

@chu11
Copy link
Member Author

chu11 commented Dec 9, 2022

ok, lets move to try and make flux queue start/stop backwards compatible. closing

@chu11 chu11 closed this Dec 9, 2022
@chu11 chu11 reopened this Dec 9, 2022
@chu11
Copy link
Member Author

chu11 commented Dec 9, 2022

re-opening, although we're supporting backwards compatibility, we would prefer that tests do not output an unnecessary warning message. So we'd still like to do this change. It is just no longer "breaking" if we don't do it. We can merge at our leisure after flux-framework/flux-core#4776 is merged.

Problem: flux-core PR 4776 added the ability for queues to be
individually started and stopped.  As an update, starting/stopping
all queues with the `flux queue` command now requires the --all option
to be specified, otherwise a warning is output.

To avoid the warning, update all callers to specify --all when multiple
queues are started or stopped.
@codecov
Copy link

codecov bot commented Dec 10, 2022

Codecov Report

Merging #992 (d29c093) into master (dd3a26d) will not change coverage.
The diff coverage is n/a.

@@          Coverage Diff           @@
##           master    #992   +/-   ##
======================================
  Coverage    74.1%   74.1%           
======================================
  Files          78      78           
  Lines        9325    9325           
======================================
  Hits         6910    6910           
  Misses       2415    2415           

Copy link
Member

@garlick garlick left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@chu11 chu11 added the merge-when-passing mergify.io - merge PR automatically once CI passes label Dec 21, 2022
@mergify mergify bot merged commit 93ebe75 into flux-framework:master Dec 21, 2022
@chu11 chu11 deleted the flux_core_pr4776 branch January 10, 2023 22:41
garlick added a commit to garlick/flux-core that referenced this pull request Jan 27, 2023
Problem: if scheduler cannot reallocate resources to a running job,
the scheduler interface is torn down, requiring sys admin intervention.

This was seen in conjunction with flux-framework/flux-sched#992.

It's not really necessary for this to be fatal to the instance.
Raise a fatal exception on the job and let it be cleaned up in the
usual way.
garlick added a commit to garlick/flux-core that referenced this pull request Jan 28, 2023
Problem: if scheduler cannot reallocate resources to a running job,
the scheduler interface is torn down, requiring sys admin intervention.

This was seen in conjunction with flux-framework/flux-sched#992.

It's not really necessary for this to be fatal to the instance.
Raise a fatal exception on the job and let it be cleaned up in the
usual way.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
merge-when-passing mergify.io - merge PR automatically once CI passes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants