Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Log instances of delays due to resource limitations and their causes #132

Merged
merged 5 commits into from
Apr 30, 2021
Merged

Log instances of delays due to resource limitations and their causes #132

merged 5 commits into from
Apr 30, 2021

Conversation

13thProgression
Copy link
Contributor

@13thProgression 13thProgression commented Apr 19, 2021

Fixes #129
Include job age any time that a plot is held off
Insert the last delay message into the log if it isn't a standard stagger delay

Copy link
Collaborator

@altendky altendky left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ericaltendorf, looks ok to me. Any concerns adding the stagger info to the other statuses?

@altendky altendky requested a review from ericaltendorf April 22, 2021 02:04
log.log(msg)
plotting_status = '<just started job>'
jobs = Job.get_running_jobs(cfg.directories.log, cached_jobs=jobs)
else:
# If a plot is delayed for any reason other than stagger, log it
if msg.find("stagger") < 0:
aging_reason = msg
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So ... when we start a new plot job, we'll log the reason it had been delayed? That could be useful.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It has helped me with tuning my machines by suggesting what resources are lacking. The only downside that I have encountered is that the Log: count message no longer reflects the total number of jobs launched by plotman since delay messages are also counted.

@altendky
Copy link
Collaborator

@13thProgression, thanks for joining the ranks of plotman developers. Cheers.

@altendky altendky merged commit e1a195c into ericaltendorf:development Apr 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants