[v7r3] Kill sweep jobs that take more than 30 minutes #7148

Merged (1 commit), Aug 2, 2023


chrisburr
Member

Occasionally the sweeper gets stuck (probably due to a network glitch) and then waits 6 hours before it is killed. Because we limit the concurrency of the job to 1, this blocks all sweeping for the full 6 hours.

I think a shorter limit of 30 minutes is preferable, to avoid the confusion this causes.

We also don't need to worry about lost jobs: the sweeper is designed to be eventually consistent, so it does not rely on a specific PR's merge CI to pick that PR up.
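
For illustration, a change like this could look roughly as follows in a GitHub Actions workflow. This is a minimal sketch, not the actual diff from this PR: the workflow name, trigger, concurrency group name, job id, and step are all assumptions. Only `timeout-minutes` and `concurrency` are standard GitHub Actions keys; when `timeout-minutes` is not set, a job is killed after the default of 360 minutes, which matches the 6 hour wait described above.

```yaml
# Hypothetical sketch: names, trigger, and steps are illustrative, not taken from this PR.
name: Sweep                     # assumed workflow name

on:
  push:                         # assumed trigger; the real one is not shown in this PR
    branches:
      - rel-v7r3

# Allow only one sweep at a time; later runs queue behind the running one.
concurrency:
  group: sweep                  # assumed group name
  cancel-in-progress: false

jobs:
  sweep:                        # assumed job id
    runs-on: ubuntu-latest
    # Kill the job after 30 minutes instead of the 360 minute default,
    # so a stuck sweeper cannot block the concurrency group for hours.
    timeout-minutes: 30
    steps:
      - name: Run sweeper       # placeholder step
        run: echo "sweeper would run here"
```

With a concurrency group limiting runs to one at a time, the timeout bounds how long a stuck run can hold up the queue.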

@DIRACGridBot DIRACGridBot added the alsoTargeting:rel-v8r0 Cherry pick this PR to rel-v8r0 after merge label Aug 2, 2023
@fstagni fstagni merged commit 0c44a0e into DIRACGrid:rel-v7r3 Aug 2, 2023
@DIRACGridBot DIRACGridBot added the sweep:done All sweeping actions have been done for this PR label Aug 2, 2023
DIRACGridBot pushed a commit to DIRACGridBot/DIRAC that referenced this pull request Aug 2, 2023
@DIRACGridBot

Sweep summary

Sweep ran in https://github.com/DIRACGrid/DIRAC/actions/runs/5738651612

Successful:

  • rel-v8r0

@chrisburr chrisburr deleted the sweep-timeout branch August 2, 2023 14:11