-
Notifications
You must be signed in to change notification settings - Fork 14.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Airflow scheduler zombies don't go away #1157
Comments
Note: this is on version 1.6.2 |
This is fixed in PR-#855 but needs testing (and rebasing). If I rebase can you please test? |
@bolkedebruin awesome! Yes, I can test. |
@criccomini I updated the PR. Please note that my change involves a behavioral change to airflow (see command line options) |
Closing this issue. I believe it's fixed by #855 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Our monitoring is seeing zombies hang around for a long time:
Here, you can see that the parent scheduler 10331 has a bunch of defunct subprocesses. These are hanging around for many minutes.
This appears to correlate with when I set
--num_runs
to 60 (with 15s job scheduler heartbeat). Prior to this, I didn't have num_runs set for the scheduler. I'm wondering if something isn't right about the way that num_runs is working.Also, I'm using
supervisord
to manage the scheduler. I have autorestart set to always.The text was updated successfully, but these errors were encountered: