-
Notifications
You must be signed in to change notification settings - Fork 14.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"deadlock detected" error in dag table updates #21947
Comments
Thanks for opening your first issue here! Be sure to follow the issue template! |
I have seen this error with multi-scheduler setup. How many schedulers are you using? |
Just one active scheduler at a time. |
Deadlocks are fine except if they caused the scheduler to stop. From the log above, this happens in your database and does not stop your scheduler. Looks similar to this fix #19842 |
Yep. duplicate of #19842 |
Also seeing this in the same table with a massive number of DAGs (3k+). We are running multi-scheduler. Will #19842 resolve this? We're on
By itself, the deadlocks don't feel like a problem, but when they throw an uncaught exception, the process exits. Since we're running the schedulers in k8s, it trips the restart counter quite a bit, and the more often it happens the higher that goes... not good. Here's a snip of some of the stack trace. It got clipped since this isn't being piped into the termination log so it's capped at the k8s limit:
|
Apache Airflow version
2.0.2
What happened
Likely places which conflict in the code:
airflow/airflow/models/dag.py
Line 2436 in d726579
airflow/airflow/models/dag.py
Line 2919 in d726579
What you expected to happen
No response
How to reproduce
We have 830 dags and this number keeps growing due to business requirements. No special steps to reproduce are required, it just happens every now and then, as would be expected. Increase fillfactor, perhaps?
Operating System
VERSION="20.04.3 LTS (Focal Fossa)"
Versions of Apache Airflow Providers
apache-airflow-providers-celery==1.0.1
apache-airflow-providers-ftp==1.0.1
apache-airflow-providers-http==1.1.0
apache-airflow-providers-imap==1.0.1
apache-airflow-providers-sqlite==1.0.1
Deployment
Other Docker-based deployment
Deployment details
Postgres: psql (11.5 (Ubuntu 11.5-1.pgdg16.04+1))
Anything else
Multiple times a day.
Took remediation steps to reduce amount of rows on page:
which reduced frequency to ~1/day.
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: