-
Notifications
You must be signed in to change notification settings - Fork 14.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Task set to be failed by scheduler directly, while all up streams success #17370
Comments
Thanks for opening your first issue here! Be sure to follow the issue template! |
Could you post some context around the log? That log line simply says the task has failed, which we already know from your description. Something might have happened before that to trigger the failure state. |
That’s unfortunate. Maybe someone would be able to spot some pattern. |
@cctvzd7 can you add a dag that can be used to reproduce this? |
Is there anything in the normal scheduler log (the above is the parsing log)? |
@jedcunningham @uranusjr
|
May be related with: #16625 |
@jedcunningham @uranusjr |
@ephraimbuddy I don't think that #16301 solves #16625, the code in that PR is only ran when the container has been started. But the issue in #16625 happens when a pod has never started but did fail. I added some more info on the issue #16625 maybe this sheds a bit more light on what might be going on. |
Oh, I see, we even had an issue with task being stuck in queued because a POD had an error starting or something else happened, and the executor report that the task has failed while the scheduler still sees it as queued. We made this change #15929 which has not been released to resolve the task getting stuck. |
Ah I thought this code was already active in our environment but it isn't 😅 If you could look into making this more robust that would be great. If I can provide more info or help please let me know |
This is a duplicate of #18401 right? |
This issue has been automatically marked as stale because it has been open for 365 days without any activity. There has been several Airflow releases since last activity on this issue. Kindly asking to recheck the report against latest Airflow version and let us know if the issue is reproducible. The issue will be closed in next 30 days if no further activity occurs from the issue author. |
This issue has been closed because it has not received response from the issue author. |
Apache Airflow version: 2.1.1
Kubernetes version (if you are using kubernetes) (use
kubectl version
):Environment:
uname -a
):Linux e08g09383.cloud.eu13 3.10.0-327.ali2010.rc7.alios7.x86_64 #1 SMP Thu Jun 29 21:45:21 CST 2017 x86_64 x86_64 x86_64 GNU/Linux
I installed the airflow via docker-compose.yaml, and my docker-compose.yaml is as attach file: docker-compose.txt
What happened:
A task was set to be failed directly by scheduler, while all up streams were success.
The task has not be run, no log info either.
What you expected to happen:
The task should be running
How to reproduce it:
It happens by chance, I can not reproduce it.
Anything else we need to know:
A log from scheduler:
The text was updated successfully, but these errors were encountered: