-
Notifications
You must be signed in to change notification settings - Fork 14.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding loggroomer sidecar to dag processor #30726
Conversation
The standalone dagProcessor is not enabled by default, nor is there anything else in values.yaml, nor is the field equal to the folder name. |
umm, not sure why these static checks were failed :(
Any idea for this issue? Thanks! |
The output is pretty clear: The formatting is wrong.
Any of those should help. |
Thank you @potiuk ! |
Awesome work, congrats on your first merged pull request! You are invited to check our Issue Tracker for additional contributions. |
{{- if .Values.dagProcessor.extraVolumeMounts }} | ||
{{- tpl (toYaml .Values.dagProcessor.extraVolumeMounts) . | nindent 12 }} | ||
{{- end }} | ||
{{- if or .Values.webserver.webserverConfig .Values.webserver.webserverConfigConfigMapName }} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess having webserver-config
in the list of volumeMounts
requires declaration in volumes
as well (missing in this PR). Here is an example from triggerer. Without it for me Helm chart 1.10.0 is not working.
This PR attaches the logGroomer sidecar pod when using a standalone dag processor. This is to prevent scheduler logs from growing infinitely in the logs directory of that dag processor pod.
In fact, one of the Airflow clusters my team uses had about 3.5 TiB of scheduler logs accumulated in the emptyDir of a dag processor pod, which reduced the ephemeral-storage availability on that node to the point of pod eviction, resulting in a pod eviction.
airflow@airflow-test-dag-processor-78f9bfdb88-hmckb:/opt/airflow/logs$ ls
scheduler
airflow@airflow-test-dag-processor-78f9bfdb88-hmckb:/opt/airflow/logs$ du -sh
3.5T .
We haven't figured out why the standalone dag processor was accumulating scheduler logs, but we think it's a good idea to attach a logGroomer sidecar like any other scheduler pod or worker pod to prevent this from happening in the first place.
In this PR, we've modified the helm chart to attach a logGroomer to the standalone dag processor.
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.