Replies: 3 comments 6 replies
-
i think it might be even slower to store dag files to s3 bucket. which airflow version are you using? and do you use KubernetesExecutor or CeleryExecutor? |
Beta Was this translation helpful? Give feedback.
-
Yes. the EFS has performance issues that others complained about - they were usually solved by reserving IOPS for them (basically adding money to get better performance). I personally recommend going git-sync route, for me this is the most stable and easiest way to make good performance of file syncing works (and you usually anyhow keep your DAG files in Git more often than not). You can use S3 if you make some S3 -> local file syncing but you are basically on your own to do such solution (though MWAA is apparently using it so it is possible). Basically you have to make sure that all the components of airlfow see the DAG folder in specific location and files are synced there. There is no "place" in Airflow to do that - airlflow simply reads the files from that folder and how the files are synced to the folder is out-of-the-realm of Airflow as application. This is generally a deployment issue how you sync the files. In our official chart you can see some of the options how syncing can be done https://airflow.apache.org/docs/helm-chart/stable/manage-dags-files.html - and take it as a "base". Or (actually I'd recommend that) you could use our chart to deploy Airflow - it's rather versatile and solves a lot of deployment issues like that. |
Beta Was this translation helpful? Give feedback.
-
Is it possible to run mount-s3 with dockers entry-point? that should mount s3-bucket with startup which then could be pointed as dags_folder. seems like except editing a file, reading and writing is supported by it. |
Beta Was this translation helpful? Give feedback.
-
Hello Airflow community,
We set up Airflow on an Amazon EKS cluster, with an Amazon EFS for DAG files storage but it created several issues of performance, and I was wondering if storing these DAG files into a S3 bucket was possible and, if not, which part of code I should modify because I cannot find where the DAG files reading is actually made.
Thank you in advance,
Beta Was this translation helpful? Give feedback.
All reactions