Create the new data refresh DAG factory and move initial steps into Airflow #4146
Labels
💻 aspect: code
Concerns the software code in the repository
✨ goal: improvement
Improvement to an existing user-facing feature
🟨 priority: medium
Not blocking but should be addressed soon
🧱 stack: catalog
Related to the catalog and Airflow DAGs
Milestone
Problem
This issue tracks the creation of a new data refresh DAG factory to generate the new data refresh DAGs that will not rely on the ingestion server, and moving the initial steps (described below) into the DAGs. At the end of this step the DAGs will not be functional/able to run a full refresh.
Description
We’ll create a new data refresh DAG factory to generate data refresh DAGs for each existing media_type and environment. Currently these four will be generated:
Because the environment is added as a prefix, there will be no collision with the existing DAG ids. In this initial step, we we will add only a small portion of the logic in order to make the PR easier to review. The first steps are already implemented in the current data refresh and can simply be copied:
We will include new tasks to perform the initial few steps of the ingestion server’s work:
Additional context
See this section of the IP
The text was updated successfully, but these errors were encountered: