Backfill license_url
field for images where it's null in the meta_data
#3885
Labels
🗄️ aspect: data
Concerns the data in our catalog and/or databases
🧰 goal: internal improvement
Improvement that benefits maintainers, not users
🟨 priority: medium
Not blocking but should be addressed soon
python
Pull requests that update Python code
🧱 stack: catalog
Related to the catalog and Airflow DAGs
🔧 tech: airflow
Involves Apache Airflow
Problem
As a requirement for #703, we must fill the
metadata.license_url
field for images without it. At the moment of writing this issue, the number amounts to 97.7 million in the upstream DB.Description
This potentially could be solved by a one-off DAG that gets the license URL computed from the
license
andlicense_version
fields. @obulat made one DAG previously for filling one of the cases when the license URL is null, see #1005.Additional context
See previous work at #1565.
The text was updated successfully, but these errors were encountered: