Update Europeana to collect image dimensions #1484
Labels
💻 aspect: code
Concerns the software code in the repository
✨ goal: improvement
Improvement to an existing user-facing feature
🟩 priority: low
Low priority and doesn't need to be rushed
🧱 stack: catalog
Related to the catalog and Airflow DAGs
Problem
We'd like to make sure that all of our image provider scripts are collecting data about image dimensions.
Description
The Europeana script does not currently do so, but the information is available in the API via the single record endpoint. We should update the provider script to collect this info.
There is precedent for making an additional API request per record, notably Flickr but also some others like Metropolitan and NYPL. That being said we should make sure that adding it here doesn't have any dramatic effect on performance.
Additional context/dependent PRs
Note that Europeana is currently turned off in production, because turning it on will initiate a backfill. We should wait for WordPress/openverse-catalog#644 to make sure that we can omit the DAG from Slack notifications, to prevent flooding the channel.<- This has been merged.We also have an issue to update Europeana to use the new API endpoint, and update any necessary fields: #1727. It may be helpful or even necessary to do this work first.<- WordPress/openverse-catalog#974 has been merged.Implementation
The text was updated successfully, but these errors were encountered: