-
Notifications
You must be signed in to change notification settings - Fork 54
Conversation
Signed-off-by: Olga Bulat <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Excited for all this code cleanup! Just a few comments. The only other thumbnail
references I'm seeing are in the archive/commoncrawl, which is not being used at the moment and seems like a separate can of worms.
Could we add a test that the thumbnail gets hard-coded to None
, maybe in the image store tests?
openverse_catalog/dags/providers/provider_api_scripts/jamendo.py
Outdated
Show resolved
Hide resolved
openverse_catalog/dags/providers/provider_api_scripts/phylopic.py
Outdated
Show resolved
Hide resolved
openverse_catalog/dags/providers/provider_api_scripts/wordpress.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Olga Bulat <[email protected]>
…ript Signed-off-by: Olga Bulat <[email protected]>
Signed-off-by: Olga Bulat <[email protected]>
Signed-off-by: Olga Bulat <[email protected]>
I tried adding a specific test for
I wonder if it was better to simply use |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is great! I love seeing change counts like this 🤩
This prevents us from acquiring new thumbnail IDs - it would be awesome if we could remove thumbnail_url
altogether from the catalog and the API. Will there be a follow up PR to remove that column from the table? I think it would be a good idea to do that shortly after this 😄
openverse_catalog/dags/providers/provider_api_scripts/brooklyn_museum.py
Outdated
Show resolved
Hide resolved
@@ -94,7 +94,8 @@ def _get_items(): | |||
while should_continue: | |||
query_params = _get_query_params(offset=offset) | |||
batch_data = _get_batch_json(query_params=query_params) | |||
if isinstance(batch_data, list) and len(batch_data) > 0: | |||
logger.info(f"batch_data: {batch_data}, query_params: {query_params}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
batch_data
is going to be a pretty large list in most cases, do you think this could be a logger.debug
line?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It was probably left their by mistake :)
openverse_catalog/dags/providers/provider_api_scripts/jamendo.py
Outdated
Show resolved
Hide resolved
…_museum.py Co-authored-by: Madison Swain-Bowden <[email protected]>
I noted it elsewhere, but will also write it here: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Totally makes sense! 🚀 Do you think it might be worth running an update to set the field to NULL
for all existing images?
Yes, absolutely! |
openverse_catalog/dags/providers/provider_api_scripts/jamendo.py
Outdated
Show resolved
Hide resolved
openverse_catalog/dags/providers/provider_api_scripts/jamendo.py
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! 🎉 Look at that beautiful stat for lines changed 😄
Fixes
Fixes WordPress/openverse#1561 by @obulat
Description
This PR removes
thumbnail_url
from image provider scripts, and hard-codesthumbnail_url
toNone
in the imageStore class.Testing Instructions
Run the tests - they should pass. Try running the image DAGs and see that they save image thumbnails as NULL.
Checklist
Update index.md
).main
) or a parent feature branch.Developer Certificate of Origin
Developer Certificate of Origin