Releases: sodascience/artscraper
Releases · sodascience/artscraper
Release 0.3.1
Version 0.3.0
Main changes
- Make the package compatible with the new version of selenium (selenium 4).
- Make the package able to handle too-long filenames.
- Make the package able to deal with cases where an artist has a Wikipedia article in a language other than English, or no Wikipedia article at all.
- Make the package able to scrape artworks which have new features on their Google Arts & Culture page (ArtRemix / PoemPostcard).
- Make the package scroll through all artworks by an artist.
What's Changed
- Correct wikidata ID for date of death property in SPARQL query by @modhurita in #18
- Fix SPARQL query timeout problems by @modhurita in #22
- Wait after right arrow is clicked (not before) by @modhurita in #23
- We might need to use driver.quit() by @qubixes in #24
- fix wikipedia-related issues by @modhurita in #27
- Fixed filename too long and empty image file errors, removed redundant example notebooks by @modhurita in #29
- Fix pagination issue by @modhurita in #31
- Account for ArtRemix/PoemPostcard xpath changes by @modhurita in #32
- Example by @jgarciab in #33
- Change version number to 0.3.0 by @modhurita in #34
Full Changelog: v0.2.0...v0.3.0
Release v0.2.0
New features
- Collect urls for all artists.
- Collect artwork urls for each artist.
- Download image and metadata for each artist.
The new features are illustrated in example_collect_all_artworks.ipynb
.
Release v0.1.1
Update pypi description and fix timeout bug
Release v0.1.0-alpha
First release of artscraper. Untested on Windows.