15 Apr 09:41

qubixes

Release 0.3.1 Latest

Bump version

08 Apr 14:55

modhurita

Version 0.3.0

Main changes

Make the package compatible with the new version of selenium (selenium 4).
Make the package able to handle too-long filenames.
Make the package able to deal with cases where an artist has a Wikipedia article in a language other than English, or no Wikipedia article at all.
Make the package able to scrape artworks which have new features on their Google Arts & Culture page (ArtRemix / PoemPostcard).
Make the package scroll through all artworks by an artist.
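As an illustration of the too-long filename issue mentioned above, here is a minimal sketch of one way to shorten a name while keeping its extension. The helper name `shorten_filename` and the 255-byte limit are assumptions made for this example (many filesystems cap a single path component at about that size); the notes do not describe how the package itself handles this.

```python
import os

# Assumed limit: many filesystems cap one path component at 255 bytes.
MAX_NAME_BYTES = 255


def shorten_filename(name: str, max_bytes: int = MAX_NAME_BYTES) -> str:
    """Hypothetical helper: truncate the stem of `name` so that the
    UTF-8 encoded result fits in `max_bytes`, keeping the extension."""
    stem, ext = os.path.splitext(name)
    budget = max_bytes - len(ext.encode("utf-8"))
    if budget <= 0:
        raise ValueError("extension alone exceeds the byte limit")
    # Cut on bytes, because filesystem limits count bytes, not characters.
    encoded = stem.encode("utf-8")[:budget]
    # A multi-byte character may have been cut in half; drop the remainder.
    stem = encoded.decode("utf-8", errors="ignore")
    return stem + ext
```

Names that already fit are returned unchanged, so the helper could be applied to every output filename before writing.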

What's Changed

Correct Wikidata ID for date of death property in SPARQL query by @modhurita in #18
Fix SPARQL query timeout problems by @modhurita in #22
Wait after right arrow is clicked (not before) by @modhurita in #23
We might need to use driver.quit() by @qubixes in #24
Fix Wikipedia-related issues by @modhurita in #27
Fixed filename too long and empty image file errors, removed redundant example notebooks by @modhurita in #29
Fix pagination issue by @modhurita in #31
Account for ArtRemix/PoemPostcard xpath changes by @modhurita in #32
Example by @jgarciab in #33
Change version number to 0.3.0 by @modhurita in #34
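For context on the date of death fix in #18: on Wikidata, date of death is property P570 (date of birth is P569). The sketch below builds a small SPARQL query using that property. The function name `death_date_query` and the query shape are assumptions for illustration only; the package's actual query is not shown in these notes.

```python
def death_date_query(artist_label: str, lang: str = "en") -> str:
    """Hypothetical helper: build a SPARQL query asking Wikidata for the
    date of death (P570) of a human (Q5) with the given label."""
    # Escape backslashes and double quotes so the label stays a valid literal.
    escaped = artist_label.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "SELECT ?person ?dateOfDeath WHERE {\n"
        f'  ?person rdfs:label "{escaped}"@{lang} ;\n'
        "          wdt:P31 wd:Q5 ;\n"
        "          wdt:P570 ?dateOfDeath .\n"
        "}\n"
        "LIMIT 1"
    )


if __name__ == "__main__":
    print(death_date_query("Vincent van Gogh"))
```

The query only builds a string and performs no network request; sending it to a SPARQL endpoint is left out of this sketch.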

Full Changelog: v0.2.0...v0.3.0

Contributors

modhurita, jgarciab, and qubixes

30 May 08:38

modhurita

Release v0.2.0

New features

Collect URLs for all artists.
Collect artwork URLs for each artist.
Download image and metadata for each artist.

The new features are illustrated in example_collect_all_artworks.ipynb.

04 Oct 13:16

jgarciab

Release v0.1.1

Update pypi description and fix timeout bug

04 Oct 12:05

jgarciab

Release v0.1.0-alpha Pre-release

First release of artscraper. Untested on Windows.
