-
Notifications
You must be signed in to change notification settings - Fork 17
medrxiv biorxiv download
petermr edited this page May 5, 2020
·
8 revisions
These *rxiv
s have no API but can be accessed by a restful query. The process of download is shown in AMIDownloadTool
and AMIDownloadTest
. These examples can be seen as PoC; our current strategy is to use Ferret
if possible.
These rxiv
s work in 3 or 4 steps when run by a human:
- search/query generates a paged hitlist (e.g. 25 hits per page).
- foreach hitlist link create a landingpage.
- foreach landingpage retrieve (a)
fulltext.html
(b)fulltext.pdf
- (optional) foreach fulltext.html retrieve supplemental files