Recover lost websites from the Web Infrastructure
-
Updated
Feb 10, 2021 - HTML
Recover lost websites from the Web Infrastructure
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
ODU Web Science and Digital Libraries Research Group (WS-DL) home page.
Makes saving pages in bulk to the wayback machine much easier
Wget-compatible web downloader and crawler.
QA Mementos using Screenshots
Add a description, image, and links to the web-archiving topic page so that developers can more easily learn about it.
To associate your repository with the web-archiving topic, visit your repo's landing page and select "manage topics."