web-archiving
Here are 114 public repositories matching this topic...
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
Updated
Dec 3, 2024 - Python
Free web archiving and sharing service based on Cloudflare. 基于 Cloudflare 的免费网页归档和分享工具。
-
Updated
Dec 3, 2024 - TypeScript
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
-
Updated
Dec 3, 2024 - TypeScript
homepage and platform for chinese trans digital archive
-
Updated
Dec 2, 2024 - TypeScript
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
-
Updated
Dec 1, 2024 - Java
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, mirroring, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
-
Updated
Nov 30, 2024 - Python
Run a high-fidelity browser-based web archiving crawler in a single Docker container
-
Updated
Nov 30, 2024 - TypeScript
Makes saving pages in bulk to the wayback machine much easier
-
Updated
Nov 30, 2024 - HTML
Archive a list of URLs using the Wayback Machine
-
Updated
Nov 29, 2024 - Python
The repository and website hosting the peer review process for new Programming Historian lessons
-
Updated
Dec 2, 2024 - Jupyter Notebook
Serverless replay of web archives directly in the browser
-
Updated
Nov 28, 2024 - TypeScript
CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
-
Updated
Nov 27, 2024 - JavaScript
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
-
Updated
Nov 27, 2024 - TypeScript
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
-
Updated
Nov 23, 2024 - TypeScript
InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
-
Updated
Nov 21, 2024 - Python
CLI implementation of httpreserve that can test links and retrieve internet archive replacements
-
Updated
Nov 21, 2024 - Go
Improve this page
Add a description, image, and links to the web-archiving topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the web-archiving topic, visit your repo's landing page and select "manage topics."