From bcc8a3d33efcbfc683cd016ad7a902bfd117d84b Mon Sep 17 00:00:00 2001
From: bobhanson
Date: Mon, 4 Nov 2024 17:36:04 -0600
Subject: [PATCH] docs/index.html

---
 docs/index.htm  |  0
 docs/index.html | 19 +++++++++++++++++++
 2 files changed, 19 insertions(+)
 delete mode 100644 docs/index.htm
 create mode 100644 docs/index.html

diff --git a/docs/index.htm b/docs/index.htm
deleted file mode 100644
index e69de29b..00000000
diff --git a/docs/index.html b/docs/index.html
new file mode 100644
index 00000000..89d14b59
--- /dev/null
+++ b/docs/index.html
@@ -0,0 +1,19 @@
+
+

CDX/CDXML specification

+352 pages retrieved from multiple points on the Wayback Machine. + +

CDX/CDXML specification +

+ + +

examples/v6-icl-repository-DOI-crawl

+Output from DOICrawler.java collecting the contents of +the high-performance computing repository at +Imperial College London. At least in Firefox, the links work in the JSON output. + +

IFD.findingaid.json +

ifd-fielURLMap.txt +

crawler.log +

+
+
\ No newline at end of file