pga — August 1, 2022 dataset
OTA-Bot
released this
01 Aug 08:30
·
2808 commits
to main
since this release
This dataset consolidates the contractual documents of 19 service providers, in all their versions that were accessible online between April 20, 2022 and August 1, 2022.
This dataset is tailored for datascientists and other analysts. You can also explore all these versions interactively on https://github.com/OpenTermsArchive/pga-versions.
It has been generated with Open Terms Archive.
Dataset format
This dataset represents each version of a document as a separate Markdown file, nested in a directory with the name of the service provider and in a directory with the name of the document type. The filesystem layout will look like below.
├ README.md
├┬ Service provider 1 (e.g. Facebook)
│├┬ Document type 1 (e.g. Terms of Service)
││├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-08-01T01-03-12Z.md)
┆┆┆
││└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-10-03T08-12-25Z.md)
┆┆
│└┬ Document type X (e.g. Privacy Policy)
│ ├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-05-02T03-02-15Z.md)
┆ ┆
│ └ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-11-14T12-36-45Z.md)
┆
└┬ Service provider Y (e.g. Google)
├┬ Document type 1 (e.g. Developer Terms)
│├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2019-03-12T04-18-22Z.md)
┆┆
│└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-12-04T22-47-05Z.md)
└┬ Document type Z (e.g. Privacy Policy)
┆
├ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-05-02T03-02-15Z.md)
┆
└ YYYY-DD-MMTHH-MM-SSZ.md (e.g. 2021-11-14T12-36-45Z.md)
License
This dataset is made available under an Open Database (OdBL) License by Open Terms Archive Contributors.