This is an algorithmic toolkit for R, designed for transparent quantitative analysis of the Finnish national bibliography Fennica metadata collection.
The data is summarized in the following automatically generated files:
- Overall summary of the Fennica metadata
- Analyses on specific publication places and other topics (see the .md files)
- Digital History in Finland seminar (University of Helsinki, Dec 9, 2015)
The algorithmic details needed to reproduce these summaries from the raw data are fully documented in the source code. The workflow covers several steps: extracting the raw data, harmonizing the textual annotation fields, preprocessing the information, and carrying out statistical analysis and visualization. This package uses additional tools from the more generic bibliographica package and many other R packages. The raw data is confidential and available only under a separate agreement, so we can publish only statistical summaries and our own analysis source code here.
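As a rough illustration of what harmonizing and summarizing such fields can look like, here is a minimal base R sketch. It is not the package's actual code: the toy records, the column names, the synonym table, and the helper functions `harmonize_place` and `extract_year` are all hypothetical, and no functions from this package or from bibliographica are used.

```r
# Toy records standing in for the confidential raw data (invented values).
records <- data.frame(
  publication_place = c("Helsingfors", "helsinki ", "Wiborg", "Turku", "TURKU"),
  publication_year  = c("1799", "[1801]", "1640.", "1642", "s.a."),
  stringsAsFactors  = FALSE
)

# Hypothetical synonym table: lower-cased name variant -> harmonized name.
place_synonyms <- c(
  helsingfors = "Helsinki",
  helsinki    = "Helsinki",
  wiborg      = "Viipuri",
  turku       = "Turku"
)

# Harmonize place names: trim whitespace, lower-case, look up the variant.
# Variants missing from the table come back as NA.
harmonize_place <- function(x) {
  key <- tolower(trimws(x))
  unname(place_synonyms[key])
}

# Extract the first four-digit run as the year; NA when none is present.
extract_year <- function(x) {
  year <- rep(NA_integer_, length(x))
  has_year <- grepl("[0-9]{4}", x)
  year[has_year] <- as.integer(
    sub(".*?([0-9]{4}).*", "\\1", x[has_year], perl = TRUE)
  )
  year
}

# Preprocess the fields, then derive the publication decade.
records$place  <- harmonize_place(records$publication_place)
records$year   <- extract_year(records$publication_year)
records$decade <- (records$year %/% 10L) * 10L

# Simple statistical summary: number of documents per harmonized place.
place_counts <- table(records$place)
print(place_counts)
```

A lookup table keeps the harmonization rules transparent and easy to review, which fits the aim of reproducible summaries. A real synonym list would be far larger and would need handling for ambiguous or unknown names.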
Authors: Leo Lahti, Niko Ilomaki, Hege Roivainen, Mikko Tolonen. Part of rOpenGov.
The tools are under active, open development, and the analyses and documentation are constantly updated. You are welcome to:
- submit suggestions and bug reports
- send a pull request (we will acknowledge contributions)
- join IRC at !ropengov@Freenode
- contact or follow us