Selected Letters of Mondriaan in Text-Fabric.
In this repository we prepare a "Proeftuin" of 14 letters of Mondriaan for the application of data science.
We make a test bed for the data transformations from the original TEI through Text-Fabric to Web Annotations, the format that will drive the resulting website.
In this repository you can see and follow and reproduce the whole chain.
The results in this repo can be used to visualise the effects of data design on the interface, so that we can get feedback on them, in order to arrive at a display that meets the demands of the Mondriaan project.
-
if you do not have Python installed, install it.
-
if you do not have Text-Fabric installed, install it by opening a terminal/command line and saying:
pip install 'text-fabric[all]'
or, if you have it already, check whether an upgrade is available:
pip install --upgrade 'text-fabric[all]'
-
Start the Text-Fabric browser, from the command line:
text-fabric annotation/mondriaan
This will fetch the corpus and open a browser window where you can leaf through the texts and make queries. Corpus information and Help are provided in the left side bar.
-
Alternatively, you can work in a Jupyter notebook:
pip install jupyterlab
jupyter lab
and inside the notebook, in a code cell, run
from tf.app import use A = use('annotation/mondriaan')
which will also download the corpus.
In both cases, the corpus ends up in your home directory,
under text-fabric-data
.
We have published several tutorial notebooks that show how the Text-Fabric version of the data can be used.
Use transcription as a reference to the ins and outs of the TF encoding of the data. This file is one click away when you are using Text-Fabric.
Several memory institutions and their associated developers are working to make rich datasets available to the public and researchers.
The Mondriaan Letters collection is such a dataset for which active development is taking place as part of the Mondrian edit project.
Participating institutes are:
See about for the provenance of the data and a more detailed list of all people involved.
As the badge above indicates: this is work in progress. Although we use this repository to develop an official website to publish the letters by Mondriaan, nothing that you see here is a promise of what that website will look like.
-
2023-04-21 Source data, Text-Fabric data and Web annotation data all present, the pipeline works. The conversion aims at conserving all information that is present in the source throughout all stages in the pipeline. Later we should discuss which elements should be rendered and in what ways, and with what controls on the interface.
-
2023-04-12 Initial content, the source data is not yet present, hence the programs do not yet work in this context. They do work in another repo, but that repo is not accessible.
See some other TF datasets on GitHub:
-
Herman Melville, Moby Dick
-
Descartes, Letters
-
or one of these corpora.
For those who have worked on the source material, and the ones that work on the Mondriaan project, see about.
Dirk Roorda has set up and organized this repo, and designed the data conversions. He has also written Text-Fabric.