This bundle contains 99 sentences annotated according to the foundational layer of UCCA. The total number of tokens in this corpus is 1312.
The English corpus used here is the book "The Little Prince" (Le Petit Prince), a classic novel written in French by Antoine de Saint-Exupéry, and first published in 1943. This is the same text as was used in the AMR Little Prince corpus in English (https://amr.isi.edu/download.html).
Information about the format of the xml files and source code for reading and manipulating them are available at https://universalconceptualcognitiveannotation.github.io/.
The annotation was conducted at the Hebrew University of Jerusalem. If you use this corpus, please cite:
@inproceedings{Oep:Abe:Abz:20,
author = {Oepen, Stephan and Abend, Omri and Abzianidze, Lasha and
Bos, Johan and Haji\v{c}, Jan and Hershcovich, Daniel and
Li, Bin and O'Gorman, Tim and Xue, Nianwen and Zeman, Daniel},
title = {{MRP}~2020: {T}he {S}econd {S}hared {T}ask on
{C}ross-Framework and {C}ross-{L}inguistic
{M}eaning {R}epresentation {P}arsing},
booktitle = {Proc. of CoNLL Shared Task},
year = 2020
}
The UCCA annotation is distributed under the "Attribution-ShareAlike 3.0 Unported" license (http://creativecommons.org/licenses/by-sa/3.0/). Please follow the link for exact details.