-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
converging UD_Russian and UD_Russian-SynTagRus annotation #10
Comments
|
ru-syntagrus: 1 Более более ADV _ Degree=Cmp 4 nsubj 4:nsubj _ 21 больше много NUM _ _ 23 nummod:gov 23:nummod:gov _ |
более/больше/менее/меньше should be linked to the numeral head, cf. террористов там было не более двух. |
|
I think that the ordinals are compounds: compound(второго, сорок) |
Related to numerals is UniversalDependencies/docs#455. |
nmod (dep?) depending on ADJ or ADV --> obl |
If the ADJ or ADV is a head of copula construction then you are right: such ADJ|ADV should not have BTW: This is exactly the case when the |
acl with participles (single participles vs. prtcp group), advcl vs. acl. Need attention. |
discourse/parataxis is tagged differently in two treebanks |
vocative: check parataxis & NOUN & Animacy=Anim in ru-SynTagRus |
Cases like "сорок пять" should be annotated as сорок >flat пять according to http://universaldependencies.org/u/dep/flat.html.
In UD2.0 files: ru: сорок >compound пять, сорок <nummod пять
ru-syntagrus: сорок <nummod:gov пять
"Universal" approach is somewhat problematic since in двадцать один, двадцать два, двадцать три, двадцать четыре the last numeral predicts the case of the noun (cf. nummod:gov), so we will have different tags on the first numeral word depending what its dependent is.
::::: 1--4: the rules seem to be all right, but some overgeneralization happens
The text was updated successfully, but these errors were encountered: