-
Notifications
You must be signed in to change notification settings - Fork 6
/
eval.log
52 lines (52 loc) · 3.25 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_Romanian-RRT:
commit c5fdab5b6814409f9ae322d8ab19a4154e199fed
Author: Dan Zeman <[email protected]>
Date: Sun May 5 13:20:55 2024 +0200
Size: counted 218522 of 218522 words (nodes).
Size: min(0, log((N/1000)**2)) = 10.7737733919319.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.4.
Universal POS tags: 16 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.9.
Features: 183329 out of 218522 total words have one or more features.
Features: source of annotation (from README) factor is 0.9.
Universal relations: 35 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 90
Udapi: found 90 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 218522 words.
Genres: found 7 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang ro --max-err=10 UD_Romanian-RRT/ro_rrt-ud-dev.conllu
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang ro --max-err=10 UD_Romanian-RRT/ro_rrt-ud-test.conllu
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang ro --max-err=10 UD_Romanian-RRT/ro_rrt-ud-train.conllu
[Line 2690 Sent train-105 Node 11]: [L3 Warning leaf-det] 'det' not expected to have children (11:câteva:det --> 10:vreo:amod)
[Line 65178 Sent train-2315 Node 8]: [L3 Warning leaf-det] 'det' not expected to have children (8:celor:det --> 11:editat:acl)
[Line 101168 Sent train-3943 Node 13]: [L3 Warning leaf-det] 'det' not expected to have children (13:mult:det --> 16:trebuie:advcl)
[Line 115974 Sent train-4508 Node 1]: [L3 Warning leaf-det] 'det' not expected to have children (1:Orice:det --> 9:au:aux)
[Line 130589 Sent train-5101 Node 1]: [L3 Warning leaf-det] 'det' not expected to have children (1:Multă:det --> 3:mâncarea:nmod)
[Line 158791 Sent train-6097 Node 26]: [L3 Warning leaf-det] 'det' not expected to have children (26:același:det --> 32:statele:nmod)
[Line 164700 Sent train-6323 Node 4]: [L3 Warning leaf-det] 'det' not expected to have children (4:Toate:det --> 20:vide:appos)
Warnings: 7
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.9) = 0.0692307692307692
(weight=0.0769230769230769) * (score{genres}=0.411764705882353) = 0.0316742081447964
(weight=0.0769230769230769) * (score{lemmas}=0.4) = 0.0307692307692308
(weight=0.256410256410256) * (score{size}=0.779831722232018) = 0.199956851854364
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.847058823529412) = 0.065158371040724
(weight=0.307692307692308) * (score{udapi}=0.995881421550233) = 0.306425052784687
(weight=0.0769230769230769) * (score{udeprels}=0.945945945945946) = 0.0727650727650728
(TOTAL score=0.827261607871695) * (availability=1) * (validity=1) = 0.827261607871695
STARS = 4
UD_Romanian-RRT 0.827261607871695 4