Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_English-GUM:
commit 4b44789f75afda58870631ce8a5c6ec607227e10
Merge: 2f28a01 34d01cb
Author: Dan Zeman <[email protected]>
Size: counted 211920 of 211920 words (nodes).
Size: max(0, log((N/1000)**2)) = 10.7124176899276.
Size: maximum value 13.815511 is for 1000000 words or more.
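The size figures above can be checked by hand. The following is a minimal sketch, assuming the natural logarithm, a lower clamp at zero, and a score equal to the raw value divided by the stated maximum. The function name `size_score` and its parameters are hypothetical and are not taken from the UD tools.

```python
import math

def size_score(n_words: int, cap_words: int = 1_000_000) -> float:
    """Hypothetical helper: normalized size score in [0, 1]."""
    if n_words <= 0:
        return 0.0
    # Raw value as printed in the log: log((N/1000)**2), never below zero.
    raw = max(0.0, math.log((n_words / 1000) ** 2))
    # Maximum value, reached at cap_words (about 13.815511 for 1,000,000 words).
    cap = math.log((cap_words / 1000) ** 2)
    return min(raw, cap) / cap

print(size_score(211920))
```

For 211920 words this should land close to the score{size} value reported further down in the log.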
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 1.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 143519 out of 211920 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 36 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 2489
Udapi: found 2489 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 211920 words.
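The Udapi lines above give a bug count and a worst-case threshold of one bug per 10 words. As a sketch, assuming the score is one minus the ratio of bugs found to that threshold, clamped at zero, the arithmetic looks like this. The name `udapi_score` and its parameters are hypothetical.

```python
def udapi_score(bugs: int, words: int, words_per_bug: int = 10) -> float:
    """Hypothetical helper: 1.0 means no bugs, 0.0 means at or beyond the threshold."""
    # Worst expected number of bugs: one per `words_per_bug` words.
    threshold = words / words_per_bug
    if threshold <= 0:
        return 0.0
    return max(0.0, 1.0 - bugs / threshold)

print(udapi_score(2489, 211920))
```

With 2489 bugs in 211920 words the threshold is 21192 bugs, and the result agrees with the score{udapi} value reported further down.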
Genres: found 11 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-GUM/en_gum-ud-dev.conllu
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-GUM/en_gum-ud-test.conllu
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-GUM/en_gum-ud-train.conllu
[Line 1170 Sent GUM_academic_census-6 Node 19]: [L3 Warning fixed-gap] Gaps in fixed expression [19, 23] 'due * * * to'
Warnings: 1
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.647058823529412) = 0.0497737556561086
(weight=0.0769230769230769) * (score{lemmas}=1) = 0.0769230769230769
(weight=0.256410256410256) * (score{size}=0.775390648429725) = 0.198818114981981
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.8) = 0.0615384615384615
(weight=0.307692307692308) * (score{udapi}=0.882550018875047) = 0.271553851961553
(weight=0.0769230769230769) * (score{udeprels}=0.972972972972973) = 0.0748440748440748
(TOTAL score=0.846271848725768) * (availability=1) * (validity=1) = 0.846271848725768
STARS = 4
UD_English-GUM 0.846271848725768 4
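The TOTAL score in this log can be reproduced as a weighted sum of the per-criterion scores. In the sketch below, the weights are written as integer shares out of 39, which is an observation about the printed values (for example 0.0769230769230769 = 3/39 and 0.307692307692308 = 12/39) rather than something the log states. The names `WEIGHT_SHARES`, `SCORES`, and `total_score` are hypothetical.

```python
# Integer weight shares; each printed weight equals share / 39 (inferred from the log).
WEIGHT_SHARES = {
    "features": 3, "genres": 3, "lemmas": 3, "size": 10,
    "split": 2, "tags": 3, "udapi": 12, "udeprels": 3,
}

# Per-criterion scores as reported in the log.
SCORES = {
    "features": 0.8,
    "genres": 11 / 17,       # 11 of 17 known genres
    "lemmas": 1.0,
    "size": 0.775390648429725,
    "split": 1.0,
    "tags": 0.8,
    "udapi": 0.882550018875047,
    "udeprels": 36 / 37,     # 36 of 37 universal relations
}

def total_score(scores, shares, availability=1.0, validity=1.0):
    """Weighted mean of the scores, scaled by availability and validity."""
    denom = sum(shares.values())
    weighted = sum(shares[name] * scores[name] for name in shares) / denom
    return weighted * availability * validity

print(total_score(SCORES, WEIGHT_SHARES))
```

Because availability and validity are both 1 in this run, the final value equals the plain weighted sum.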