Currently, some words are tagged with a POS based on only part of the word, e.g.:
~/e/i/stiff ❯❯❯ ftb-label ⏎ master ⬆ ✱ ◼
FinnTreeBank tagger (v0.1-alpha-150-g82bce74) using OMorFi and FinnPos
/usr/local/bin/hfst-optimized-lookup: Reading from STDIN. Writing to STDOUT.
/usr/local/bin/finnpos-label: Loading tagger.
/usr/local/bin/finnpos-ratna-feats.py: Reading from STDIN. Writing to STDOUT.
/usr/local/bin/omorfi2finnpos.py: Reading from STDIN. Writing to STDOUT
1
1
/usr/local/bin/finnpos-label: Reading from STDIN. Writing to STDOUT.
Minä
olin
humalassapa
.
Minä _ minä [POS=PRONOUN]|[SUBCAT=PERSONAL]|[PERS=SG1]|[NUM=SG]|[CASE=NOM] _
olin _ olla [POS=VERB]|[VOICE=ACT]|[MOOD=INDV]|[TENSE=PAST]|[PERS=SG1] _
humalassapa _ humalassa [POS=PARTICLE]|[CLIT=PA] _
. _ . [POS=PUNCTUATION] _
Noting that humalassapa contains a particle is all very well, but what we actually want to know is that humalassa is adverb-like and that humala is a noun, since these are the headwords that exist in Wiktionary.
For particles in particular, perhaps this is quite easy:
Minä _ minä [POS=PRONOUN]|[SUBCAT=PERSONAL]|[PERS=SG1]|[NUM=SG]|[CASE=NOM] _
pidän _ pitää [POS=VERB]|[VOICE=ACT]|[MOOD=INDV]|[TENSE=PRESENT]|[PERS=SG1] _
oluesta _ olut [POS=NOUN]|[NUM=SG]|[CASE=ELA] _
niin _ niin [POS=PARTICLE] _
paljon _ paljon [POS=ADVERB] _
. _ . [POS=PUNCTUATION] _
We know we have a true particle when lemma == surface. If the lemma given by FinnPos is something that has another analysis according to OMorFi, then perhaps something more complex is going on.
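As a rough sketch of that check, the heuristic could look something like the following. This assumes each output line has five whitespace-separated fields (surface, features, lemma, label, annotations), as in the pasted output above. The function names `parse_line` and `is_true_particle` are hypothetical and not part of FinnPos or OMorFi, and the case-insensitive comparison is my own assumption to cope with sentence-initial capitals. The second step (looking the lemma up in OMorFi for other analyses) is not covered here.

```python
def parse_line(line):
    """Split one tagger output line into (surface, lemma, label).

    Assumes five whitespace-separated fields with no internal spaces:
    surface, features, lemma, label, annotations.
    """
    fields = line.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {line!r}")
    surface, _features, lemma, label, _annotations = fields
    return surface, lemma, label


def is_true_particle(surface, lemma, label):
    """Return True when the token is tagged PARTICLE and lemma == surface.

    Case is folded before comparing (an assumption, so that a
    sentence-initial "Niin" still matches the lemma "niin").
    """
    tags = label.split("|")
    return "[POS=PARTICLE]" in tags and surface.casefold() == lemma.casefold()


if __name__ == "__main__":
    for line in (
        "niin _ niin [POS=PARTICLE] _",
        "humalassapa _ humalassa [POS=PARTICLE]|[CLIT=PA] _",
    ):
        surface, lemma, label = parse_line(line)
        print(surface, is_true_particle(surface, lemma, label))
```

Tokens for which this returns False despite a PARTICLE tag (such as humalassapa) would be the ones to send back to OMorFi for a further analysis of the lemma.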