-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
UPOS inconsistency: indefinite pronouns/pro-adverbs #132
Comments
Looks like the latter comes from xpos=NN, which seems to happen sporadically when it's an object ("I want somewhere...") or a PP ("from somewhere"). In PTB these are consistently RB, incl. "out of nowhere" etc., so I think morphosyntactically they should be RB+ADV here as well, even though they are 'wrapped' in an NP. |
In another corpus I encountered "nowhere but X", where "but X" should be a PP modifying "nowhere". But this is awkward if "nowhere" is an ADV attaching as advmod. "Nowhere" also licenses "else", like the other indefinite pronouns. So I'm not sure I see the motivation to call "nowhere" ADV if "nobody" is PRON. Is "nowhere" just a pronoun that often heads an adverbial NP? |
No, I think it's still an adverb, it's not so unusual for an adverb to take modifiers IMO. I mean, it's interchangeable with "here" and "there", so it seems pretty canonical to consider it an adverb. |
And its distribution is even more like "where", which also licenses "else". I suppose it would be surprising to call "where" a pronoun, so ADV with Here are some borderline nominal cases:
I think you're arguing these should all be |
Basically I think this is another flavor of the same issue we talked about today RE: nsubj: it's like a unary derivation where you wrap an ADV in something to allow it to work as an argument etc.. But the underlying POS category is still ADV, otherwise you'd get the same issue with core adverbs like "here" and "there" ("in there", "the good old here and now") - in context these things can get case, amod, and other functions because they function like NPs, but morphologically I'd say they are still adverbs, and I expect PTB tagging follows the same behavior for xpos. |
PPs yes, but there are other PPs with non-nominal complements (e.g. "for free"). "The here and now" is an idiom (*the now and here). "Here" and "there" don't really lend themselves to adjective dependents in the way that "somewhere" does. |
While implementing UniversalDependencies/docs#517, rediscovering this issue of indefinite pro-forms occurring sometimes in noun-like contexts, for which EWT is not consistent about tagging:
It seems that PTB (and GUM) policy is to tag these as adverbs. My question is then what the external deprel should be—if there is a free relative, does that make it more nominal-like, triggering
I suppose defaulting to |
…d always be adverbs (#132); also apply PronType=Ind to the retagged ones (UniversalDependencies/docs#517)
Also there's a GUM token that is NOUN https://universal.grew.fr/?custom=6688bce47f934 |
Fixed GUM token. Generally I think distinguishing "any place" from "anywhere" on the POS level is fine, since the fused orthography corresponds to the lexicalization which justifies the re-analyzed/non-compositional morphosyntactic analysis. |
PRON
: http://match.grew.fr/[email protected]&custom=603ad285dd480&eud=yesADV
, sometimesNOUN
. What is the criterion? http://match.grew.fr/[email protected]&custom=603ad208684df&clustering=N1.uposThe text was updated successfully, but these errors were encountered: