-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
implement ExtPos for fixed expressions; is CCONJ possible for "as opposed to", "rather than", "instead of"? #530
Comments
Can you give a bit more explanation on what ExpPos means in this case or how the external deprel will be represented? |
External POS: https://universaldependencies.org/en/feat/ExtPos.html For example, "instead" is individually an ADV, but where it attaches as |
(BTW this has also been implemented in GUM) |
BTW2: In v2.14, most of the treebanks that use For consistency, it would be nice to have the same policy in others such as English-GUM. |
For EWT I've just moved it to MISC following @dan-zeman's statement that FEATS should be reserved for properties of individual words, not larger units. |
Yes, it's in MISC in GUM for the same reason. |
New issue about standardizing ExtPos at the universal level: UniversalDependencies/docs#1037 |
Implemented in the above commit. I've made some small updates to the English One question:
|
I think "as opposed to" is like "rather than"—its coordination vs. prepositional function depends on context.
|
@amir-zeldes OK to add CCONJ as a possibility for "as opposed to"? |
I'm not totally sure, but why do we need it to be cc syntactically? I mean, how is it different from like/unlike:
But we don't think "like/unlike" is a cc, right? |
My intuition is that "as opposed to" readily connects a wider range of phrase types, and does not always require a prosodic break:
"Rather than" can be substituted. Having trouble coming up with "like"/"unlike" examples that are good parallels. |
Mm, maybe it's headed that way, but I think it's still not quite a CC yet, since it can be pre-poned. OntoNotes example:
|
I'm not arguing it's exclusively CCONJ. We concluded that "rather than" is sometimes CCONJ, sometimes ADP/SCONJ. |
Here's another piece of evidence—coordination of transitive VBGs in the progressive construction:
|
Right, I'm just thinking if we allow it to be CCONJ, then this case would probably be tagged that way:
But as the original example shows, this case is actually invertible, which suggests it isn't CCONJ. Because it's hard to tell if a case is invertible, and because I think underlyingly this is syntactically still case, I would rather play it safe and just tag it uniformly as case/mark, and not leave room for inter-annotator inconsistencies which would come in if we allow two analyses.
Yeah, I'm not saying unlike has the same distribution, just that I'm not clear on when "as opposed to" is frontable, which, at least when it is, suggests it's not CCONJ. In that respect it's similar to "unlike", which is also often frontable. |
How is it different from "rather than"? https://universaldependencies.org/en/dep/fixed.html#augmentativesubstitutiveconjunctive-expressions |
TBH probably only in that it was put on the list during SD already. If someone had asked me I would have probably raised the same objection about it ("rather than" is very often frontable):
That shouldn't be CCONJ either IMO. |
The rule we decided in #182 was to use case/mark only for the fronted ones, and cc for the others. |
Right - since it was already on the fixed list, I thought it was better to at least have the fronted cases as case/mark, and I still think that's better than some weird inverted conj or something. But in an ideal world, I think neither of these should be conj, so I think the fewer of these things we have the better. |
I just worry that saying it is never CCONJ will paint us into a corner and produce some strange structures. Here's one I found in COCA:
Hard for me to avoid reading that with "she rather than the father" as a constituent, and Another one:
"During" is a (transitive) preposition. Prepositions can't take PP modifiers right? Looks to me like the equivalent of [during and not [before or after]] practice. Also, in terms of paraphrasing, the "as opposed to Y" part can be moved to the end of the clause, but it relies on heavy inference that "during" is in focus:
Fronting just sounds like a bizarre misuse of ellipsis:
One more, with adjectives:
(UPDATE) OK I lied, one more with determiners/quantifying-adjectives:
|
I think these arguments apply to things like "instead of" too -
Would you want "instead of" to also be a CC? I think these are just nmod/obl + case as appropriate - the simpler, more consistent the analysis the better IMO. Just saying "instead of" ->case, "as opposed to" -> case makes it easier to teach and validate. As for fronting, I think the reason it sounds weird in "As opposed to prior to or post, self-talk was employed.." is that the elided repeated argument of prior does not occur on first mention. This would be much better as:
|
Yeah good point, "instead of" patterns similarly to "rather than" and "as opposed to" (there may be slight differences though, I'm not sure). You really think "during instead of before practice" would be nmod(during, before)? I don't know that we have precedent for an nmod between two prepositions. They are not nominals but rather markers-of-nominals. |
No, I'd say obl since it's not a nominal. You could also see it as a kind of promotion I guess... |
But we wouldn't use a promotion analysis for "during or before practice". Apart from coordination and a few other exceptions, the universal guidelines speak against function words having dependents. The "promotion by head elision" examples given there are pretty different because they do not involve a dependency between two function words. Anyway for now I'll stick with the existing trees when assigning ExtPos, but I do think the |
To resurrect this whole thing - I hadn't finished implementing it in PUD. Going back to look at that, I found this example for "such as":
Most of the |
Looking at the "up to" in EWT, there seem to be some which are not labeled: Some labeled examples:
Here are some not labeled examples which are kind of similar to others above: this looks very similar to "up to $1,000"
not labeled because it was viewed as "heat up" as opposed to "up to", fixed? but above "kept up to" was treated as "kept - up to" instead of "kept up"
these aren't labeled fixed - lots of other "up to" which were not labeled but clearly shouldn't be, here i wonder if these are actually fixed
|
|
The Core Group decided it would be a good idea for treebanks to specify how each
fixed
expression functions externally viaExtPos
in the MISC column.This is already implemented for a few expressions in EWT. We might as well expand to all of them. If the external deprel is correct, it can be used to infer the ExtPos (which is one of ADV, ADP, CCONJ SCONJ).
The text was updated successfully, but these errors were encountered: