-
Notifications
You must be signed in to change notification settings - Fork 42
-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unify lemmas of pants, glasses, jeans #476
Comments
I suppose a similar question would apply to |
(or also |
it should be noted that this was quite similar to |
Hm, not sure - I mean, pants do have two separate pants... It's not usually used that way, but you can find examples like:
etc. So it's not exactly super weird, at least I think there is no way that "clothes" can be broken down into several pieces of "cloth" anymore. But I don't feel very strongle about it, I'm happy to go with the lemmas "clothes, pants, glasses, jeans" if there is a consensus on that. |
Changed "sunglasses" in EWT. I take it the rest of the issue overlaps with UniversalDependencies/docs#999. |
What do we do with single items with plural word forms such as
pants
,glasses
,sunglasses
,jeans
, etc?In general,
jeans
orpants
get lemmatized to the plural lemma in EWT but then to the singular in GUM:EWT:
but in GUM:
then both treebanks agree that one pair of glasses is lemmatized as
glass
, but that's inconsistent withpants
orjeans
and is kind of unsatisfying IMOEWT
@nschneid @amir-zeldes
The text was updated successfully, but these errors were encountered: