Turkish proper noun suffixes #188

Closed
ojwb opened this issue Dec 8, 2023 · 0 comments

ojwb commented Dec 8, 2023

(Related to #187)

https://en.wikipedia.org/wiki/Turkish_language says "In modern Turkish orthography, an apostrophe is used to separate proper names from any suffixes" with the example "Türkiye'dir ("it is Turkey")". Currently we stem "türkiye'dir" to "türkiye'" but "türkiye" to "türki".

I think after removing a suffix we should also remove an apostrophe if one immediately precedes the suffix. A quick test shows we would then stem "türkiye'dir" to "türki".
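
A minimal Python sketch of the proposed rule, for illustration only. The real stemmer is written in Snowball; the function name `toy_stem` and the two-entry suffix list here are hypothetical and chosen just to reproduce the examples above.

```python
# Hypothetical toy suffix list (an assumption, not the real stemmer's rules):
# just enough to reproduce the examples from this issue.
TOY_SUFFIXES = ("dir", "ye")


def toy_stem(word: str) -> str:
    """Repeatedly strip a toy suffix; after each removal, also drop an
    apostrophe if one immediately preceded the removed suffix."""
    changed = True
    while changed:
        changed = False
        for suffix in TOY_SUFFIXES:
            # Only strip when something is left in front of the suffix.
            if len(word) > len(suffix) and word.endswith(suffix):
                word = word[: -len(suffix)]
                # The proposed extra step: remove the separating apostrophe.
                if word.endswith("'"):
                    word = word[:-1]
                changed = True
                break
    return word


print(toy_stem("türkiye'dir"))
print(toy_stem("türkiye"))
```

With the apostrophe step in place, both forms reduce to the same stem in this toy model, which is the behaviour the proposal is after.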

Looking at turkish/voc.txt, 9280 of 96325 entries (about 9.6%) contain an apostrophe.

ojwb changed the title from "Turkish proper nouns suffixes" to "Turkish proper noun suffixes" on Dec 8, 2023
ojwb added a commit to snowballstem/snowball-data that referenced this issue on Oct 12, 2024
ojwb closed this as completed in 94880d9 on Oct 12, 2024
ojwb added a commit to snowballstem/snowball-website that referenced this issue on Oct 12, 2024