https://en.wikipedia.org/wiki/Turkish_language says "In modern Turkish orthography, an apostrophe is used to separate proper names from any suffixes" with the example "Türkiye'dir ("it is Turkey")". Currently we stem "türkiye'dir" to "türkiye'" but "türkiye" to "türki".
I think after removing a suffix we should also remove an apostrophe if one immediately precedes the suffix. A quick test shows we would then stem "türkiye'dir" to "türki".
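As a rough illustration of the proposed rule, here is a minimal Python sketch. It is not the Snowball implementation: `strip_suffix` and `APOSTROPHES` are hypothetical names, the suffix is passed in by hand rather than found by the Turkish algorithm, and treating U+2019 as an apostrophe is an assumption.

```python
# Hypothetical sketch of the proposed rule, not the actual Snowball code.
# Assumption: both the ASCII apostrophe and U+2019 should count.
APOSTROPHES = ("'", "\u2019")


def strip_suffix(word: str, suffix: str) -> str:
    """Remove `suffix` from the end of `word`, then drop an apostrophe
    if one immediately preceded the removed suffix."""
    if not suffix or not word.endswith(suffix):
        return word  # nothing removed, so any apostrophe is left alone
    stem = word[: -len(suffix)]
    if stem.endswith(APOSTROPHES):
        stem = stem[:-1]
    return stem
```

With this rule, removing "dir" from "türkiye'dir" leaves "türkiye" rather than "türkiye'", so later stemming steps see the same input as they do for the bare proper noun.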
Looking at turkish/voc.txt, 9280 of 96325 entries (about 9.6%) contain an apostrophe.
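A count like this could be reproduced with something along the following lines. This is only a sketch: `count_apostrophe_entries` is a hypothetical helper, and it assumes the vocabulary file is UTF-8 with one entry per line and that only the ASCII apostrophe is of interest.

```python
from typing import Iterable, Tuple


def count_apostrophe_entries(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (entries containing an apostrophe, total non-empty entries).

    Assumption: one vocabulary entry per line; blank lines are skipped.
    """
    total = 0
    with_apostrophe = 0
    for line in lines:
        word = line.strip()
        if not word:
            continue
        total += 1
        if "'" in word:
            with_apostrophe += 1
    return with_apostrophe, total


# Possible usage (the path is relative to a snowball-data checkout):
#     with open("turkish/voc.txt", encoding="utf-8") as f:
#         hits, total = count_apostrophe_entries(f)
```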
ojwb changed the title from "Turkish proper nouns suffixes" to "Turkish proper noun suffixes" on Dec 8, 2023
ojwb added a commit to snowballstem/snowball-data that referenced this issue on Oct 12, 2024
(Related to #187)