You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
According to the word segmentation rules, this should be treated as a single word (see detailed discussion: unicode-rs/unicode-segmentation#90, rust-lang/regex#743). However, the current demo site splits it into two words. While splitting here is permitted per the Notes section on splitting between different character sets, the formal rules are that there should be no split here, which is what the demo site should reflect.
The text was updated successfully, but these errors were encountered:
markusicu
added
the
from public
Feedback/bug report from the public, that is, not from a Unicode Tools/UCD contributor/maintainer.
label
Sep 24, 2021
Consider the string
"abc를"
.According to the word segmentation rules, this should be treated as a single word (see detailed discussion: unicode-rs/unicode-segmentation#90, rust-lang/regex#743). However, the current demo site splits it into two words. While splitting here is permitted per the Notes section on splitting between different character sets, the formal rules are that there should be no split here, which is what the demo site should reflect.
The text was updated successfully, but these errors were encountered: