Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode collation #10

Open
prataprc opened this issue Jan 15, 2019 · 0 comments
Open

Unicode collation #10

prataprc opened this issue Jan 15, 2019 · 0 comments

Comments

@prataprc
Copy link
Owner

prataprc commented Jan 15, 2019

Unicode specification for sorting string has this to say:

Because collation varies by language and not just by script, it is not possible to arrange the encoding for characters so that simple binary string comparison produces the desired collation order for all languages. Because multi-level sorting is a requirement, it is not even possible to arrange the encoding for characters so that simple binary string comparison produces the desired collation order for any particular language.

Note that the current implementation of Ord on JSON string simply does a byte-by-byte comparison.

@prataprc prataprc added the help wanted Extra attention is needed label Jan 15, 2019
@prataprc prataprc removed help wanted Extra attention is needed labels Feb 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant