Surrogate pair encoding char-codes #x10000-#x1FFFF #6

rpgoldman · 2021-02-22T02:45:56Z

Original description:

According to RFC 4627 [1], a character outside the Basic Multilingual Plane should be escaped as a UTF-16 surrogate pair, i.e. two consecutive \uXXXX sequences.
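
For illustration only, the surrogate-pair arithmetic can be sketched as follows. This is a minimal Python sketch, not the code from this PR; the function name `json_escape_char` is hypothetical. It covers the full supplementary range U+10000 to U+10FFFF, which is wider than the #x10000-#x1FFFF range named in the title.

```python
def json_escape_char(ch: str) -> str:
    """Return the JSON \\uXXXX escape for one character (hypothetical helper)."""
    cp = ord(ch)
    if cp < 0x10000:
        # Inside the Basic Multilingual Plane: a single 4-digit escape suffices.
        return "\\u%04X" % cp
    # Outside the BMP: subtract 0x10000, leaving a 20-bit value.
    v = cp - 0x10000
    high = 0xD800 + (v >> 10)    # top 10 bits -> high (lead) surrogate
    low = 0xDC00 + (v & 0x3FF)   # bottom 10 bits -> low (trail) surrogate
    return "\\u%04X\\u%04X" % (high, low)


# U+1D11E (G clef) is the example given in RFC 4627, section 2.5.
print(json_escape_char("\U0001D11E"))  # \uD834\uDD1E
```

The emoji U+1F600 comes out as `\uD83D\uDE00` under the same rule.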

This is minimally tested, but it works for my use case: I need to send emoji to a mobile client, and my users were seeing garbage displayed.

Ideally, the decoding side should be hacked to do the reverse as well. I don't need this at the moment, so I haven't looked into it yet.
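
The reverse direction would amount to recombining the two escaped code units into one code point. A minimal Python sketch of that step, under the assumption that the decoder has already parsed the two \uXXXX values into integers (the name `combine_surrogates` is hypothetical and is not part of this PR):

```python
def combine_surrogates(high: int, low: int) -> str:
    """Combine a UTF-16 surrogate pair into one character (hypothetical helper)."""
    if not (0xD800 <= high <= 0xDBFF):
        raise ValueError("first code unit is not a high surrogate")
    if not (0xDC00 <= low <= 0xDFFF):
        raise ValueError("second code unit is not a low surrogate")
    # Each surrogate carries 10 bits; reassemble them and add back 0x10000.
    cp = 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)
    return chr(cp)


# \uD834\uDD1E is the RFC 4627 example pair for U+1D11E (G clef).
print(hex(ord(combine_surrogates(0xD834, 0xDD1E))))  # 0x1d11e
```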

[1] http://www.ietf.org/rfc/rfc4627.txt "2.5 Strings"

ghard added 2 commits January 28, 2016 17:23

Surrogate pair encoding char-codes #x10000-#x1FFFF

9e56d86

Fixed the fix. Now uses escapes.

e7726ab
