From 391504d89780949faf2e1493b51bb023cf0a9ba8 Mon Sep 17 00:00:00 2001
From: Anne van Kesteren
Date: Fri, 20 Sep 2024 10:35:11 +0200
Subject: [PATCH] Further update the index gb18030 ranges explanation to account for GB18030-2000

---
 encoding.bs | 17 +++++++++--------
 1 file changed, 9 insertions(+), 8 deletions(-)

diff --git a/encoding.bs b/encoding.bs
index d55dd3e..fe4e1a9 100644
--- a/encoding.bs
+++ b/encoding.bs
@@ -845,12 +845,13 @@ specification, excluding index single-byte, which have their own table:
 index-gb18030-ranges.txt
 This index works different from all others. Listing all code points would result in over a million items whereas they can be represented neatly in 207 ranges combined with trivial
- limit checks. It therefore only superficially matches the GB18030-2005 standard for code points
- encoded as four bytes. It does not match the GB18030-2022 standard as that would increase the
- number of byte sequences mapping to Private Use code points. And the relevant Private Use code
- points are mapped in the gb18030 encoder directly through a side table to maximize
- compatibility with how they were mapped in GB18030-2005. See also
- index gb18030 ranges code point and index gb18030 ranges pointer below.
+ limit checks. It therefore only superficially matches the GB18030-2000 standard for code points
+ encoded as four bytes. The change for the GB18030-2005 revision is handled inline by the
+ index gb18030 ranges code point and index gb18030 ranges pointer algorithms below
+ that accompany this index. And the changes for the GB18030-2022 revision are handled differently
+ again to not further increase the number of byte sequences mapping to Private Use code points. The
+ relevant Private Use code points are mapped in the gb18030 encoder directly through a side
+ table to preserve compatibility with how they were mapped before.
 index jis0208
 index-jis0208.txt
@@ -2501,8 +2502,8 @@ consumers of content generated with GBK's encoder.
 0xFE 0xA0
- This asymmetric encoder table is introduced to maximize compatibility with
- GB18030-2005. See also the explanation at index gb18030 ranges.
+ This asymmetric encoder table preserves compatibility with the GB18030-2005
+ standard. See also the explanation at index gb18030 ranges.

  • Let pointer be the index pointer for code point in index gb18030.