lifthrasiir 3 days ago

It is amazing that the CJK Unified Ideographs block is still being extended to this day. I know many of the intricacies of encoding those characters, like Z-variants and normalization rules, and yet I genuinely have no idea how many of these characters are left to encode!

Freak_NL 3 days ago | parent [-]

It's probably academia catching up as historical documents get digitised into Unicode. For CJKV, any character can get added if it is found on an old scroll or something.

lifthrasiir 3 days ago | parent [-]

Of course, but I mean that there are only so many such historical documents in the world, so there is a limit to how far the CJK Unified Ideographs block can be extended. I'm surprised that the limit seems to be way higher than I initially thought.