Remix.run Logo
throw0101d 3 days ago

For Unicode 17 more generally:

* https://www.unicode.org/versions/Unicode17.0.0/

* https://news.ycombinator.com/item?id=45187274

There are some charts with the new characters available at:

* https://www.unicode.org/charts/PDF/Unicode-17.0/

"CJK Unified Ideographs Extension J" has 4298 entries.

lifthrasiir 3 days ago | parent [-]

It is so amazing that the CJK Unified Ideographs block is still being extended to this day, even though I do know many intricacies of encoding those characters, like Z-variants and normalization rules and such. How many of these characters are left for encoding? I genuinely have no idea!

Freak_NL 3 days ago | parent [-]

It's probably academia catching up with historical documents digitised to Unicode. For CJKV any character can get added if it is found on an old scroll or something.

lifthrasiir 3 days ago | parent [-]

Of course, but I mean that there are only so many such historical documents in the world. So there is a limit that the CJK Unified Ideograph block can be extended. I'm surprised that the limit seems to be way higher than I initially thought.