▲ | hedora 7 months ago | |||||||
Can you still assume the bytes 0x00 and 0xFF are not present in the string (like in UTF-8?) | ||||||||
▲ | int_19h 7 months ago | parent [-] | |||||||
Yes. The only difference between UTF-8 and WTF-8 is that the latter does not reject otherwise valid UTF-8 byte sequences that correspond to codepoints in range U+D800 to U+DFFF (which means that, in practice, a lot of things that say they are UTF-8 are actually WTF-8). | ||||||||
|