kmeisthax 16 hours ago
This doesn't self-synchronize. Removing an arbitrary byte from the text stream (e.g. SOL / EOL) will change the meaning of codepoints far away from the site of the corruption. What it sounds like you want is an easy way for English-language programmers to skip or strip non-ASCII text without having to consult any actual Unicode documentation. That is obviously a Unicode non-goal, and it's also very bad software engineering practice. I'm also not sure what you're getting at with national borders and language authorities; both of those were absolutely involved with Unicode and still are.
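To make the self-synchronization point concrete, here is a rough Python sketch. The shift-state format in it (the `ALT_ON` / `ALT_OFF` marker bytes and `decode_toy`) is a hypothetical toy made up for illustration, not the scheme proposed upthread; it only stands in for any encoding where a marker byte changes how later bytes are read. The UTF-8 half uses nothing beyond the standard codec.

```python
def drop_byte(data: bytes, index: int) -> bytes:
    """Simulate corruption by deleting a single byte."""
    return data[:index] + data[index + 1:]

# UTF-8: lead and continuation bytes are distinguishable, so the decoder
# can resynchronize at the next character boundary.
utf8 = "naïve café".encode("utf-8")
# Index 2 is the lead byte (0xC3) of "ï".
utf8_damaged = drop_byte(utf8, 2).decode("utf-8", errors="replace")

# Hypothetical toy format (an assumption for illustration only):
# ALT_ON switches to a mode where byte b means U+0400 + b,
# ALT_OFF switches back to plain ASCII.
ALT_ON, ALT_OFF = 0x0E, 0x0F

def decode_toy(data: bytes) -> str:
    out = []
    alt = False
    for b in data:
        if b == ALT_ON:
            alt = True
        elif b == ALT_OFF:
            alt = False
        else:
            out.append(chr(0x0400 + b) if alt else chr(b))
    return "".join(out)

toy = (b"hi "
       + bytes([ALT_ON, 0x3F, 0x40, 0x38, 0x32, 0x35, 0x42, ALT_OFF])
       + b" bye")
toy_intact = decode_toy(toy)
# Lose only the closing marker: every later byte is now misread.
toy_damaged = decode_toy(drop_byte(toy, toy.index(ALT_OFF)))

print(repr(utf8_damaged))
print(repr(toy_intact))
print(repr(toy_damaged))
```

In the UTF-8 case the damage stays confined to the one character whose byte was lost. In the toy format, losing a single marker byte silently reinterprets everything after it, all the way to the end of the stream.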