▲ | flohofwoe 7 days ago | |||||||||||||||||||||||||||||||
> Please stop pushing this UTF-8 everywhere nonsense. I was with you until this sentence. UTF-8 everywhere is great exactly because it is ASCII-compatible (e.g. all ASCII strings are automatically also valid UTF-8 strings, so UTF-8 is a natural upgrade path from ASCII) - both are just encodings for the same UNICODE codepoints, ASCII just cannot go beyond the first 127 codepoints, but that's where UTF-8 comes in and in a way that's backward compatible with ASCII - which is the one ingenious feature of the UTF-8 encoding. | ||||||||||||||||||||||||||||||||
▲ | 0x000xca0xfe 7 days ago | parent | next [-] | |||||||||||||||||||||||||||||||
I'm not advocating for ASCII-everywhere, I'm for bytes-everywhere. And bytes can conveniently fit both ASCII and UTF-8. If you want to restrict your programming language to ASCII for whatever reason, fine by me. I don't need "let wohnt_bei_Böckler_STRAẞE = ..." that much. But if you allow full 8-bit bytes, please don't restrict them to UTF-8. If you need to gracefully handle non-UTF-8 sequences graphically show the appropriate character "�", otherwise let it pass through unmodified. Just don't crash, show useless error messages or in the worst case try to "fix" it by mangling the data even more. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
▲ | numpad0 7 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||
UTF-8 everywhere is not great and UTF-8 in practice is hardly ASCII-compatible. UTF-8 in source codes and file paths outside pure ASCII range breaks a lot of things especially on non-English systems due to legacy dependencies, ironically. Sure, it's backward compatible, as in ASCII handling codes work on systems with UTF-8 locales, but how important is that? | ||||||||||||||||||||||||||||||||
|