advisedwang 5 days ago
In UTF-8, the first byte of a multi-byte character isn't just 1xxxxxxx to signal continuation; it's 110xxxxx, 1110xxxx, or 11110xxx, depending on how many bytes that character takes up. (The continuation bytes that follow are 10xxxxxx.)
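To make the bit patterns concrete, here is a rough sketch of reading the sequence length off the first byte. The function name `utf8_sequence_length` is made up for illustration, and it only inspects the prefix bits; it does not reject overlong or out-of-range lead bytes such as 0xC0 or 0xF5.

```python
def utf8_sequence_length(first_byte: int) -> int:
    """Return how many bytes a UTF-8 character occupies, judged by its first byte.

    Hypothetical helper: checks only the leading bit pattern, not full validity.
    """
    if (first_byte & 0b1000_0000) == 0b0000_0000:  # 0xxxxxxx: single-byte (ASCII)
        return 1
    if (first_byte & 0b1110_0000) == 0b1100_0000:  # 110xxxxx: 2-byte sequence
        return 2
    if (first_byte & 0b1111_0000) == 0b1110_0000:  # 1110xxxx: 3-byte sequence
        return 3
    if (first_byte & 0b1111_1000) == 0b1111_0000:  # 11110xxx: 4-byte sequence
        return 4
    # 10xxxxxx is a continuation byte; 11111xxx is never valid in UTF-8.
    raise ValueError(f"not a UTF-8 lead byte: {first_byte:#04x}")


if __name__ == "__main__":
    for ch in ("A", "é", "€", "😀"):
        encoded = ch.encode("utf-8")
        print(ch, utf8_sequence_length(encoded[0]), len(encoded))
```

The length is known as soon as the first byte is read, and a decoder that lands mid-character can tell because it sees a 10xxxxxx byte.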
necovek 4 days ago
Good point, I didn't pay close attention. In a sense, it's a shame this encoding wasn't structured like UTF-8 or, the other way around, that UTF-8 wasn't structured in this more generic way.