▲ | jibal 7 days ago | ||||||||||||||||
"Unicode, being a byte code format" UTF-8 is a byte code format; Unicode is not. In Python, where all strings are arrays of Unicode code points, substrings are likewise arrays of Unicode code points. | |||||||||||||||||
▲ | zahlman 6 days ago | parent [-] | ||||||||||||||||
The point is that not all sequences of characters ("code point" means the integer value, whereas "character" means the thing that number represents) are valid. | |||||||||||||||||
|