| ▲ | jrahmy 11 days ago | ||||||||||||||||
A tokenizer is a deterministic string-matching program, it's not made out of weights in the same sense as a neural network itself. | |||||||||||||||||
| ▲ | bfung 11 days ago | parent | next [-] | ||||||||||||||||
How does one choose what sequence of bytes constitutes a token? | |||||||||||||||||
| ▲ | davrosthedalek 11 days ago | parent | prev [-] | ||||||||||||||||
But it could be. It's just less efficient. | |||||||||||||||||
| |||||||||||||||||