|
| ▲ | bfung 11 days ago | parent | next [-] |
| How does one choose what sequence of bytes constitutes a token? |
|
| ▲ | davrosthedalek 11 days ago | parent | prev [-] |
| But it could be. It's just less efficient. |
|
| ▲ | jrahmy 11 days ago | parent [-] |
| I don't see how. You could ask a neural network to do the tokenization I suppose, but in doing so you'd have to convert the prompt into tokens via the same deterministic process the network was trained on, essentially just moving the exact same process up one layer. |
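To make the "same deterministic process" point concrete, here is a minimal sketch of a greedy longest-match tokenizer. It is not any particular model's tokenizer: the vocabulary, the ID layout, and the `tokenize` name are all hypothetical, chosen only to show that the same input bytes always map to the same token IDs.

```python
# Toy vocabulary (hypothetical): multi-byte tokens get IDs 256 and up,
# while IDs 0-255 are reserved for single raw bytes as a fallback.
VOCAB = {b"token": 256, b"ize": 257, b" the": 258, b"to": 259}
MAX_LEN = max(len(piece) for piece in VOCAB)


def tokenize(text: str) -> list[int]:
    """Greedy longest-match tokenization over UTF-8 bytes."""
    data = text.encode("utf-8")
    ids = []
    i = 0
    while i < len(data):
        # Try the longest candidate first so the result is unambiguous.
        for length in range(min(MAX_LEN, len(data) - i), 1, -1):
            piece = data[i:i + length]
            if piece in VOCAB:
                ids.append(VOCAB[piece])
                i += length
                break
        else:
            # No multi-byte token matched: emit the raw byte value.
            ids.append(data[i])
            i += 1
    return ids
```

Under these assumptions, a model trained on IDs produced by `tokenize` only sees meaningful input if prompts go through the same function with the same `VOCAB`. Changing either one changes the ID sequence the model receives.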
|