▲ | astrange 2 days ago | |
The text returned by the tool itself makes it not "next token prediction". Aside from having side effects, the reason it's helpful is that it's out of distribution for the model. So it changes the properties of the system. | ||
▲ | SEGyges 12 hours ago | parent | next [-] | |
This is true of the system as a whole, but the core neural network is still a next-token predictor. | ||
▲ | porridgeraisin 2 days ago | parent | prev [-] | |
Ah ok, understood what you meant. |