Remix.run Logo
cyanydeez a day ago

no, you're misinterpreting what its doing. It's not...

Yes, what it's doing is using continuation phrases. It wants to continue, and these token-combos, like Tekken 4, let it move from one gradient descent to the next, and like Tony Hawk, perform combo after combo, so it can just keep producing tokens.

Because thats how they're trained. In another thread, someone wished they'd taught the models "I dont know" and were extremely convinced that some how you could train a model to stop producing tokens, or whatever. You can't both train the model to generate output and also teach it not to. That's their whole bag of tricks.

But it's not trying to be clever, it's trying to keep generating.

cindyllm 13 hours ago | parent [-]

[dead]