Remix.run Logo
JSR_FDED 2 hours ago

Why would you deprive the LLM of a signal that indicates how badly it screwed up?

carsareok 44 minutes ago | parent [-]

Because it's a completion engine and has no notion of "signals".

Swearing was in the texts they were trained on to complete token by token. I suspect it weren't texts with a lot of high-quality reasoning.