| ▲ | JSR_FDED 2 hours ago | |
Why would you deprive the LLM of a signal that indicates how badly it screwed up? | ||
| ▲ | carsareok 44 minutes ago | parent [-] | |
Because it's a completion engine and has no notion of "signals". Swearing was in the texts they were trained on to complete token by token. I suspect it weren't texts with a lot of high-quality reasoning. | ||