| ▲ | root_axis 3 hours ago | |
The tests are absolutely essential, otherwise there's no signal to guide the LLM towards correct behavior and hallucinations accumulate until any hope of forward progress collapses. | ||
| ▲ | falcor84 2 hours ago | parent [-] | |
Obviously the signal is comparison against the behavior of the original. | ||