| ▲ | cornholio 5 hours ago | |||||||
If I understood correctly, the model will get it right because it knows when it isn't right. | ||||||||
| ▲ | zambelli 5 hours ago | parent | next [-] | |||||||
Essentially, yes that's right! There's some subtlety in how to let it know it was wrong (returning things as tool errors because it trained on that), but that's the gist of it - sort of a self-correcting architecture. | ||||||||
| ▲ | tomjakubowski 2 hours ago | parent | prev [-] | |||||||
| ||||||||