NitpickLawyer 6 hours ago
It would be trivial to detect such gaming, though. That's the beauty of the test, and that's why they're probably not doing it. If a model draws "perfect" (whatever that means) pelicans on a bike, you start testing for owls riding a lawnmower, or crows riding a unicycle, or x _verb_ on y ...
Sharlin 6 hours ago
It could still be special-cased through RLHF training, just not to perfection.