| ▲ | irthomasthomas 2 hours ago | |
Why frame it as rigging? I assume they would teach the models to improve on tasks the public find interesting. Then we just have to come up with more challenges for it. | ||
| ▲ | krackers an hour ago | parent [-] | |
It's not rigging—it's just RL. | ||