| ▲ | LUmBULtERA 9 hours ago | |
That's yet to be determined. I think a lot of open-weight models are benchmaxxed and their usefulness for many tasks are not represented by those. | ||
| ▲ | enraged_camel 8 hours ago | parent [-] | |
Yes, this has been my experience. They all struggle with long-horizon tasks and eventually start going in circles. | ||