edude03 7 hours ago
Essentially, the more turns you have, the more likely the agent is to fail, since the error compounds per turn. Agentic models are tuned for “long horizon tasks”, i.e. being able to go many, many turns on the same problem without failing.
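To put rough numbers on the compounding, here is a minimal sketch. It assumes every turn succeeds independently with the same probability, which is a simplification for illustration and not something taken from the figure being discussed; the function name and the 0.99 value are hypothetical.

```python
def task_success_probability(per_turn_success: float, turns: int) -> float:
    """Chance that all `turns` turns succeed.

    Assumption (illustrative only): each turn succeeds independently
    with the same probability `per_turn_success`.
    """
    if not 0.0 <= per_turn_success <= 1.0:
        raise ValueError("per_turn_success must be between 0 and 1")
    if turns < 0:
        raise ValueError("turns must be non-negative")
    # Independent events multiply, so the per-turn error compounds.
    return per_turn_success ** turns


if __name__ == "__main__":
    # Hypothetical per-turn reliability of 99%.
    for turns in (1, 10, 50, 100):
        print(turns, round(task_success_probability(0.99, turns), 3))
```

Under those assumptions, even a 99% per-turn success rate leaves only about a 37% chance of getting through 100 turns cleanly, which is why long-horizon tuning matters.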
zamadatix 7 hours ago
Much appreciated, but I mean more around "what do the error bars in the figure represent" than what the turn scaling itself is.