| ▲ | whycombinetor 2 days ago | |
Third party benchmarks like terminalbench exist. W.r.t code changes especially small ones (say 50 lines spread across 5 files), if you can't get an agent to make nearly exactly the code changes you want, just faster than you, that's a you problem at this point. If it maybe would take you 15 minutes, grok-code-fast-1 can do it in 2. | ||