| ▲ | lbrito 4 hours ago | ||||||||||||||||
>undeniable, massive productivity gains. How can something so undeniable have zero scientific evidence? Are there any large peer reviewed or meta studies confirming your claim? | |||||||||||||||||
| ▲ | aspenmartin 4 hours ago | parent | next [-] | ||||||||||||||||
It’s a very hard experiment to run. You have a population that’s already “treated”. You can’t blind them to the fact that they’re using AI tools. It’s hard to imagine a study that wouldn’t have serious flaws that people would then use to dismiss and form their own conclusions. Sure you have METR but that was very low n with a very old model. I think the surest sign of productivity gains is the sheer volume of adoption. If you look beyond headlines, adoption is just incredible. Of course adoption does not necessarily point to productivity gains, but if this was some sort of FOMO or smoke and mirrors you would not see this much retention and this feverish a pace of adoption. You would not see a large segment of the profession using coding agents exclusively. All of these companies track productivity, again with imperfect proxies, yet everything points to a pretty consistent picture. Same with benchmarks, again a lot of crappy benchmarks but a lot of high quality ones too and a very diverse collection of tasks and capabilities they probe. | |||||||||||||||||
| |||||||||||||||||
| ▲ | _aavaa_ 4 hours ago | parent | prev [-] | ||||||||||||||||
Because even in a field like software engineering where the output of our work is save in version control, measuring baseline productivity is hard. LoC: people argue it’s not what’s important PRs/day: same as LoC Getting projects done faster: oh but what about the quality. Solve the technical problems and actually be more productive, the social systems build around the old way of doing things will hole you back. Finish a PR in 10 minutes doesn’t matter if you’re waiting days for a human review. | |||||||||||||||||