▲ | candiddevmike a day ago | |||||||
People's interpretation of benchmarks will largely depend on whether they believe they will be better or worse off by GenAI taking over SWE jobs. Think you'd need someone outside the industry to weigh in to have a real, unbiased view. | ||||||||
▲ | douglasisshiny a day ago | parent [-] | |||||||
Or someone who has been a developer for a decade plus trying to use these models on actual existing code bases, solving specific problems. In my experience, they waste time and money. | ||||||||
|