| ▲ | rglover 4 hours ago | ||||||||||||||||
> I have the feeling that the introduction of automatic QA may raise the bar of quality for new releases of software, and maybe partially compensate for the lower quality of the code produced at high speed with the use of automatic programming. In theory. The only difference between today and "the aughts" is that we have machines that can spit out a ton of code very quickly. Nothing has changed about the discipline or honesty around testing (you can skip automated tests even faster now if you wish). You can and should work with AI to write tests, but you have to know the difference between a good test and a "looks good on paper" test in order for it to truly be effective and raise the quality of what you're building. | |||||||||||||||||
| ▲ | onlyrealcuzzo 2 hours ago | parent [-] | ||||||||||||||||
> I have the feeling that the introduction of automatic QA may raise the bar of quality for new releases of software, and maybe partially compensate for the lower quality of the code produced at high speed with the use of automatic programming. I've been building a compiler with LLMs for a memory safe language like Rust with near zero cost abstractions (no GC), but with WAY less cognitive overhead. I can tell you right now: 1) It's 100x more than I could have achieved with zero compiler design experience. 2) I'm HIGHLY skeptical that LLMs can build something of this complexity (in some ways it's more difficult than implementing a Rust compiler) - so the testing is quite robust - 3 different systems (unit, integration, fuzz tests) each with mutant testing, each with between ~65-90% line coverage and ~50-80% branch coverage, combined with ~99% line coverage and ~86% branch coverage. There is ZERO chance I could get something even close to this level of "working" by myself ever - let alone with minimal effort. The test is kind of simple - if LLM's can do this... They should be able to do just about anything... Compilers are notoriously difficult to verify they actually work, rather than just kind of work sometimes... People can say I'm wasting my time all they want. But, one, it's been enlightening. I'm literally in awe of what they can do and have done. Two, I've developed a bunch of tooling / metrics necessary to get them to be able to do something at this level of complexity without falling over themselves. And I think it can work at scale pretty easily. Nearly all of the research comes from the 80s or farther back for the complexity metrics. | |||||||||||||||||
| |||||||||||||||||