| ▲ | dmitrygr 3 days ago | |||||||||||||||||||||||||
Now do it without those pre-written tests. Spec only. Else, the writers of those tests deserve a LOT of credit. | ||||||||||||||||||||||||||
| ▲ | pseudosavant 3 days ago | parent | next [-] | |||||||||||||||||||||||||
If there is one thing that that agents/LLMs have highlighted, it is how much credit those test writers do deserve. Teams that were already following a TDD-style approach seem to be able to realize value from agents most easily because of their tests. The tests are what enable: building a brand new JS runtime that works, rewriting a complex piece of code in a different language (e.g. Golang instead of TypeScript) that is more performant for that task, or even migrating off of an old stack (.NET WebForms) to something newer. | ||||||||||||||||||||||||||
| ▲ | ivankra 2 days ago | parent | prev | next [-] | |||||||||||||||||||||||||
You can prompt an LLM to generate tests from the spec and I'd bet it would easily get most of the way there, especially if you give it a reference implementation to test against. I did just that, though on a small scale - just for feature tests. The last few percent would be the real challenge, you probably don't want it to just imitate another implementation's bugs. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | UncleEntity 2 days ago | parent | prev [-] | |||||||||||||||||||||||||
> Now do it without those pre-written tests That's probably the most important thing, actually. I've tried my hardest to get Claude to build an APL VM using only the spec and it's virtually impossible to get full compliance as it takes too many shortcuts and makes too many assumptions. That's part of the challenge though, to see how far the daffy robots have come. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||