| ▲ | pipes 5 hours ago | ||||||||||||||||||||||||||||||||||
I'll probably be proven wrong eventually, but my main thought about spec driven dev with llms is that it introduces an unreliable compiler. It will produced different results every time it is run and it's up to the developer to review the changes. Which just seems like a laborious error prone task. | |||||||||||||||||||||||||||||||||||
| ▲ | CuriouslyC 42 minutes ago | parent | next [-] | ||||||||||||||||||||||||||||||||||
No, this is the right take. Spec driven development is good, but having loose markdown "specs" that leave a bunch up to the discretion of the LLM is bad. The right approach is a project spec DSL that agents write, which can be compiled via codegen in a more controlled way. | |||||||||||||||||||||||||||||||||||
| ▲ | Kiro 5 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||
Why would you want to rerun it? In that context a human is also an unreliable compiler. Put two humans on the task and you will get two different results. Even putting the same human on the same task again will yield something different. LLMs producing unreliable output that can't be reproduced is definitely a problem but not in this case. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||
| ▲ | mexicocitinluez 4 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||
You don't need this type of work to be deterministic. It doesn't really matter if the LLM names a function "IsEven" vs "IsNumberEvent". Have you ever written the EXACT same code twice? > it introduces an unreliable compiler. So then by definition so our humans. If compiling is "taking text and converting it to code" that's literally us. > it's up to the developer to review the changes. Which just seems like a laborious error prone task. There are trade-offs to everything. Have you ever worked with an off-shore team? They tend to produce worse code and have 1% of the context the LLM does. I'd much rather review LLM-written code than "I'm not even the person you hired because we're scamming the system" developers. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||