| ▲ | xienze 7 hours ago | |||||||||||||||||||||||||
> I think that the prompt is the thing that should be PR'd at this point, because ultimately the spec is what's important. The fundamental problem there is the code generation step is non-deterministic. You might make a two sentence change to the prompt to fix a bug and the generation introduces two more. Generate again and everything is fine. Way too much uncertainty to have confidence in that approach. | ||||||||||||||||||||||||||
| ▲ | tombert 7 hours ago | parent [-] | |||||||||||||||||||||||||
If you make the prompts specific enough and provide tests that it has to run before it passes, then it should be fairly close to deterministic. Also, people aren't actually reading through most of the code that is generated or merged, so if there's a fear of deploying buggy code generated by AI, then I assure you that's already happening. A lot. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||