| ▲ | AI Code Is Going to Kill Your Startup (and You're Going to Let It)(medium.com) | |||||||
| 10 points by tanelpoder 12 hours ago | 7 comments | ||||||||
| ▲ | simonw 10 hours ago | parent | next [-] | |||||||
I don't like how this article says this: > When researchers analyzed over 100 different large language models across 80 real-world coding scenarios — the kind of stuff you’d actually build in production — they found vulnerabilities in 45% of cases. But then fails to cite the research in question. I dug around and it's this report from security vendor Veracode: https://www.veracode.com/resources/analyst-reports/2025-gena... - PDF https://www.veracode.com/wp-content/uploads/2025_GenAI_Code_... That report is very thin on actual methodology. It's hard to determine how credible it is without seeing the prompts they were passing to the models. They do provide this: > Each coding task consists of a single function in one of the target languages. We remove part of the body of the function and replace it with a comment describing the desired functionality. With this one example:
This bit from the linked article really set off my alarm bells:> Python, C#, and JavaScript hover in the 38–45% range, which sounds better until you realize that means roughly four out of every ten code snippets your AI generates have exploitable flaws. That's just obviously not true. I generate "code snippets" hundreds of times a day that have zero potential to include XSS or SQL injection or any other OWASP vulnerability. | ||||||||
| ||||||||
| ▲ | combocosmo 9 hours ago | parent | prev | next [-] | |||||||
Of course a bit anecdotal, but not once has either Gemini or ChatGPT suggested me anything with eval or shell=True in it for Python. Admittedly I only ask it for specific problems, "this is your input, write code that outputs that" kind of stuff. I find it hard to believe that nearly 50% of AI generated python code contains such obvious vulnerabilities. Also, the training data should be full of warnings against eval/shell=True... Author should have added more citations. | ||||||||
| ▲ | throwawaysleep 11 hours ago | parent | prev | next [-] | |||||||
Do security breaches actually break companies? | ||||||||
| ||||||||
| ▲ | yahoozoo 10 hours ago | parent | prev | next [-] | |||||||
Is there anywhere to see examples of the insecure code generated by an LLM? | ||||||||
| ▲ | 9 hours ago | parent | prev [-] | |||||||
| [deleted] | ||||||||