▲ | sensanaty a day ago | |
And we also have a showcase from a day ago [1] of these magical autonomous AI agents failing miserably in the PRs unleashed on the dotnet codebase, where it kept reiterating it fixed tests it wrote that failed without fixing them. Oh, and multiple blatant failures that happened live on stage [2], with the speaker trying to sweep the failures under the rug on some of the simplest code imaginable. But sure, it managed to find a name buried in some emails after being told to... Search through emails. Wow. Such magic [1] https://news.ycombinator.com/item?id=44050152 [2] https://news.ycombinator.com/item?id=44056530 |