▲ | richardblythman 20 hours ago | |||||||||||||||||||||||||
If coding agents are the new entry point to your library, how sure are you that they’re using it well? I asked this question to about 50 library maintainers and dev tool builders, and the majority didn't really know. Existing code generation benchmarks focus mainly on self-contained code snippets and compare models not agents. Almost none focus on library-specific generation. So we built a simple app to test how well coding agents interact with libraries: • Takes your library’s docs • Automatically extracts usage examples • Tasks AI agents (like Claude Code) with generating those examples from scratch • Logs mistakes and analyzes performance We’re testing libraries now, but it’s early days. If you're interested: Input your library, see what breaks, spot patterns, and share the results below. We plan to expand to more coding agents, more library-specific tasks, and new metrics. Let us know what we should prioritize next. | ||||||||||||||||||||||||||
▲ | bdhcuidbebe 18 hours ago | parent | next [-] | |||||||||||||||||||||||||
> If coding agents are the new entry point to your library, how sure are you that they’re using it well? > I asked this question to about 50 library maintainers and dev tool builders, and the majority didn't really know. Why should they even bother to answer such a loaded and hypothetical question? | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | justonceokay 19 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
If making dev tooling is selling shovels to the miners, then this is like selling sheet metal to the shovel makers. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | dotancohen 19 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
Note that this comment is not hijacking. The author of this comment is also the author of the post. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | weitendorf 18 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
Let’s meet and see if it might make sense for us to team up. We’re working on this from the agent/library-specific-task side, and we might be better than chatgpt at marketing your product :) | ||||||||||||||||||||||||||
▲ | spankalee 18 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
Why do we need to log in? | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | mxkopy 5 hours ago | parent | prev [-] | |||||||||||||||||||||||||
IMO a tool like this doesn’t make sense until the hallucination problem is fixed |