| ▲ | The fate of "small" open source(nolanlawson.com) |
| 259 points by todsacerdoti 20 hours ago | 197 comments |
| |
|
| ▲ | p0w3n3d 7 hours ago | parent | next [-] |
| Given that some 80% of developers are now using AI in their regular work, blob-util is almost certainly the kind of thing that most developers would just happily have an LLM generate for them. Sure, you could use blob-util, but then you’d be taking on an extra dependency, with unknown performance, maintenance, and supply-chain risks.
Letting LLM write utility code is a sword that cuts both ways. You often create a throw-away code that is unproven and requires maintenance. It's not a guarantee that the blobutil or toString or whatever created by AI won't fail at some edge cases. That's why e.g. in Java there is Apache commons which is perceived as an industry standard nowadays. |
| |
| ▲ | jaapz 6 hours ago | parent | next [-] | | Exactly. When you assume blob-util to be a utility library that has been in use for quite a while by many people in many different contexts, hasn't seen much changes and just "works", IMHO the risk of weird bugs is a lot larger with LLM-generated code. Code generated by LLM's often have the problem that the code seems logical, but then contain weird bugs that aren't immediately obvious. | | |
| ▲ | p0w3n3d 6 hours ago | parent [-] | | Agreed. The code available in e.g. Apache commons is not perfect, but it's seasoned and has the edge cases documented |
| |
| ▲ | safety1st 3 hours ago | parent | prev | next [-] | | This mostly sounds like a good thing to me from a utilitarian standpoint. Getting all your utility classes from somewhere like npm and creating dependencies on 20 different people and organizations who may or may not maintain their software has been a security nightmare with many highly public examples. If a LLM writes a utility class for me then my supply chain is smaller, meaning less surface area to attack plus I probably benefit from some form of security through obscurity for whatever non-trivial amount that's worth. "Downside" is I don't have some rando, probably unpaid labor out there updating a piece of my app for me... | | |
| ▲ | philipov 35 minutes ago | parent [-] | | Your supply chain is superficially fewer, but not smaller. The way you're counting the number of suppliers is heterogeneous: ChatGPT has a bigger surface area than 20 individuals. | | |
| ▲ | safety1st 11 minutes ago | parent [-] | | Your supply chain is smaller in the sense that every person or organization you obtain code from is similar to a vendor, just an unpaid one. They are a separate entity your business depends on. If we replace code written by 20 of those organizations with code written by ChatGPT, we've gone from 20 code "vendors" we don't know much about who have no formal agreement to speak of with us, to 1 tools vendor which we can even make an enterprise agreement with and all that jazz. So whatever else the outcome may be, from this perspective it reduces uncertainty to quit using random npm packages and generate your utility classes with ChatGPT. I think this is the conclusion many businesses may reach. |
|
| |
| ▲ | Cthulhu_ 2 hours ago | parent | prev | next [-] | | It's not a new thing either, many years ago there was already the debate whether you should trust utility code copied from SO or use an NPM library. In fact, I'm 99% confident that the slew of single function NPM libraries became a thing because of that mindset. | |
| ▲ | Menschlio 4 hours ago | parent | prev [-] | | The proper way to do it, would be to have an industry standard on the default things people blindly but massivly pull through dependencies. I also don't get how code can be so massivly inefficient. left-pad needs 9kb to download and the code is a handful of lines: https://www.npmjs.com/package/left-pad?activeTab=code If my unit tests run through, i don't have 'unproven' code. I have well working code which doesn't need to go through a dependency hell upgrade cycle just because one function in that lib, i don't use, has some CVE too high to be ignored. |
|
|
| ▲ | RyanHamilton 20 hours ago | parent | prev | next [-] |
| Less incentive to write small libraries. Less incentive to write small tutorials on your own website. Unless you are a hacker or a spammer where your incentives have probably increased. We are entering the era of cheap spam of everything with little incentive for quality. All this for the best case outcome of most people being made unemployed and rolling the dice on society reorganising to that reality. |
| |
| ▲ | NitpickLawyer 20 hours ago | parent | next [-] | | > or a spammer where your incentives have probably increased. Slight pushback on this. The web has been spammed with subpar tutorials for ages now. The kind of medium "articles" that are nothing more than "getting started" steps + slop that got popular circa 2017-2019 is imo worse than the listy-boldy-emojy-filled articles that the LLMs come up with. So nothing gained, nothing lost imo. You still have to learn how to skim and get signals quickly. I'd actually argue that now it's easier to winnow the slop. I can point my cc running in a devcontainer to a "tutorial" or lib / git repo and say something like "implement this as an example covering x and y, success condition is this and that, I want it to work like this, etc.", and come back and see if it works. It's like a litmus test of a tutorial/approach/repo. Can my cc understand it? Then it'll be worth my time looking into it. If it can't, well, find a different one. I think we're seeing the "low hanging fruit" of slop right now, and there's an overcorrection of attitude against "AI". But I also see that I get more and more workflows working for me, more or less tailored, more or less adapted for me and my uses. That's cool. And it's powered by the same underlying tech. | | |
| ▲ | skydhash 19 hours ago | parent | next [-] | | The thing is, what is the actual point of this approach? Is it for leaning? I strongly believe there’s no learning without inmersion and practice. Is it for automation? The whole idea of automation is to not think about the thing again unless there’s a catastrophic error, it’s not about babysitting a machine. Is it about judgment? Judgment is something you hone by experiencing stuff then deciding whether it’s bad or not. It’s not something you delegate lightly. | |
| ▲ | NegativeK 19 hours ago | parent | prev | next [-] | | The problem isn't that AI slop is doing something new. Phishing, blogspam, time wasting PRs, website scraping, etc have all existed before. The problem is that AI makes all of that far, far easier. Even using tooling to filter articles doesn't scale as slop grows to be a larger and larger percentage of content, and it means I'm going to have to consider prompt injections and running arbitrary code. All of this is a race to the bottom of suck. | |
| ▲ | AstroBen 19 hours ago | parent | prev [-] | | The difference is that the cost of slop has decreased by orders of magnitude. What happens when only 1 in 10,000 of those tutorials you can find is any good, from someone actually qualified to write it? | | |
| ▲ | NewsaHackO 18 hours ago | parent | next [-] | | One instance of definite benefit of AI is AI summary web search. Searching for answers to simple questions and not having to cut though SEO slop is such an improvement | | |
| ▲ | bloomca 17 hours ago | parent | next [-] | | The summary is often incorrect in at least some subtle details, which is invisible to a lot of people who do not understand LLM limitations. Now, we can argue that a typical SEO-optimized garbage article is not better, but I feel like the trust score for them was lower on average from a typical person. | | |
| ▲ | immibis 6 hours ago | parent [-] | | Marketing departments are already speaking of GEO - generative engine optimization. When a user asks an AI for the best X, you want it to say your X is the best, and you'll do whatever it takes to achieve that. |
| |
| ▲ | cml123 17 hours ago | parent | prev | next [-] | | I don't think searching for answers to simple questions was a problem until Google nerfed their own search engine. | | |
| ▲ | whatevertrevor 9 hours ago | parent | next [-] | | I don't understand this position, do you have direct evidence that Google actively made search worse? Before I'm misunderstood I do want to clarify that IMO, the end user experience for web searching on Google is much worse in 2025 than it was in say 2000. But, the web was also much much smaller, less commercial and the SNR was much better in general. Sure, web search companies moved away from direct keyword matching to much more complex "semantics-adjacent" matching algorithms. But we don't have the counterfactual keyword-based Google search algorithm from 2000 on data from 2025 to claim that it's just search getting worse, or the problem simply getting much harder over time and Google failing to keep up with it. In light of that, I'm much more inclined to believe that it's SEO spam becoming an industry that killed web search instead of companies "nerfing their own search engines". | | |
| ▲ | johnnyanmac 6 hours ago | parent | next [-] | | >do you have direct evidence that Google actively made search worse? sure. https://www.wheresyoured.at/the-men-who-killed-google/ >These emails — which I encourage you to look up — tell a dramatic story about how Google’s finance and advertising teams, led by Raghavan with the blessing of CEO Sundar Pichai, actively worked to make Google worse to make the company more money. This is what I mean when I talk about the Rot Economy — the illogical, product-destroying mindset that turns the products you love into torturous, frustrating quasi-tools that require you to fight the company’s intentions to get the service you want. Of course, it's hard to "objectively" prove that they literally made search worse, but it's clear they were fine with stagnating in order to maximize ad revenue. I see it as the same way Tinder works if you want the mentality. There's a point where being "optimal" hurts your bottom line, so you don't desire achieving a perfect algorithm. Meanwhile, it can be so bad for Google that directly searching for a blog title at times can leave me unsuccessful. | | |
| ▲ | eloisant 4 hours ago | parent [-] | | > I see it as the same way Tinder works if you want the mentality. There's a point where being "optimal" hurts your bottom line, so you don't desire achieving a perfect algorithm Yes, in the case of Google: - They make more money from ads if the organic results are not as good (especially if it's not clear they're add) - They get more impressions if you don't find the answer at the first search and have to try a different query |
| |
| ▲ | friendzis 6 hours ago | parent | prev [-] | | > In light of that, I'm much more inclined to believe that it's SEO spam becoming an industry that killed web search instead of companies "nerfing their own search engines". "SEO" is not some magic, it is "compliance with ranking rules of the search engine". Google wanted to make their lives easier, implemented heuristics ranking slop higher, resulting in two things happening simultaneously: information to slop ratio decreasing AND information getting buried deeper and deeper within SRPs. > do you have direct evidence that Google actively made search worse? https://support.google.com/google-ads/answer/10286719?hl=en-...
Google is literally rewriting the queries. Not only results with better potential for ads outrank more organic results, it is impossible to instruct the search engine to not show you storefronts even if you tried. |
| |
| ▲ | NewsaHackO 10 hours ago | parent | prev [-] | | Pretty sure Google attempting to curb SEO tactics is what led to whatever nerfing you are talking about. | | |
| ▲ | cml123 an hour ago | parent [-] | | granted it's not up to courtroom standards, this post linked by another commenter in the chain does paint the picture pretty well of an internal struggle between Search and Ads inside Google as a company, where there was a decision to promote user-negative changes to Search as a way to increase the total number of searches performed, thereby increasing the number of ads that can be shown. This happened during 2019. https://www.wheresyoured.at/the-men-who-killed-google/ |
|
| |
| ▲ | AstroBen 17 hours ago | parent | prev | next [-] | | There was a time before SEO slop that web search was really valuable We're fighting slop with condensed slop | | |
| ▲ | eloisant 4 hours ago | parent [-] | | Before Google the web was already full of SEO slop. Except "SEO" just meant "put a list of popular keywords in a block hidden with CSS". That was the time of Altavista and other search engines. So the time you're talking about is a window when Google existed, but before they gave up on fighting spam. |
| |
| ▲ | inferiorhuman 14 hours ago | parent | prev [-] | | Hard disagree. AI summaries are useless for the same reason AI summaries from Google and DDG are useless: it's almost always missing the context. The AI page summaries typically take the form of "here's the type of message that the author of this page is trying to convey" instead of "here's what the page actually says". Just give me the fucking contents. If I wanted AI slop I'd ask my fucking doorknob. | | |
| ▲ | NewsaHackO 10 hours ago | parent [-] | | I think you have some of your wires crossed, asking Google for "here's the type of message that the author of this page is trying to convey" is not what most people think is a simple question (also asking Google to reprint copyrighted material us also a non starter). Asking Google "what is the flag for persevering Metadata using scp" and getting the flag name instead of a SEO article with the a misleading title go on about so third party program that you can download that does exactly that and never actually tell you the answer is ridiculous and I am happy AI has help reduce the click bait | | |
| ▲ | inferiorhuman 9 hours ago | parent [-] | | "here's the type of message that the author of this page is trying to convey" is not what most people think is a simple question
It's also not the question I asked. I'm literally trying to parse out what question was asked. That's what makes AI slop so infuriating: it's entirely orthogonal to the information I'm after. Asking Google "what is the flag for persevering Metadata using scp" and getting the flag
name instead of a SEO article with the a misleading title go on about so third party program
that you can download that does exactly that and never actually tell you the answer is
ridiculous and I am happy AI has help reduce the click bait
Except that the AI slop Google and Microsoft and DDG use for summaries masks whether or not a result is SEO nonsense. Instead of using excerpts of the page the AI summary simply suggests that the SEO garbage is answering the question you asked. These bullshit AI summaries make it infinitely harder to parse out what's actually useful. I suppose that's the goal though. Hide that most of the results are low quality and force you to click through to more pages (ad views) to find something relevant. AI slop changes the summaries from "garbage in, garbage out" to simply "garbage out". | | |
| ▲ | NewsaHackO 23 minutes ago | parent [-] | | At least with Google, it quotes the pages where it gets the information from. Also, I think you are definitely underplaying the fact that it answers the question in one sentence,as wellas the whole ask a question get a compact answer. I am going to need a concrete example, because in my experience, the AI summary has never even required me to verify the source except out of curiosity, much less click on any search results. |
|
|
|
| |
| ▲ | weitendorf 17 hours ago | parent | prev [-] | | What happens when the monkeys stop getting bananas to work on the typewriters? More stories? | | |
| ▲ | johnnyanmac 6 hours ago | parent [-] | | Sadly these aren't monkeys. These are more like termites eating at any and all wood they can find. They'll eat at the foundation and move to the next trend to eat at. Spam by its nature is low effort, low yields anyway. They don't particularly care about making scraps since their pipeline is nearly automated. |
|
|
| |
| ▲ | justarandomname 14 hours ago | parent | prev | next [-] | | I was searching a specific niche on Youtube today, and scrolled endlessly trying to find something that wasn't AI generated. Youtube is being completely spammed. | | |
| ▲ | skandinaff 6 hours ago | parent [-] | | On that - naive proposition; shouldn't we establish say "humanTube" - service which would strictly prohibit AI content? With all this AI slop engulfing our web 2.0 - maybe this is the time and place to establish the new web for "nerds" i.e. people who care for the real thing? Just as our current web once was a place for scientists and engineers mostly, maybe we now need something as this? I feel the flaws in my own point, but maybe it's not all hopeless? | | |
| ▲ | johnnyanmac 5 hours ago | parent | next [-] | | I think by this point that premium services dedicated to quality is going to be the way to avoid the flood of AI slop that's come to us. Premium services mean QA and accountability should anything try to slip through. Closest thing in the YT space would be Nebula, but Nebula's scope is very narrow (by design). | |
| ▲ | danielbln 6 hours ago | parent | prev [-] | | We'll get content that is indistinguishable from human curated content before long, we might even already have that and it's just toupee fallacy making us only see the slop. I'm making no value statement here, just that any sort of curation attempts are probably futile. |
|
| |
| ▲ | phoronixrly 20 hours ago | parent | prev | next [-] | | > We are entering the era of cheap spam of everything with little incentive for quality Correction -- sadly, we're already well within this era | |
| ▲ | zwnow 20 hours ago | parent | prev | next [-] | | But some webdev said they are 10x faster now so it cant be bad for humanity /s | | |
| ▲ | _heimdall 3 hours ago | parent [-] | | They must be slackers, I heard from multiple sources that AI makes you a 100x developer. | | |
| ▲ | zwnow 39 minutes ago | parent [-] | | So this is why all the leading AI companies normalize 72 hour weeks. | | |
|
| |
| ▲ | weitendorf 17 hours ago | parent | prev [-] | | Quick! Tell me what this does or you're not a real programmer, not a craftsman, and are a total hack who understands nothing about quality (may not even be a good person, want to lay off bread-and-butter red-blooded american programmers) .global main .text main: mflr 27 mr 13,3 mr 14,4 addi 3,3,48 bl putchar li 3,10 bl putchar next: lwz 3,0(14) bl puts addi 14,14,4 addi 13,13,-1 cmpwi 13,0 bgt next lis 3,zone@ha addi 3,3,zone@l bl puts mtlr 27 mr 3,13 blr .data .align 2 zone: .string "Bonjour" | | |
| ▲ | llbbdd 11 hours ago | parent [-] | | Upvoting because it's a salient point and downvoters are mad. HN has a complex lately. | | |
| ▲ | pjc50 6 hours ago | parent | next [-] | | It's not even a point, it barely counts as sarcasm. I'm not sure why a chunk of hello world in ARM assembler is supposed to be relevant to anything, let alone if there's a trap hidden in there. | |
| ▲ | bakugo 7 hours ago | parent | prev | next [-] | | It's not a "salient point", it's a ridiculous strawman that has almost nothing to do with the topic being discussed. | |
| ▲ | imtringued 6 hours ago | parent | prev [-] | | The context is literally this: >Less incentive to write small libraries. Less incentive to write small tutorials on your own website. What weitendorf posted is definitively not a library, nor is there a small tutorial for the code. >Unless you are a hacker or a spammer where your incentives have probably increased. We are entering the era of cheap spam of everything with little incentive for quality. Considering the low effort to post and high effort to understand what weitendorf wrote, he might be considered a spammer given the context. The code quality is also low since his application can easily be replicated by a bunch of echo calls in a bash script, making me lean towards thinking he is a low quality spammer, given the context. >All this for the best case outcome of most people being made unemployed and rolling the dice on society reorganising to that reality. I'm not sure you can argue that weitendorf sufficiently addressed this. He put too much emphasis on an obvious strawman (real programmer) which is completely out of context. Nobody is questioning here whether someone is a programmer or not. There is no gatekeeping whatsoever. You're free to use LLMs. I'll also complain about your use of "salient" here, which generally has two meanings. The first is that something is "eye catching" (making me think more of spam), the second meaning is "relevancy/importance" to a specific thing and that's where weitendorf falls completely flat. Now you might counter and argue that he packaged all of his salient points inside the statement "want to lay off bread-and-butter red-blooded american programmers", then your position is incredibly weak, because you're deflecting from one strawman to another strawman or alternatively, your counterargument will rely heavily on reinterpretation, which again just means the point wasn't salient. |
|
|
|
|
| ▲ | noosphr 17 hours ago | parent | prev | next [-] |
| I see this as an absolute win. The state of micro dependencies of js was a nightmare that only happened because a lot of undereducated developers flooded the market to get that sweet faang money. Now that both have dried up I hope we can close the vault door on js and have people learn how to code again. |
| |
| ▲ | SchemaLoad 16 hours ago | parent | next [-] | | The best outcome was things like jquery and then lodash where a whole collection of small util functions get rolled in to one package. | | |
| ▲ | josephg 12 hours ago | parent [-] | | Oh god, without tree shaking, lodash is such a blight. I've seen so many tiny packages pull in lodash for some little utility method so many times. 400 bytes of source code becomes 70kb in an instant, all because someone doesn't know how to filter items in an array. And I've also seen plenty of projects which somehow include multiple copies of lodash in their dependency tree. Its such a common junior move. Ugh. Experienced engineers know how to pull in just what they need from lodash. But ... most experienced engineers I know & work with don't bother with it. Javascript includes almost everything you need these days anyway. And when it doesn't, the kind of helper functions lodash provides are usually about 4 lines of code to write yourself. Much better to do that manually rather than pull in some 70kb dependency. | | |
| ▲ | friendzis 7 hours ago | parent | next [-] | | > 400 bytes of source code becomes 70kb in an instant, This only shows how limited and/or impractical dependency management story is. The whole idea behind semver is that at the public interface level patch version does not matter at all and minor versions can be upped without breaking changes, therefore a release build should be safe to only include major versions referenced (or on the safe side, the highest version referenced). > Its such a common junior move. Ugh. I can see this happening if a version is pinned at an exact patch version, which is good for reproducibility, but that's what lockfiles are for. The junior moves are to pin a package at an exact patch version and break backwards compatibility promises made with semver. > Experienced engineers know how to pull in just what they need from lodash. But ... IMO partial imports are an antipattern. I don't see much value in having exact members imported listed out at the preamble, however default syntax pollutes the global namespace, which outweighs any potential benefits you get from members listed out the preamble. Any decent compiler should be able to shake dead code in source dependencies anyway, therefore there should not be any functional difference between importing specific members and importing the whole package. I have heard an argument that partial imports allow one to see which exact `sort` is used, but IMO that's moot, because you still have to perform static code analysis to check if there are no sorts used from other imported packages. | | |
| ▲ | josephg 2 hours ago | parent | next [-] | | > Any decent compiler should be able to shake dead code in source dependencies anyway, therefore there should not be any functional difference between importing specific members and importing the whole package. Part of the problem is that a javascript module is (or at least used to be) just a normal function body that gets executed. In javascript you can write any code you want at the global scope - including code with side effects. This makes dead code elimination in the compiler waay more complicated. Modules need to opt in to even allowing tree shaking by adding sideEffects: false in package.json - which is something most people don't know to do. > I don't see much value in having exact members imported listed out at the preamble The benefit to having exact members explicitly imported is that you don't need to rely on a "sufficiently advanced compiler". As you say, if its done correctly, the result is indistinguishable anyway. In my mind, anything that helps stop all of lodash being pulled in unnecessarily is a win in my books. A lot of javascript projects need all the help they can get. | |
| ▲ | zahlman 3 hours ago | parent | prev [-] | | > The whole idea behind semver is that at the public interface level patch version does not matter at all and minor versions can be upped without breaking changes, therefore a release build should be safe to only include major versions referenced (or on the safe side, the highest version referenced). ... Sorry, what does that have to do with tree shaking? |
| |
| ▲ | SchemaLoad 10 hours ago | parent | prev | next [-] | | I agree the JS standard library includes most of the stuff you need these days, rendering jquery and half of lodash irrelevant now. But there's still a lot of useful utilities in lodash, and potentially a new project could curate a collection of new, still relevant utilities. | | |
| ▲ | josephg 10 hours ago | parent [-] | | Can you give some examples? I’ve written js / ts for 10-15 years and I’ve never reached for lodash in my life. | | |
| ▲ | epicureanideal 9 hours ago | parent [-] | | _.any, some, every, keyBy. | | |
| ▲ | skrebbel 8 hours ago | parent [-] | | _.any doesn’t seem to exist. Some and every are also in JS (with the same names even!). keyBy is just Array.map -> Object.fromEntries | | |
| ▲ | dsego 5 hours ago | parent [-] | | It was more useful before when browsers didn't support array.map and fromEntries. That's the origin of all these libraries, but browsers caught up. Things like keyBy, groupBy, debounce, uniqueId, and some others, are still useful. |
|
|
|
| |
| ▲ | miki123211 8 hours ago | parent | prev | next [-] | | The problem with helper functions is that they're often very easy to write, but very hard to figure out the types for. Take a generic function that recursively converts snake_case object keys to pascalCase. That's about 10 lines of Javascript, you can write that in 2 mins if you're a competent dev. Figuring out the types for it can be done, but you really need a lot of ts expertise to pull it off. | | |
| ▲ | shakow 8 hours ago | parent [-] | | Not really familiar with TS, but what would be so weird with the typing? Wouldn't it be generic over `T -> U`, with T the type with snake_case fields and U the type with pascalCase fields? | | |
| |
| ▲ | debtta 7 hours ago | parent | prev [-] | | Unless you're part of the demoscene or your webpage is being loaded by Voyager II, why is 70kb of source code a problem? Not wanting to use well constructed, well tested, well distributed libraries to make code simpler and more robust is not motivated by any practical engineering concern. It's just nostalgia and fetishism. | | |
| ▲ | josephg 5 hours ago | parent | next [-] | | > why is 70kb of source code a problem? Because javascript isn't compiled. Its distributed as source. And that means the browser needs to actually parse all that code before it can be executed. Parsing javascript is surprisingly slow. 70kb isn't much on its own these days, but it adds up fast. Add react (200kb), a couple copies of momentjs (with bundled timezone databases, of course) (250kb or something each) and some actual application code and its easy to end up with ~1mb of minified javascript. Load that into a creaky old android phone and your website will chug. For curiosity's sake, I just took a look at reddit in dev tools. Reddit loads 9.4mb of javascript (compressed to 3mb). Fully 25% of the CPU cycles loading reddit (in firefox on my mac) were spent in EvaluateModule. This is one of the reasons wasm is great. Wasm modules are often bigger than JS modules, but wasm is packed in an efficient binary format. Browsers parse wasm many times faster than they parse javascript. | |
| ▲ | EdiX 6 hours ago | parent | prev [-] | | > why is 70kb of source code a problem? It isn't but then everyone does it and then everyone does it recursively and 70kb become 300MB and then it matters. Not to mention that "well constructed, well tested, well distributed" are often actually overengineered and poorly maintained. |
|
|
| |
| ▲ | fransje26 4 hours ago | parent | prev | next [-] | | > Now that both have dried up They have?? And is it not going to be orders of magnitude worse with the vibe-coded crud hitting the internet? | |
| ▲ | KPGv2 16 hours ago | parent | prev | next [-] | | Your first sentence cheers that we're moving from NPM micropackages to LLM-generated code, and then you say this will result in people having to learn to code again. I don't see how the conclusion follows from this. There will be many LLM-generated functions purporting to do the same thing, and a bug in one of them that gets fixed means only one project gets fixed instead of every project using an NPM package as a dependency. | | |
| ▲ | finaard 8 hours ago | parent | next [-] | | There will be many LLM-generated functions doing the same thing, in the same project, unless the human pays attention. I've been playing a lot recently with various models, lately with the expensive claude models (API, for the large context windows), and in every attempt things are really impressive at the beginning, and start going south once the codebase reaches about 10k to 15k lines of code. Even with tools split out into a separate library and separate documentation at that point it has a tendency to generate tool functions again in the module it's currently working on over taking the already defined one in the helper library. | |
| ▲ | austin-cheney 9 hours ago | parent | prev [-] | | You mention that you don’t see how this could result in developers that write code. Then you immediately follow that with fixing bugs. Yes, use of LLMs is still developers not writing original code, but it’s still an improvement (a minor one) of the copy/paste of micro dependencies. Ultimately developers aren’t going to figure out writing original code until they are forced to do so from changed conditions within their employer. |
| |
| ▲ | danelski 16 hours ago | parent | prev | next [-] | | I don't quite understand your argument. Wasn't the post about how users might replace transparent dependencies with transparent LLM drop-ins? I don't see how having an LLM to do the same job would enable someone to learn more. They're probably the kind of person who will ask the LLM to perform a refactor when problems arise, so they won't learn that much through osmosis. | |
| ▲ | croes 16 hours ago | parent | prev [-] | | Now you got vibe coder | | |
| ▲ | mrweasel 2 hours ago | parent [-] | | Yeah, you're sort of swapping in one issue for another. I agree that micro dependencies needs to be tackled, but perhaps not using LLMs. I could fear that rather than keep pushing for a Javascript standard library, which would encompass all these smaller function, we now just get more or less the same defective implementations generated by LLMs, but hidden in thousands of repos, where tools won't find security issues. At least with NPM we can pull in updated versions with NPM tells us that we're running an outdated version. Who is going to traverse your proprietary code base and let you know that the vibe coded left-pad Claude put in three years ago is buggy? |
|
|
|
| ▲ | Aperocky 19 hours ago | parent | prev | next [-] |
| > the era of small, low-value libraries like blob-util is over. Thankfully (not against blob-util specifically because I've never intentionally used it), I wouldn't completely blame llms either since languages like Go never had this dependency hell. npm is a security nightmare not just because of npm the package manager, because the culture of the language rewards behavior such as "left-pad". Instead of writing endless utilities for other project to re-use, write actual working things instead - that's where the value/fun is. |
| |
| ▲ | ncruces 19 hours ago | parent | next [-] | | But as Go puts it: “A little copying is better than a little dependency.” https://go-proverbs.github.io/ | | |
| ▲ | threatofrain 18 hours ago | parent | next [-] | | Copying is just as much dependency, you just have to do maintenance through manual find-and-replace now. | | |
| ▲ | ninkendo 18 hours ago | parent | next [-] | | > you just have to do maintenance through manual find-and-replace now Do you? It doesn't seem even remotely like an apples-to-apples comparison to me. If you're the author of a library, you have to cover every possible way in which your code might be used. Most of the "maintenance" ends up being due to some bug report coming from a user who is not doing things in the way you anticipated, and you have to adjust your library (possibly causing more bugs) to accommodate, etc. If you instead imaging the same functionality being just another private thing within your application, you only need to make sure that functionality works in the one single way you're using it. You don't have to make it arbitrarily general purpose. You can do error handling elsewhere in your app. You can test it only against the range of inputs you've already ensured are the case in your app, etc. The amount of "maintenance" is tiny by comparison to what a library maintainer would have to be doing. It seems obvious to me that "maintenance" means a much more limited thing when talking about some functionality that the rest of your app is using (and which you can test against the way you're using it), versus a public library that everyone is using and needs to work for everyone's usage of it. | | |
| ▲ | jorvi 16 hours ago | parent [-] | | > If you're the author of a library, you have to cover every possible way in which your code might be used. You don't actually. You write the library for how you use it, and you accept pull requests that extend it if you feel it has merit. If you don't, people are free to fork it and pull in your improvements periodically. Or their fork gets more popular, and you get to swap in a library that is now better-maintained by the community. As long as you pin your package, you're better off. Replicating code pretty quickly stops making sense. | | |
| ▲ | marcus_holmes 15 hours ago | parent [-] | | It's a rare developer (or human for that matter) who can just shrug and say "fork off" when asked for help with their library. | | |
| ▲ | jorvi 2 hours ago | parent | next [-] | | It really depends. If it's the occasional request and I can bang out a solution in 30 minutes, I'll help. But I'll also weigh how much maintenance burden it'll be going forward. And if I won't do it myself, I'd always give some quick pointers. Maintenance demands (your library X doesn't work with Python Y, please maintain another version for me) I'd shrug off. Wait for me, pay me, or fix it yourself. | |
| ▲ | j1elo 12 hours ago | parent | prev [-] | | It would be healthy that it becomes more common, in fact the privately-owned public garden model of the Valetudo project [1] is the sanest way for FOSS maintainers to look at their projects. [1]: https://github.com/Hypfer/Valetudo#valetudo-is-a-garden |
|
|
| |
| ▲ | noosphr 17 hours ago | parent | prev | next [-] | | Copied text does not inject bitcoin mining malware three months after I paste it. | | |
| ▲ | KPGv2 16 hours ago | parent [-] | | Neither does a dependency you don't update, though, which is isomorphic to copied code you never update. | | |
| ▲ | chii 4 hours ago | parent [-] | | somehow, in the js/npm world, dependencies are updated willy nilly, which is the cause of a lot of that ecosystem's headaches. |
|
| |
| ▲ | sodapopcan 18 hours ago | parent | prev | next [-] | | Usually these types if things never change. I understand that all code is a liability, but npm takes this way too far. Many utility functions can be left untouched for many years if not forever. | | |
| ▲ | KPGv2 16 hours ago | parent [-] | | It's not NPM. It's JS culture. I've done a lot of time programming in TypeScript, and it never fails that in JS programmer circles they are constantly talking about updating all their packages, completely befuddled why I'd be using some multiple year old version of a library in production, etc. Meanwhile Java goes the other way: twenty-year old packages that are serious blockers to improved readability. Running Java that doesn't even support Option (or Maybe or whatever it's called in Java). | | |
| ▲ | llbbdd 11 hours ago | parent [-] | | Java writes to a bytecode spec that has failed to keep up with reality, to its detriment. Web development keeps up with an evolving spec pushed forward by compatibility with what users are actually using. This is "culture" only in the most distant, useless sense of the word. It is instead context, which welcomes it back into the world of just fucking developing software, no matter how grey-haired HN gets with rage while the world moves on. EDIT: Obvious from the rest of your responses in this thread that this is trolling, leaving this up for posterity only |
|
| |
| ▲ | SchemaLoad 16 hours ago | parent | prev | next [-] | | Most of these util libraries require basically no changes ever. The problem is the package maintainers getting hacked and malicious versions getting pushed out. | | |
| ▲ | KPGv2 16 hours ago | parent [-] | | If you use an LLM to generate a function, it will never be updated. So why not do the same thing with a dependency? Install it once and never update it (and therefore hacked and malicious versions can never arrive in your dependency tree). You're a JS developer, right? That's the group who thinks a programmer's job includes constantly updating dependencies to the latest version constantly. | | |
| ▲ | llbbdd 11 hours ago | parent | next [-] | | You're not a web developer, right? See my other comment about context if you want to learn more about the role of context in software development in general. If you keep repeating whatever point you're trying to make about some imaginary driving force to pointlessly update dependencies in web dev, you'll probably continue to embarrass yourself, but it's not hard to understand if you read about it instead of repeating the same drivel under every comment in this thread. | |
| ▲ | nineteen999 13 hours ago | parent | prev [-] | | > Install it once and never update it (and therefore hacked and malicious versions can never arrive in your dependency tree). Huh? What if your once-off installation or vendoring IS a hacked an malicious version and you never realise and never update it. That's worse. | | |
| ▲ | llbbdd 11 hours ago | parent [-] | | Hardly worth responding to, from other comments they're defending Java. They're not used to updates. |
|
|
| |
| ▲ | ncruces 17 hours ago | parent | prev | next [-] | | Keyword: little. Dependencies need to pull their own weight. Shifting responsibilities is a risk that the value added needs to offset. | |
| ▲ | jamietanna 18 hours ago | parent | prev | next [-] | | Yeah it's the main thing I really dislike about this - how do you make sure you know where it's from? (ie licensing) What if there are updates you need? Are you going to maintain it forever? For some definition of "small piece of code" that may be ok, but also sometimes this is more than folks consider | | |
| ▲ | skydhash 18 hours ago | parent [-] | | Do you know that you can just add a small text file or a comment explaining that a module is vendored code. Ad updates is handled the same way as the rest of the code. And you will be “maintaining” it as long as you need to. Libraries are not “here be dragons” best left to adventurous ones. |
| |
| ▲ | msla 18 hours ago | parent | prev [-] | | If I vendor a dependency that currently works for what my program does, I only have to care about it again if a security hole is discovered in it or if my program changes and the dependency is insufficient in some way. I don't have to worry about the person I'm importing code from going weird or introducing a bug that affects me. |
| |
| ▲ | skydhash 18 hours ago | parent | prev [-] | | Yep something like blob util could be a blog post or a gist (or several stack overflow answers). And a lot of NPM library falls under that. They always have the anemic standard library of JavaScript forgetting that the C library is even smaller. |
| |
| ▲ | kermatt 18 hours ago | parent | prev [-] | | > since languages like Go never had this dependency hell What is the feature of Go that this is referring to? | | |
| ▲ | ncruces 17 hours ago | parent [-] | | It's a cultural thing. And libraries try harder not to have absurd dependencies, than finished products (correctly, IMO). |
|
|
|
| ▲ | mccoyb 19 hours ago | parent | prev | next [-] |
| I don’t think open source is going anywhere. It’s posed to get significantly stronger — as the devs which care about it learn how to leverage AI tools to make things that corporate greasemonkeys never had the inspiration to. Low quality code spammers are just marketing themselves for jobs where they can be themselves: soulless and devoid of creative impulse. That’s the thing: open source is the only place where the true value (or lack of value) of these tools can be established — the only place where one can test mettle against metal in a completely unconstrained way. Did you ever want to build a compiler (or an equally complex artifact) but got stuck on various details? Try now. It’s going to stand up something half-baked, and as you refine it, you will learn those details — but you’ll also learn that you can productively use AI to reach past the limits of your knowledge, to make what’s beyond a little more palatable. All the things people say about AI is true to some degree: my take is that some people are rolling the slots to win a CRUD app, and others are trying to use it to do things that they could only imagine before —- and open source tends to be the home of the latter group. |
| |
| ▲ | nowittyusername 19 hours ago | parent | next [-] | | True innovation will come from open source for sure. As the developers don't have the same economic incentives to be "safe", "ethical" "profitable" or whatever. large corporations know this and fear this development. That's why i expect a significant lobbying to take hold in USA that will try and make local AI systems illegal. And I think they will be very convincing to the government. Because the government also fears the "peasants" and giving them any true semblance of real AGI like systems. I bet very soon we will start seeing various classifications that will define what is legal and what is not for a citizen to possess or use. | | |
| ▲ | pessimizer 16 hours ago | parent [-] | | > That's why i expect a significant lobbying to take hold in USA that will try and make local AI systems illegal. I think they're going to be using porn and terrorism (as usual) to do that, but also child suicide. I also think they're going to leverage this rhetoric to lock down OSes in general, by making them uninstallable on legally-available hardware unless approved, because approved OSes will only be able to run approved LLMs. Meaning that I think LLMs/generative AI will be the lever to eliminate general-purpose computing. As mobile went, so will desktop. I think this is inevitable. The real question for me is whether China will partner with the west on this, or whether we will be trading Chinese CPUs with each other like contraband in order to run what we want. > any true semblance of real AGI like systems. This is the only part I don't agree with. This isn't going to happen, but I'm not even sure it would be more useful than what we have. We have billions of full AGI machines walking around, and most of them aren't great. I'm talking about restrictions on something technically barely better than what we have now; maybe only a significant bit more compute-efficient. Training techniques will probably be where we get the most improvements. |
| |
| ▲ | exasperaited 19 hours ago | parent | prev [-] | | > It’s posed to get significantly stronger It's really not. Every project of any significance is now fending off AI submissions from people who have not the slightest fucking clue about what is involved in working on long-running, difficult projects or how offensive it is to just slather some slop on a bug report and demand it is given scrutiny. Even at the 10,000 feet view it has wasted people's time because they have to sit down and have a policy discussion about whether to accept AI submissions, which involves people reheating a lot of anecdotal claims about productivity. Having learned a bit about how to write compilers I know enough to know that I can guarantee you that an AI cannot help you solve the difficult problems that compiler-building tools and existing libraries cannot solve. It's the same as it is with any topic: the tools exist and they could be improved, but instead we have people shoehorning AI bollocks into everything. | | |
| ▲ | doug_durham 18 hours ago | parent | next [-] | | This isn't an AI issue. It is a care issue. People shouldn't submit PRs to project where they don't care enough to understand the project they are submitting to or the code they are submitting. This has always been a problem, there is nothing new. The thing that is new is more people can get to a point where they can submit regardless of their care or understanding. A lot of people are trying to gild their resume by saying they contributed to a project. Blaming AI is blaming the wrong problem. AI is a a tool like a spreadsheet. Project owners should instead be working ways to filter out careless code more efficiently. | | |
| ▲ | johnnyanmac 5 hours ago | parent | next [-] | | That's why I'm not super optimistic. Even pre-AI and tech slump there were talks about how hard it may be to replace the old guard maintaining these open source initiatives. Now... >Blaming AI is blaming the wrong problem. AI is a a tool like a spreadsheet. Project owners should instead be working ways to filter out careless code more efficiently. When care leaves, the entire commons starts to fall apart. New talent doesn't come in. Old talent won't put up with it and retire out of the scene. They already have so much work to do, needing to add in non-development work to make better spam filters may very well be the final stray. Even when the careless leave, it won't bring back the talent lost. Directing the blame onto the sure won't do that. | |
| ▲ | exasperaited 18 hours ago | parent | prev [-] | | This is an AI issue because people, including the developers of AI tools, don't care enough. The Tragedy Of The Commons is always about this: people want what they want, and they do not care to prevent the tragedy, if they even recognise it. > Project owners should instead be working ways to filter out careless code more efficiently. Great. So the industry creates a burden and then forces people to deal with it — I guess it's an opportunity to sell some AI detection tools. | | |
| ▲ | danielbln 5 hours ago | parent [-] | | We don't need an AI detector, we need a "human vetted" detector. | | |
| ▲ | exasperaited 3 minutes ago | parent | next [-] | | People arguing against my point here seem to be doing a good job of validating my point. | |
| ▲ | johnnyanmac 5 hours ago | parent | prev [-] | | Who's paying the human to vet it? Or will we have volunteers dedicated to being AI detectors instead of developers? | | |
| ▲ | danielbln 5 hours ago | parent [-] | | I don't have those answers. My point was that trying to outright ban any AI is futile and probably overall counter productive, and that we need to find ways to ensure a human hasn't submitted slop. I don't have an answer as to the how. |
|
|
|
| |
| ▲ | windward 3 hours ago | parent | prev | next [-] | | >Every project of any significance is now fending off AI submissions Not anything with a cathedral model. 'open source' is too ambiguous to be useful. | |
| ▲ | theshrike79 5 hours ago | parent | prev | next [-] | | > Every project of any significance is now fending off AI submissions from people who have not the slightest fucking clue I'm kinda hoping that Github will provide an Anubis-equivalent for issue submissions by default. | |
| ▲ | mccoyb 19 hours ago | parent | prev | next [-] | | Sounds like a lot of FUD to me — if major projects balk at the emergence of new classes of tools, perhaps the management strategy wasn’t resilient in the first place? Further: sitting down to discuss how your project will adapt to change is never a waste of time, I’m surprised you stated it like that. In such a setting, you’re working within a trusted party — and for a major project, that likely means extremely competent maintainers and contributors. I don’t think these people will have any difficulty adapting to the usage of these tools … | | |
| ▲ | johnnyanmac 5 hours ago | parent | next [-] | | > if major projects balk at the emergence of new classes of tools, perhaps the management strategy wasn’t resilient in the first place? It's not the tools, it's the quality. No FOSS dev would care where the code came from if it followed the contributor's guidelines and coding style. This is why it's a spam issue. a bunch of low quality submissions only gum up the time of such developers and slows the entire process down. >that likely means extremely competent maintainers and contributors. Your assumption falls apart here, sadly. Dunning-Kruger hits hard here for new contributors powered by LLMs and the maintainers suffer the brunt of the hit. | | |
| ▲ | mccoyb an hour ago | parent [-] | | Why not just disallow PRs from non-vetted contributors? Why not just disallow issues without a vetting process? Many of these things could be explored -- you're right: it's a spam issue. But we have solutions to spam issues ... filters. LLMs have shown that "praying for the best" with permissive repository settings is not sufficient. We can and will improve our filters, no? |
| |
| ▲ | exasperaited 18 hours ago | parent | prev [-] | | > Further: sitting down to discuss how your project will adapt to change is never a waste of time, I’m surprised you stated it like that. It is a waste of time for large-scale volunteer-led projects who now have to deal with tons of shit — when the very topic is "how do we fend off this stuff that we do not want, because our project relies on much deeper knowledge than these submissions ever demonstrate?" |
| |
| ▲ | micromacrofoot 19 hours ago | parent | prev [-] | | yeah we are getting lots of "I don't know how to do this and AI gave me this code that doesn't work, can you fix it" or "AI said it can do this" and the feature doesn't exist... some people will even argue and say "but AI said it doesn't take long, why won't you add it" | | |
| ▲ | exasperaited 18 hours ago | parent [-] | | It weaponises incompetence, carelessness and arrogance at every turn. AI, to me, is a character test: I'm regularly fascinated by finding out who fails it. For example, in my personal life I have been treated to AI-generated comms from someone that I would never have expected it from. They don't know I know, and they don't know that I think less of them, and I always will. | | |
| ▲ | danielbln 5 hours ago | parent [-] | | I will never judge someone for using AI, but I will absolutely judge anyone for lobbing slop at me. I define slop as low effort, non-vetted, fiest-try-output. |
|
|
|
|
|
| ▲ | Levitating 17 hours ago | parent | prev | next [-] |
| I am sure I am not the only one who thinks these micro-dependencies are worthless anyway. You'd be better off just listing the functions in a markdown file for people to copy over than ship an entire package for it. This isn't "small" open source, "small" would be something you put together in a week or weekend. These are like "micro" projects, where more work goes into actually publishing and maintaining the repository than actually writing the library. I like the approach C sometimes takes, with the "tiny header file" type of libraries. Though I guess that also stems from the lack of a central build system. |
| |
| ▲ | _heimdall 2 hours ago | parent | next [-] | | Vendoring dependencies is how I remember doing it for web projects pre-NPM. Find an open source, well tested library and copy the source into your project. You have to manually update for any releases you care about, but that is also an incentive to keep dependency count low. | |
| ▲ | llbbdd 11 hours ago | parent | prev | next [-] | | Why aren't those tiny header file libraries just part of the standard C library? Wait sorry, I don't mean that. I read too many bog-standard HN comments about NPM above. | |
| ▲ | eviks 11 hours ago | parent | prev [-] | | What's your copy& paste solution to security updates? | | |
| ▲ | 1718627440 6 hours ago | parent | next [-] | | The security maintenance of the ten lines of code I have read and copied into my code is the same as the ten lines of code next to it, that I have written myself. | |
| ▲ | immibis 6 hours ago | parent | prev [-] | | Does left-pad have security updates? You may as well ask what's the security update solution for Stack Overflow answers. |
|
|
|
| ▲ | ef2k 15 hours ago | parent | prev | next [-] |
| > I’m still trying to figure out what kinds of open source are worth writing in this new era Is there any upside to opensourcing anything anymore? Anything published today becomes training data for the next model, with no attribution to the original work. If the goal is to experiment, share ideas, or let others learn from the work, maybe the better default now is "source available", instead of FOSS in the classic sense. It gives people visibility while setting clearer boundaries on how the work can be used. I learned most of what I know thanks to FOSS projects so I'm still on the fence on this. |
| |
| ▲ | llbbdd 11 hours ago | parent | next [-] | | I keep seeing this attitude and I don't really understand it at all; there's no upside to publishing open source work because it might be utilized by more people, is that correct? Or is it the attribution? There are many many libraries I have used and continue to use and I don't know the author's internet handle or Christian name. Does that matter? Why? I have written a lot of code that my name is no longer attached to. I don't care and I don't know why anyone does. If it were valuable I would have made more money off of it in the first place, and I don't have the ego to just care that people know it's my code either. I want the things I do today to have an upside for people in the future. If that means I write code that gets incorporated into a model that people use to build things N number of years from now, that's great. That's awesome. Why the hell is that apparently so demotivating to some people? | | |
| ▲ | johnnyanmac 5 hours ago | parent [-] | | > there's no upside to publishing open source work because it might be utilized by more people, is that correct? I believe the perspective here is "I make code for fellow hackers to look into, critique, be educated on, or simply play with". If you see the hacker scene as a social one, LLM's are an awful black hole that sucks up everything around it and ruins this collaboration. Not to mention that the hacker scene was traditionally thought to be a rejection of what we now call "Big Tech". Corporate was free to grab the code, but it didn't matter much as long as the scene was kept. Now even that invisible social contract is broken. But I suppose if you're of a diehard FOSS mentality, "Free" means "Free". Free to be used to build, or destroy society at its whim. a hivemind to meld into and progress the overall understanding of science, for science's sake. I'll admit the last few years have had me questioning what I truly want to do within the on these two mentalities. |
| |
| ▲ | xgulfie 15 hours ago | parent | prev | next [-] | | Sorry but source-available is probably going to get slurped up for training data as well Microsoft already did this for all code in every public repo. | | |
| ▲ | pabs3 13 hours ago | parent [-] | | When are they going to start doing it for private repos too... | | |
| ▲ | johnnyanmac 5 hours ago | parent [-] | | I wuldn't discount it already happening. They do own the most popular code hosing repository, after all. |
|
| |
| ▲ | theturtlemoves 8 hours ago | parent | prev | next [-] | | You're not obligated to give away your mind for free. You're free to share, of course. But sharing implies reciprocity, a back and forth. The internet used to be like that, but if the environment changes, you adapt your behavior accordingly. In the long run I think it's time to starve the system from input until it's attitude reverts to reciprocal. It's not what I'd want, but it seems necessary. People learn from consequences, not from words alone | | |
| ▲ | immibis 6 hours ago | parent [-] | | Or use AGPL licenses. As a bonus, nobody in JavaScript checks licenses so you might be able to sue Amazon for money. |
| |
| ▲ | 1gn15 13 hours ago | parent | prev | next [-] | | Staying true to free software principles. It's unethical to publish nonfree code or binaries. | | |
| ▲ | llbbdd 11 hours ago | parent [-] | | Code is only useful if it's used. I could write a ton of code and be buried with it, or publish it for people (or AI software, or dolphins or aliens) to use. Who has the energy to have Anubis measure whether my code, or yours, is ethical enough? I'm going to die someday! |
| |
| ▲ | justarandomname 14 hours ago | parent | prev | next [-] | | This is kinda how I've felt for months. I don't have any interest in continuing existing open source projects and don't want to create any new ones. What's the point? All of my personal projects for the past few months have been entirely private, I don't even host them on Github anymore, I have a private Forgejo instance I use instead. I also don't trust any new open source project I stumble upon anymore unless I know it was started at least a year ago. | |
| ▲ | zahlman 3 hours ago | parent | prev [-] | | > It gives people visibility while setting clearer boundaries on how the work can be used. ... Because those would be respected? |
|
|
| ▲ | jasonjmcghee 41 minutes ago | parent | prev | next [-] |
| > it’s a future where we prize instant answers over teaching and understanding It doesn't have to be this way- coupled with verification (to mitigate hallucination), llms can help so much with personal education. It's a purely judgement free environment and for a lot of people that's incredibly meaningful. |
|
| ▲ | cactusfrog 19 hours ago | parent | prev | next [-] |
| This author assumes that open sourcing a package only delivers value if is added as a dependency. Publicly sharing code with a permissive license is still useful and a radical idea. |
| |
| ▲ | lanstin 19 hours ago | parent | next [-] | | Yeah if I find some (small) unmaintained code I need, I will just copy it (then add in my metrics and logging standards :) It shouldn't be a radical idea, it is how science overall works. Also, as per the educational side, I find in modern software ecosystem, I don't want to learn everything. Excellent new things or dominantly popular new things, sure, but there are a lot of branching paths of what to learn next, and having Claude code whip up a good enough solution is fine and lets me focus on less, more deeply. (Note: I tried leaving this comment on the blog but my phone keyboard never opened despite a lot of clicking, and on mastodon but hit the length limit). | |
| ▲ | LtWorf 7 hours ago | parent | prev | next [-] | | A copyleft license is much better because it ensures that it will remain open and in most cases large companies won't use it, making sure they will have to shell out the money to hire someone to do it instead. | |
| ▲ | positron26 19 hours ago | parent | prev | next [-] | | Yep. Even just sharing code with any license is valuable. Much I have learned from reading an implementation of code I have never run even once. Solutions to tough problems are an under-recognized form of generosity. This is a point where the lack of alignment between the free beer crowd and those they depend on is all too clear. The free beer enthusiast cannot imagine benefiting from anything other than a finished work. They are concerned about the efficient use of scarce development bandwidth without consciousness of why it is scarce or that it is not theirs to direct. They view solutions without a hot package cache as a form of waste, oblivious to how such solutions expedite the development of all other tools they depend on, commercial or free. | |
| ▲ | shevy-java 18 hours ago | parent | prev [-] | | I do agree with this, but there are some caveats. At the end of the day it is time people invest into a project. And that is often unpaid time. Now, that does not mean it has no value, but it is a trade-off. After about 14 years, for instance, I retired permanently from rubygems.org in 2024 due to the 100k download limit (and now I wouldn't use it after the shameful moves RubyCentral did, as well as the new shiny corporate rules with which I couldn't operate within anyway; it is now a private structure owned by Shopify. Good luck finding people who want to invest their own unpaid spare time into anything tainted by corporations here). |
|
|
| ▲ | BrenBarn 19 hours ago | parent | prev | next [-] |
| > Sure, you could use blob-util, but then you’d be taking on an extra dependency, with unknown performance, maintenance, and supply-chain risks. Use of an AI to write your code is also a form of dependency. When the LLM spits out code and you just dump it in your project with limited vetting, that's not really that different from vendoring a dependency. It has a different set of risks, but it still has risks. |
| |
| ▲ | cortesoft 18 hours ago | parent | next [-] | | Part of the benefit over a dependency is that the code added will (hopefully) be narrowly tailored to your specific need, rather than the generic implementation from a library that likely has support for unused features. Not including the unused features both makes the code you are adding easier to read and understand, but it also may be more efficient for your specific use case, since you don't have to take into account all the other possible use cases you don't care about. | | |
| ▲ | bloomca 17 hours ago | parent | next [-] | | But in a lot of cases you can't know all the dependencies, so you lean on the community trusting that a package solves the problem well enough that you can abstract it. You can pin the dependency and review the changes for security reasons, but fully grasping the logic is non-trivial. Smaller dependencies are fine to copy at first, but at some point the codebase becomes too big, so you abstract it and at that point it becomes a self-maintained dependency. Which is a fair decision, but it is all about tradeoffs and sometimes too costly. | | |
| ▲ | mkj 16 hours ago | parent [-] | | You'd get those benefits from traditional dependencies if you copy them in and never update. Is an AI dependency going to have the equivalent of "upstream fixes"? | | |
| ▲ | cortesoft 11 hours ago | parent [-] | | Probably? LLMs will train on fixes, then if you run the code through the LLM again to fix it. |
|
| |
| ▲ | lmm 10 hours ago | parent | prev [-] | | > Part of the benefit over a dependency is that the code added will (hopefully) be narrowly tailored to your specific need, rather than the generic implementation from a library that likely has support for unused features. In decent ecosystems there should be low or zero overhead to that. > Not including the unused features both makes the code you are adding easier to read and understand, but it also may be more efficient for your specific use case, since you don't have to take into account all the other possible use cases you don't care about. Maybe. I find generic code is often easier to read than specialised custom implementations, because there is necessarily a proper separation of concerns in the generic version. |
| |
| ▲ | nolanl 18 hours ago | parent | prev | next [-] | | Right, but you do avoid worries like "will I have to update this dependency every week and deal with breaking changes?" or "will the author be compromised in a supply-chain attack, or do a deliberate protestware attack?" etc. As for performance, a lot of npm packages don't have proper tree-shaking, so you might be taking on extra bloat (or installation cost). Your point is well-taken, though. | | |
| ▲ | rcxdude 17 hours ago | parent | next [-] | | You can avoid all those worries by vendoring the code anyway. you only 'need' to update it if you are pulling it in as a separate dependency. | |
| ▲ | KPGv2 16 hours ago | parent | prev [-] | | > you do avoid worries like "will I have to update this dependency every week and deal with breaking changes? This is not a worry with NPM. You can just specify a specific version of a dependency in your package.json, and it'll never be updated ever. I have noticed for years that the JS community is obsessed with updating every package to the latest version no matter what. It's maddening. If it's not broke, don't fix it! |
| |
| ▲ | danelski 16 hours ago | parent | prev | next [-] | | Wouldn't call it a risk in itself, but part of the benefit of using a library, a good and tailored one at least, is that it'll get modernised without my intervention. Even if the code produced for you was state-of-the-art at the moment of inclusion, will it remain that way 5 years from now? | |
| ▲ | ronbenton 18 hours ago | parent | prev [-] | | > and you just dump it in your project with limited vetting Well yes there’s your problem. But people have been doing this with random snippets found on the internet for a while now. The truth is that irresponsibles developers will produce irresponsible code, with or without LLMs | | |
| ▲ | fullofideas 18 hours ago | parent [-] | | > The truth is that irresponsibles developers will produce irresponsible code, with or without LLMs
True. But the difference is the scale and ease of doing with code generators. With a few clicks you can add hundreds of lines of code which supposedly does the right thing. While in the past, you would get code snippets for a particular aspect of the problem that you are trying to solve. You still had to figure out how to add it to your code base and somehow make it “work” | | |
| ▲ | ninalanyon 17 hours ago | parent [-] | | Surely in any responsible development environment those hundreds of lines of code still have to be reviewed. Or don't people do code review any more? I suppose one could outsource the code review to an AI, preferably not the one that wrote it though. But if you do that surely you will end up building systems that no one understands at all. | | |
| ▲ | fullofideas 17 hours ago | parent [-] | | Agree. Any reasonable team should have code reviews in place, but an irresponsible coder would push the responsibility of code quality and correctness to code reviewers. They were doing it earlier too, but the scale and scope was much smaller. |
|
|
|
|
|
| ▲ | stevage 4 hours ago | parent | prev | next [-] |
| >Many software developers will argue that asking a candidate to reverse a binary tree is pointless Is "reversing a binary tree" actually a thing, or is this a cute kind of "rocket surgery" phrase intentionally mixing reversing a linked list and searching a binary tree? |
| |
| ▲ | nitnelave 3 hours ago | parent | next [-] | | I think it's a reference to the Google interview problem that the author of Homebrew (IIRC) failed. They were quite upset about it since they have proved their worth through their famous open-source contributions, but got rejected in a LeetCode-like interview. | |
| ▲ | stingraycharles 3 hours ago | parent | prev [-] | | It’s probably a mistake on the author’s end, but the problem comes across anyway. I can only imagine reversing a binary tree would imply changing the “<“ comparison in nodes to “>” which would be a useless exercise |
|
|
| ▲ | hnlmorg 19 hours ago | parent | prev | next [-] |
| More likely, what we will see is the decline of low effort projects. The JavaScript/ Typescript ecosystem has been plagued with such packages. But that’s more anomalous to the JS community than it is a systemic problem with open source in general. So if fewer people are including silly dependencies like isEven or leftPad, then I see that as a positive outcome. |
|
| ▲ | marcus_holmes 15 hours ago | parent | prev | next [-] |
| I don't buy the education angle. If you're not learning to code, then you want efficient code, so the comments are wasted bytes (ok, not a huge expense, but still). If you are learning to code, or just want to understand how this code works, then asking an LLM is going to get a lot better result. LLMs are fantastic tutors. Endlessly patient, go down any rabbit hole with you, will continue explaining a concept until you get it, etc. I think they're going to revolutionise education, especially for practical subjects like coding. Respect to the author for trying to educate, though. |
| |
| ▲ | joeblubaugh 14 hours ago | parent [-] | | > so the comments are wasted bytes Is there any modern compiler where the output code has anything to do with the comments in the source? |
|
|
| ▲ | Waterluvian 11 hours ago | parent | prev | next [-] |
| AI offering the solution for a small problem that probably doesn’t deserve yet another dependency suggests to me that there’s a middle ground that we’ve failed to sufficiently cover: how to socialize code snippets that you’re meant to just inline into your project. Stack Overflow is probably the closest we’ve gotten to a generalized solution and it doesn’t exactly feel like a good one. I came across this once before in the form of a react hooks library that had no dependency to install. It was just a website and when you found the hook you wanted you were meant to paste it into your project. |
| |
| ▲ | jcgl 4 hours ago | parent [-] | | GitHub gists are probably the best technical option. They come with versioning (I think), and have comment threads. | | |
| ▲ | Waterluvian 2 hours ago | parent [-] | | And a decent API as well. I think what’s missing is some amount of organization to make them more discoverable. |
|
|
|
| ▲ | tanin 12 hours ago | parent | prev | next [-] |
| Right now I tend to not use an external library unless the code would be large e.g. http server and/or the library is extremely popular. Otherwise, writing it myself is much better. It's more customizable and often much smaller in size. This is because the library has to generalize, and it comes with bloats. Using AI to help write is great because I should understand that anyway whether AI writes it or not or whether it's in an external library. One example recently is that I built a virtual list myself. The code is much smaller and simpler compared to other popular libraries. But of course it's not as generalizable. |
|
| ▲ | meken 15 hours ago | parent | prev | next [-] |
| > ….but I do think it’s a future where we prize instant answers over teaching and understanding It depends. For stuff I don’t care about I’m happy to treat it as a black box. Conversely, AI now allows me to do deep dive on essentially anything I’m interested in, which has been a massive boon to my learning ability. |
|
| ▲ | CuriouslyC 20 hours ago | parent | prev | next [-] |
| Small open source is still valuable, but the bar is higher. If your project is something that's trivial and nobody just thought to do it before you and bothered to do it after, that's probably not going to survive, but if your project is a small focused tool that handles something difficult really well, it's 100% got a future. |
| |
| ▲ | inferiorhuman 13 hours ago | parent [-] | | Earlier this year I wrote an interpreter for a niche, proprietary binary format. Someone asked if I could open source it so that they could (more easily) run it on NixOS. I declined as I'm just that strongly opposed to my work being used to train AI models and further entrench the enshittification of the internet. | | |
| ▲ | m000 4 hours ago | parent [-] | | Curious: So you only release your interpreter as a binary? Is there some license that explicitly forbids use for AI training? (Although it would probably be unclear if that is really enforceable until someone challenges it.) |
|
|
|
| ▲ | eviks 11 hours ago | parent | prev | next [-] |
| I suppose some people would see this as progress: fewer dependencies, more robust code (even if it’s a bit more verbose), quicker turnaround time than the old “search npm, find a package, read the docs, install it” approach. Why would randomized code be more robust?
Also, how is searching/reading the docs slower than checking the correctness of a randomized function? |
|
| ▲ | w10-1 18 hours ago | parent | prev | next [-] |
| TLDR: [AI promises] a future where we prize instant answers over teaching and understanding But what this article and the comments don't say: open-source is mainly a quality metric. I re-use code from popular open-source repo's in part because others have used it without complaints (or document the bugs), in part because people are embarrassed to write poor-quality open-source so it's above-par code, and in part because if there are issues in this corner of the world, this dependency will solve them over time (and I can watch and wait when I don't have time to fix and contribute). The quality aspect drives me to prefer dependencies over AI when I don't want full ownership, so I'll often ask AI to show open-source projects that do something well. (As an aside, this article is about AI, but AI is so pervasive now as an issue that it doesn't even need saying in the title.) |
|
| ▲ | hamdouni 8 hours ago | parent | prev | next [-] |
| The main point isn't about dependencies but loosing the mindset to learn from small domain problem |
|
| ▲ | kunley 7 hours ago | parent | prev | next [-] |
| The level of apologetism with regard to ai is depressing in this article. |
|
| ▲ | zkmon 19 hours ago | parent | prev | next [-] |
| Open source exists because coding was a significant effort and code was a thing of high value. Unsurprisingly companies hesitated to make the code public and free. All of this is changing now as coding has suddenly become trivial. So, yes, the mission of open source, in general, will be challenged. |
|
| ▲ | rglover 15 hours ago | parent | prev | next [-] |
| > Even now there’s a movement toward putting documentation in an llms.txt file, so you can just point an agent at it and save your brain cells the effort of deciphering English prose. (Is this even documentation anymore? What is documentation?) I look at it as why not have the best of both worlds? The docs for my JS framework all have the option of being returned as LLM-friendly text [1]. When I utilize this myself, it's to get help fleshing out skeleton code inside of an app built w/ my framework (e.g, Claude Sonnet w/ these docs in context build a nearly ~90-100% accurate implementation for most stuff I throw at it—anything from little lib stuff up to full-blown API endpoint design and even framework-level implementations of small stuff like helping to refactor the built-in middleware loading). It's not for a lack of desire to read, but rather a lack of desire to build every little thing from scratch when I know an LLM is perfectly capable (and much faster than me). [1] https://docs.cheatcode.co/joystick/ui/component/dynamic-page... |
|
| ▲ | smcameron 18 hours ago | parent | prev | next [-] |
| In the U.S., anything machine generated is uncopyrightable. Why would you put uncopyrightable code into your codebase? |
| |
| ▲ | gpm 18 hours ago | parent | next [-] | | Why wouldn't you? Your codebase (if you're a business) exists to make you money, people being able to copy some unknown portions of it without further license if they somehow legally get their hands on a copy of it seems entirely irrelevant. PS. I think this is much less clear and much less settled law than you are suggesting. | |
| ▲ | ebiester 18 hours ago | parent | prev | next [-] | | It's more nuanced. If I even have a few lines I can prove are mine, those parts are copywritable in the same way Pride and Prejudice is public domain but pride and prejudice and zombies is copyrighted. | |
| ▲ | siliconpotato 18 hours ago | parent | prev | next [-] | | Even worse...unmaintained code. Only the human-written one has a maintainer. The other one plagiariased by AI is instant legacy code | | |
| ▲ | exasperaited 18 hours ago | parent [-] | | > The other one plagiariased by AI is instant legacy code I have used this "instant legacy code" concept before. It's absolutely true, IMO. But people really, really, really hate hearing it. |
| |
| ▲ | hdgvhicv 18 hours ago | parent | prev | next [-] | | Autocomplete has been around for decades | |
| ▲ | ranger_danger 16 hours ago | parent | prev [-] | | I think if you dig a little deeper you will find that the answer is not so black and white. |
|
|
| ▲ | matheusmoreira 15 hours ago | parent | prev | next [-] |
| > I don’t know which direction we’re going in with AI Maybe programming languages will be designed for AIs in the future. Maybe they'll have features that make grafting unknown generated code easier. |
|
| ▲ | strogonoff 8 hours ago | parent | prev | next [-] |
| Chances are, even if you deliberately and strategically pick to work on an OSS project that you are positively sure an LLM can’t just spit out on command, it will be capable of doing so by the time you are close to completion. In that sense, one has to either be not inclined to question “what’s the point” or have a bit of gambling mentality in order to work on anything substantial. That’s not automatically a problem, however. The problem is that even if you do come up with a really cool idea that LLM is not capable of autocompleting, and you release it under a copyleft license (to ensure the project survives and volunteer contributor’s work is not adopted and extinguished by some commercial interest), it will get incorporated into its dataset regardless of the licensing, and thereafter the LLM will be capable of spitting it out and its large corporate operator will be able to monetise your code (allowing anyone with money wishing to build a commercial product based on it). |
|
| ▲ | anonzzzies 16 hours ago | parent | prev | next [-] |
| I would have and did write this instead of including it anyway. These small npms you would spend more time to look up than write are a pest. |
|
| ▲ | meken 15 hours ago | parent | prev | next [-] |
| It would be a cool feature of AI to include only the subset of the library you use and nothing else. |
|
| ▲ | levkk 19 hours ago | parent | prev | next [-] |
| Several issues: 1. Reducing dependencies is a wrong success metric. You just end up doing more work yourself, except you can't be an expert in everything, so your code is often strictly worse. 2. Regenerating the same solutions with a probabilistic machine will produce bugs a certain percentage of the time. Dependencies are always the same code (when versioned). 3. Cognitive overhead for human review is higher with LLM-generated libs, for no additional benefit. |
| |
| ▲ | jcelerier 19 hours ago | parent [-] | | > Reducing dependencies is a wrong success metric. You just end up doing more work yourself Except it's just not true in many cases because of social systems we've built. If I want to ship software to Debian I have to make sure that every single of my 3rdparty dependencies is registered and packaged as a proper debian package - a lot of time it will take much less work to rewrite some code than to get 25 100-lines-of-code micro-libraries accepted into debian. |
|
|
| ▲ | willtemperley 7 hours ago | parent | prev | next [-] |
| LLMs give us the opportunity to work on more complex projects and gain fuller understanding of the problem space and concepts. Or create tons of slop. Take your pick. |
|
| ▲ | shevy-java 18 hours ago | parent | prev | next [-] |
| > Claude’s version is pretty close to the blob-util version (unsurprising, since it was probably trained on it!). AI are thieves! > I don’t know which direction we’re going in with AI (well, ~80% of us; to the remaining holdouts, I salute you and wish you godspeed!), but I do think it’s a future where we prize instant answers over teaching and understanding. Google ruined its search engine years ago before AI already. The big problem I see is that we have become WAY too dependent on these mega-corporations. Which browser are people using? Typically chrome. An evil company writes the code. And soon it will fire the remaining devs and replace them with AI. Which is kind of fitting. > Even now there’s a movement toward putting documentation in an llms.txt file, so you can just point an agent at it and save your brain cells the effort of deciphering English prose. (Is this even documentation anymore? What is documentation?) Documentation in general sucks. But documentation is also a hard problem. I love examples. Small snippets. FAQs. Well, many projects barely have these. Look at ruby webassembly/wasm or ruby opal. Their documentation is about 99% useless. Or, even worse - rack in ruby. And I did not notice this in the past, in part because e. g. StackOverflow still worked, there were many blogs which helped fill up missing information too. Well all of that is largely gone now or has been slurped up by AI spam. > the era of small, low-value libraries like blob-util is over. They were already on their way out thanks to Node.js and the browser taking on more and more of their functionality (see node:glob, structuredClone, etc.), but LLMs are the final nail in the coffin. I still think they have value, but looking at organisations such as rubygems.org disrupt the ecosystem and bleeding it dry by kicking out small hobbyists, I think there is indeed a trend towards eliminating the silly solo devs who think their unpaid spare time is not worthy of anything at all, yet the big organisations eagerly throw down more and more restrictions onto them (my favourite example is the arbitrary 100k download limit for gems hosted at rubygems.org, but look at the new shiny corporate rules on rubygems.org - this is when corporations take over the infrastructure and control it. Ironically this also happened to pypi and they admit this indirectly: https://blog.pypi.org/posts/2023-05-25-securing-pypi-with-2f... - of course they deny that corporations control pypi now, but by claiming otherwise they admit it, because this is how hobbyists get eliminated. Just throw more and more restrictions at them without paying them. Sooner or later they decide to do something better with their time.) |
|
| ▲ | cynicalsecurity 20 hours ago | parent | prev [-] |
| He almost got it right. It's not just the fate of small open source. It's the fate of all programmers now. Why hire a programmer when an LLM costs less, works faster and makes less mistakes (OP compliments better error handling, read the article). Unless you are a product owner, you have paying clients that love you and your product and won't simply ditch it in favour of a new clone, you are really screwed. |
| |
| ▲ | vanschelven 20 hours ago | parent | next [-] | | "when an LLM costs less, works faster and makes less mistakes"... indeed, but it doesn't follow at all that it's the fate of all programmers _now_... at least in my experience none of these things are true ATM. | | |
| ▲ | PunchyHamster 19 hours ago | parent [-] | | Well, at the very least it costs less than asking intern to look for a lib doing something particular and give some examples... still about as accurate as the intern tho. | | |
| ▲ | skydhash 19 hours ago | parent [-] | | How many time has it happened for a company to actually ask an intern for a library? | | |
| ▲ | withinboredom 15 hours ago | parent [-] | | Um, isn't this what an intern is for? Or do you let them try to contribute to your core project? |
|
|
| |
| ▲ | dakiol 19 hours ago | parent | prev | next [-] | | So far I've never seen yet a non-programmer release production-grade code using only LLMs. There's just so much to care about (from security, deployments, secret management, event-driven architectures, and a large etc.) that "just" providing a prompt to create an "app" doesn't cut it. You need infra and non-engineers just don't know shit about infra (even if it's 99% managed), you need to deploy your llm-generated code in that infra; that should happen in a ci/cd probably. And what about migrations? Git? Who's setting up the api gateway? I don't mean to say that LLMs don't know how to do that, but you need to instruct them to do so, and even there, they will make silly mistakes and you need to re-instruct them or fix it. Prompting is just 50% of the work (and the easy part actually). Ask the Head of Product or whoever is there to deploy something valuable to production and maintain it for 6 months while not losing money. It's just not going to happen, not even with truly AGI. | |
| ▲ | 59nadir 19 hours ago | parent | prev | next [-] | | An LLM might be able to replace the majority of the code Sindre Sorhus has put out there, but it's probably a stretch to think that it could replace someone like John Carmack. Trivial NPM libraries were never needed, but LLMs really are the nail in the coffin for them even when it comes to the most incompetent programmers because now they can literally just ask an LLM to spit out the exact same thing. | |
| ▲ | RhythmFox 20 hours ago | parent | prev [-] | | He also points out a pointless type check in a type checked language... Your name is very accurate I must say. | | |
| ▲ | jazzypants 19 hours ago | parent [-] | | That type check is honestly not pointless at all. You can never be certain of your inputs in a web app. The likelihood of that parameter being something other than an arraybuffer is non-zero, and you generally want to have code coverage for that kind of undefined behavior. TypeScript doesn't complain without a reason. |
|
|