90% of Claude-linked output going to GitHub repos w <2 stars

Aurornis an hour ago | parent | next [-]

Perfect example of a base rate fallacy - https://en.wikipedia.org/wiki/Base_rate_fallacy

What percentage of GitHub activity goes to GitHub repos with less than 2 stars? I would guess it's close to the same number.

▲

levocardia an hour ago | parent | next [-]

My reaction as well -- I have a few dozen public repos of 100% human-written code, most are 0 stars!

▲

nickcw an hour ago | parent | next [-]

The first thing I do when I make a new repo is star it myself ;-)

	▲	Imustaskforhelp an hour ago \| parent [-]
		https://knowyourmeme.com/memes/obama-awards-obama-a-medal

▲

sleepybrett 8 minutes ago | parent | prev [-]

I have a few dozen org repos, of course none of them have stars, who stars their corporate repos?

▲

tlogan an hour ago | parent | prev | next [-]

The actual number is that 98% have less than 2 stars (0 or 1). About 90.25% has zero stars.

▲

ZeWaka a minute ago | parent | next [-]

I think this is useful in answering the grandparent comment's question: ┌──stars─┬──uniq(k)─┐

│ 1 │ 14946505 │

│ 10 │ 1196622 │

│ 100 │ 213026 │

│ 1000 │ 28944 │

│ 10000 │ 1847 │

│ 100000 │ 20 │

└────────┴──────────┘

▲

wetoastfood an hour ago | parent | prev | next [-]

How do you know that?

	▲	tlogan an hour ago \| parent [-]
		https://ghe.clickhouse.tech/

▲

Bratmon 7 minutes ago | parent | prev [-]

Wait, you just answered a completely different question and pretended it was relevant!

I can play that game too: The average elephant eats 500 pounds of vegetation a day, therefore most AI interaction on Github is fake.

▲

ttul an hour ago | parent | prev | next [-]

Yeah. Most of my public repos have 0 stars. Most of what I write sucks.

	▲	Joel_Mckay 31 minutes ago \| parent [-]
		Yeah, but knowing something sucks means you are probably reasonably competent at coding. =3 https://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect

▲

runarberg 32 minutes ago | parent | prev [-]

There is still a sampling bias if you compare blanket human written repos. I would guess people are far more likely to share their homework assignments, experiments, hackathon results, weekend toys, etc. as a public repo if they put some amount of work into it. I would guess minority of those would get any stars at all. If the whole thing was generated by AI in less then 20 minutes, I would guess they are more likely to simply throw it away when they are done with it.

Personally I think comparing github stars is always going to be a fraught metric.

▲

madrox an hour ago | parent | prev | next [-]

Already enough comments about base rate fallacy, so instead I'll say I'm worried for the future of GitHub.

Its business is underpinned by pre-AI assumptions about usage that, based on its recent instability, I suspect is being invalidated by surges in AI-produced code and commits.

I'm worried, at some point, they'll be forced to take an unpopular stance and either restrict free usage tiers or restrict AI somehow. I'm unsure how they'll evolve.

▲

louiereederson 35 minutes ago | parent | next [-]

The instability is related to their Azure migration isn't it? Cynically you could say it hasn't been helped by the rolling RIFs at Microsoft

	▲	madeofpalk 16 minutes ago \| parent [-]
		Does anyone actually know? So far I've just seen people guessing, and seeing that repeated.

▲

hungryhobbit an hour ago | parent | prev | next [-]

Or they'll just keep forcing policies that let them steal the code you post on GitHub (for their AI training), and make everyone leave that way.

▲

phantomCupcake 35 minutes ago | parent | prev [-]

This.

But also, GitHub profiles and repos were at one point a window into specific developers - like a social site for coders. Now it's suffering from the same problem that social media sites suffer from - AI-slop and unreliable signals about developers. Maybe that doesn't matter so much if writing code isn't as valuable anymore.

▲

monster_truck 7 minutes ago | parent | prev | next [-]

I cannot understate how much of an improvement that is. If I had a dollar for all the shit I made myself, the old fashioned way, that got 0 attention at all? I'd have enough for a month or two of claude

▲

furyofantares 2 hours ago | parent | prev | next [-]

100% of all code I have put on github, using claude or not, is on repos with zero stars.

▲

ramoz an hour ago | parent | prev | next [-]

Shout out to Broadwayscore by thomaspryor@github

At 2mo old - nearly a 1GB repo, 24M loc, 52K commits

https://github.com/thomaspryor/Broadwayscore

Polished site:https://broadwayscorecard.com/

▲

mjr00 an hour ago | parent [-]

I was really confused how this could be possible for such a seemingly simple site but it looks like it's storing + writing many new commits every time there's a new review, or new financial data, or a new show, etc.

Someone might want to tell the author to ask Claude what a database is typically used for...

	▲	a-dub 34 minutes ago \| parent [-]
		json in git for reference data actually isn't terrible. having it with the code isn't great, and the repo is massively bloated in other ways, but for change tracking a source of truth, not bad except for maybe it should be canonicalized.

▲

throwaway27448 an hour ago | parent | prev | next [-]

Do people really put weight in stars? It seems completely unrelated to anything but, well, popularity. Even when I modify other peoples' code I fork to a private repo and maintain my changes separately, and I'm fairly certain I have never starred a repo.

	▲	thorum an hour ago \| parent \| next [-]
		Stars have been useless as signals for project quality for a while. They’re mostly bought, at this point. I regularly see obviously vibe-coded nonsense projects on GitHub’s Trending page with 10,000 stars. I don’t believe 10,000 people have even cloned the repo, much less gotten any personal value from it. It’s meaningless.
	▲	ianbutler 21 minutes ago \| parent \| prev \| next [-]
		Maybe not to devs, but I've had VCs ask about them because of popularity so there you go it's a signal to someone. Whatever reaction you have to this know that my internal reaction and yours were probably close.
	▲	robarr an hour ago \| parent \| prev \| next [-]
		For example, it's used as a kind of internal bookmarking system. I don't necessarily star a repo because I think it has good code, but maybe a good idea or something related to something I'm interested in developing.
	▲	zadikian an hour ago \| parent \| prev [-]
		I've seen people "buy" stars enough not to look at them so closely. Maybe will consider whether it has 0-1 or 2-2M.

▲

xnyan 2 hours ago | parent | prev | next [-]

I have many GH repos, most have no stars. Probably because most of what I write is not very useful to other people due to quality or use case. I would say this is true of most fully human-created repos on GitHub.

▲

maxbeech an hour ago | parent | prev | next [-]

the more interesting signal in that data is about intent, not quality. most of these low-star repos probably aren't failed open source attempts - they're personal tools that were never meant to be shared.before ai-assisted coding, the effort-to-build ratio was high enough that most personal scripts stayed on a laptop or in a private gist. pushing to a public repo implied an implicit claim that someone else might want this. now the build cost is low enough that people just push things to git for their own version history and move on.what's actually happening is that git is becoming a personal dev journal as much as a collaboration platform. stars were always a weak proxy for value, but they're especially wrong for this use case.the 90% number probably also undercounts the real extent of this - most serious claude code usage is on private repos and internal tooling that never touches public github at all. the 50b lines stat would look very different if you could see total token output vs just github-public-linked output.

	▲	phantomCupcake an hour ago \| parent [-]
		It would be very interesting to see how much of this is the "audience of one" type of project - i.e. personal scripts - vs new developers/vibe coders trying to start an app. I have definitely been surprised by the scale of some of the repos that seem to be vibe-coded. People who seem to have no history in development are building game engines, and payroll systems, and Broadway review websites. Unfortunately that type of analysis would take a bit more work, but I think the repo info and commit messages could probably be used to do that.

▲

hk1337 20 minutes ago | parent | prev | next [-]

How long does it normally take projects to get stars though? You're not going to have a project with 100+ stars overnight or even within a month, you have to promote the project?

	▲	JanisErdmanis 15 minutes ago \| parent [-]
		Depends widely on the target audience. In my case, targeting Julia developers who want to package their applications into installers to reach 100 stars took 2 years - https://peacefounder.org/AppBundler.jl. If I were to target Python developers, I would have many more stars.

▲

embedding-shape 2 hours ago | parent | prev | next [-]

I'd betcha a lot more than 90% goes to repositories without any stars at all, or even public code!

	▲	phantomCupcake an hour ago \| parent [-]
		Absolutely! I think the real stats will far exceed what we can see on public GitHub. That said, going through some of the top "performers" by commit and line count - I am surprised by how many people have all their code in public repos.

▲

mikkupikku an hour ago | parent | prev | next [-]

Maybe because people are using claude to to write code for themselves, to scratch their own itch, and upload it to the world just because. The value of code can't be measured in star counts.

▲

chrisweekly an hour ago | parent | prev | next [-]

Even if that stat were compared directly to the base rate (human output), it could easily be explained by correlating strongly with Claude usage skewing towards new repos.

▲

dev_l1x_be an hour ago | parent | prev | next [-]

Did we democratise software engineering? Seriously, I created a bunch of tools that I find useful without the bloated framework issues that are present in software nowadays. Jokes on me if something does not work.

▲

anon7000 an hour ago | parent | prev | next [-]

The HN headline is at least misleading, because I suspect a majority of Claude usage is at the enterprise level (deep pockets), which goes to private GitHub repos.

▲

theteapot 37 minutes ago | parent | prev | next [-]

Why is this interesting?

	▲	Joel_Mckay 20 minutes ago \| parent [-]
		The LLM content piracy to isomorphic plagiarism business loop is unsustainable. Yet for context search it is reasonably useful. =3 https://www.youtube.com/watch?v=T4Upf_B9RLQ

▲

tombert 2 hours ago | parent | prev | next [-]

I mean, most of the code that I have written to Github with normal human intelligence also goes to Github repos will less than two stars. They're usually repos that I create and no one else touches.

	▲	moritzwarhier 2 hours ago \| parent [-]
		I'd even say the opposite outcome would be far more concerning... for GitHub's concept of free hosting for OSS git repositories as a form of Social Media. Their rent depends on star signals, so to speak.

▲

Computer0 an hour ago | parent | prev | next [-]

I have a star on one of my repos. Almost all of my work is only relevant to me or is internal to my org.

▲

user3939382 2 hours ago | parent | prev | next [-]

At a glance this may read as “most of this code isn’t valuable to others” but reality is probably complected with “this type of code is reducing the need for shared libraries”.

▲

Vektorceraptor 2 hours ago | parent | prev | next [-]

guilty :) 1 Star here - and even that is worthless

▲

louiereederson 4 hours ago | parent | prev | next [-]

Toggling the stars shows 50b lines of code created across all projects, only 5b on projects with 2+ stars since Claude Code launch. Kind of eye opening where these Claude Code tokens are going.

Came across this from this ShowHN post yesterday https://news.ycombinator.com/item?id=47501348

▲

phantomCupcake an hour ago | parent [-]

Thanks for starting the conversation and sharing my dashboard. :)

▲

louiereederson 20 minutes ago | parent [-]

I hope you don't mind, I thought this was a really valuable dashboard.

	▲	phantomCupcake 16 minutes ago \| parent [-]
		Not at all! The ShowHN didn't really get a lot of feedback but this thread has already given me a lot to think about adding/improving.

▲

mrlonglong 24 minutes ago | parent | prev [-]

Codeberg if you hate AI.