Remix.run Logo
BobbyTables2 4 hours ago

Is that even a lot?

Spread over a year, roughly estimating a generous 4 kbytes of data per commit, comes out to a throughput of a little under 2 MB/s.

Of course, it isn’t spread out uniformly and there is also a lot of hashing and other things going on.

Maybe pulls and clones drive more I/O ?

3eb7988a1663 4 hours ago | parent [-]

I suspect there is a cacophony of work that happens when a commit hits the server. That request needs to get replicated, git repositories need to be repacked, pull requests need to calculate diffs, CI jobs need to execute, on and on.

That's also just assuming the good-faith usage. There are probably plenty of adversarial and poorly behaved scrapers that are putting additional load on the system.

jamesfinlayson 3 hours ago | parent [-]

Recalculate percentage of each language in the repo, recalculate top contributors, recalculate the stats for the committer's profile etc etc.