|
| ▲ | Gigachad 11 hours ago | parent | next [-] |
| This stuff tends to happen by accident. Some org has an app that automatically downloads the dataset if it's missing, which is helpful for local development. Then it gets loaded into CI, and no one notices that it's downloading that dataset on every single CI run. |
|
| ▲ | stevage 9 hours ago | parent | prev | next [-] |
| Let's say you're working on an app that incorporates some Italian place names or roads or something. It's easy to imagine how, when you build the app, you want to download the Italian region data from geofabrik and then process it to extract what you want into your app. You script it, you put the script in your CI... and here we are: > Just the other day, one user has managed to download almost 10,000 copies of the italy-latest.osm.pbf file in 24 hours! |
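A minimal sketch of how such a script could avoid the repeat downloads, assuming the CI runner keeps the destination directory between runs (if the workspace is wiped every time, this check alone does nothing). The names `is_fresh` and `fetch_if_stale` and the 24-hour default are made up for illustration and are not taken from any geofabrik tooling.

```python
import os
import time
import urllib.request
from typing import Callable, Optional


def is_fresh(path: str, max_age_seconds: float, now: Optional[float] = None) -> bool:
    """Return True if `path` exists and was modified within `max_age_seconds`."""
    if not os.path.exists(path):
        return False
    current = time.time() if now is None else now
    return (current - os.path.getmtime(path)) <= max_age_seconds


def _download(url: str, dest: str) -> None:
    # Download to a temporary name first so an interrupted transfer
    # never leaves a truncated file that looks like a valid local copy.
    tmp = dest + ".part"
    urllib.request.urlretrieve(url, tmp)
    os.replace(tmp, dest)


def fetch_if_stale(
    url: str,
    dest: str,
    max_age_seconds: float = 24 * 3600,  # hypothetical default: refresh once a day
    download: Callable[[str, str], None] = _download,
) -> bool:
    """Download `url` to `dest` only when no fresh local copy exists.

    Returns True if a download actually happened, False if the copy was reused.
    """
    if is_fresh(dest, max_age_seconds):
        return False
    download(url, dest)
    return True
```

Called from a build script, `fetch_if_stale(url, "data/italy-latest.osm.pbf")` would hit the server at most once per day per runner, as long as `data/` survives between runs.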
|
| ▲ | ahlCVA 8 hours ago | parent | prev | next [-] |
| Whenever people complain about the energy usage of LLM training runs, I wonder how this stacks up against the energy we waste by pointlessly redownloading/recompiling things (even large things) all the time in CI runs. |
|
| ▲ | comprev 5 hours ago | parent | prev | next [-] |
| Optimising CI pipelines has been a strong aspect of my career so far. Anybody can build a pipeline to get a task done (there are thousands of quick & shallow howto blog posts), but doing this efficiently, so it becomes a flywheel rather than a blocker for teams, is the hard part. It's not just caching but optimising job execution order and downstream dependencies too. The faster it fails, the faster the developer feedback, and the faster a fix can be introduced. I quite enjoy the work and am always learning new techniques to squeeze out extra performance or save time. |
|
| ▲ | raverbashing 9 hours ago | parent | prev | next [-] |
| Also for some reason, most CI runners seem to cache nothing except for that minor thing that you really don't want cached. |
| |
| ▲ | bombcar 5 hours ago | parent [-] |
| This is exactly it - you can easily cache all the wrong things, cache the very code you wanted to change, or cache nothing but one small critical file nobody knows about. No wonder many just turn caching off entirely at some point and never turn it back on. |
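One common way to avoid caching the wrong things is to derive the cache key from the contents of the files that should invalidate it (a lockfile, say), so the cache is reused only while those inputs are unchanged. A rough sketch, not tied to any particular CI system; `cache_key` and its parameters are hypothetical names for illustration.

```python
import hashlib
from typing import Iterable


def cache_key(prefix: str, input_paths: Iterable[str]) -> str:
    """Build a cache key that changes whenever any listed input file changes."""
    digest = hashlib.sha256()
    # Sort so the key does not depend on the order the paths were given in.
    for path in sorted(input_paths):
        digest.update(path.encode("utf-8"))
        digest.update(b"\0")
        with open(path, "rb") as handle:
            # Read in chunks so large inputs do not have to fit in memory.
            for chunk in iter(lambda: handle.read(65536), b""):
                digest.update(chunk)
        digest.update(b"\0")
    # A shortened hex digest keeps the key readable in CI logs.
    return f"{prefix}-{digest.hexdigest()[:16]}"
```

The point of the design is that the key lists its inputs explicitly: anything not named in `input_paths` cannot silently invalidate the cache or keep it alive.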
|
|
| ▲ | mschuster91 9 hours ago | parent | prev [-] |
| CI itself doesn't have to be a waste. The problem is most people DGAF about caching. |