| ▲ | ffsm8 3 hours ago | |
I'm not an industry insider and not the source of this fact, but it's been previously stated that traffic costs to fetch the current data for each training run is cheaper then caching it in any way locally - wherever it's a git repo, static sites or any other content available through http | ||
| ▲ | pm215 3 hours ago | parent [-] | |
This seems nuts and suggests maybe the people selling AI scrapers their bandwidth could get away with charging rather more than they do :) | ||