▲ | pbd 7 days ago | ||||||||||||||||
From a systems design perspective, $3,000 per book makes this approach completely unscalable compared to web scraping. It's like choosing between a O(n) and O(n²) algorithm - legally compliant data acquisition has fundamentally different scaling characteristics than the 'move fast and break things' approach most labs took initially. | |||||||||||||||||
▲ | BoorishBears 7 days ago | parent | next [-] | ||||||||||||||||
I don't know if anyone has actually read the article or the ruling, but this is about pirating books. Anthropic went back and bought->scanned->destroyed physical copies of them afterward... but they pirated them first, and that's what this settlement is about. The judge also said: > “The training use was a fair use,” he wrote. “The technology at issue was among the most transformative many of us will see in our lifetimes.” So you don't need to pay $3,000 per book you train on unless you pirate them. | |||||||||||||||||
| |||||||||||||||||
▲ | thfuran 6 days ago | parent | prev | next [-] | ||||||||||||||||
Isn't a flat price per book quite plainly O(n)? If not, what's n? | |||||||||||||||||
▲ | whimsicalism 7 days ago | parent | prev [-] | ||||||||||||||||
more of a large difference in constant factor, like a galactic algorithm for data trawling |