Remix.run Logo
larodi 7 hours ago

after 25 years wikipedia showed what it truly was created for, by selling the content for training. otherwise - okay, this was a cool project, perhaps we need better. like federated, crypto-signed articles that once collected together, @atproto style, produce the article with notable changes to it.

RestartKernel 6 hours ago | parent | next [-]

Their enterprise offering is more for fresh retrieval than training. For training, you can just download the free database dump — one you would inadvertently end up recreating if you were to use their enterprise APIs in a (pre-)training pipeline.

armchairhacker 6 hours ago | parent | prev | next [-]

Context: https://arstechnica.com/ai/2026/01/wikipedia-will-share-cont...

tl;dr: Wikipedia is CC and has public APIs, but AI companies have recently started paying for "enterprise" high-speed access.

Notably, the enterprise program started in 2021 and Google has been paying since 2022.

giuliomagnifico 7 hours ago | parent | prev [-]

You’re saying Wikipedia was created 25 years ago to sell its content to train LLMs that didn’t even exist?! I doubt it…

littlestymaar 6 hours ago | parent [-]

“Jimmy Wales is even more of a visionary than we thought”