Remix.run Logo
bilekas 7 hours ago

I have my worries about Internet Archive more and more recently.

I'm wondering though is there any decentralized IPFS or P2P Archive of the entire archive that can be helped with for preservation ?

https://www.wired.com/story/the-internets-most-powerful-arch...

sdellis 7 hours ago | parent [-]

According to the Wikipedia page, it seems that copies of the archive are stored around the world.

LOCKSS is a decentralized strategy for preservation which includes archival copies at remote sites. It has been in use for a very long time. I feel like preservation via IPFS would introduce quite a bit of risk to the goal.

bityard 6 hours ago | parent | next [-]

Yeah, there is some popular misunderstanding about what IPFS is... a lot of people seem to think its essentially free or subsidized distributed cloud storage. But the more you dig into it, the more you realize it's just a fairly inefficient caching system.

LOCKSS looks interesting but it seems like it's exclusively for libraries.

badlibrarian 6 hours ago | parent | prev [-]

I can find no current public document from the Internet Archive explaining what is backed up, where, or at what redundancy level.

From a 2016 blog post:

"Do you do backups too, for example to guard against corrupt data getting mirrored across both copies, or accidental deletion?"

John Gonzalez, the author and IA infrastructure lead, replied:

"We have done experiments to confirm that we can back up large portions of our corpus... but this is not a regular practice for us at this time."

https://blog.archive.org/2016/10/25/20000-hard-drives-on-a-m...