Remix.run Logo
jdiff 8 hours ago

If they were any good at it, they'd have blocked the Internet Archive via robots.txt. For some inexplicable reason, IA responds to that by wiping out past, present, and future archivals of that site. They haven't taken that easy step, so I doubt they'd go the further, more involved step of focusing on this smaller actor.

codedokode 7 hours ago | parent | next [-]

IA also blocks some content in Russia, for example this [1] says: "This URL has been excluded from the Wayback Machine in your region.". I was sincerely surprised to learn that while not paying much attention to US copyright law, they have high respect for messages from Vladimir.

(in case someone is curious what about is that article, it is a fictional comparison of life of a fictional character in Springfield, USA and Chusovoy, USSR in 80s and I cannot even understand why it was banned in Russia)

[1] https://web.archive.org/web/20250418160713/https://habr.com/...

wl 7 hours ago | parent | prev [-]

The Wayback Machine has ignored robots.txt for a few years at this point. The only way to get them to stop scraping or remove content is by asking them directly.