| ▲ | Kiro 5 hours ago | |
I remember when you used to get scolded on HN for preventing scrapers or bots. "How I access your site is irrelevant". | ||
| ▲ | elashri an hour ago | parent | next [-] | |
I have a side project as an academic that scrape a couple of academic jobs sites in my field and then serve them in static HTML page. It is running using github action and request every 24 hours exactly one time. It is useful for me and a couple of people in my circle. I would consider this to be fine and within the reasonable expectations. Many projects rely on such scenarios and people share them all the time. It is completely different if I am hitting it looking for WordPress vulnerabilities or scraping content every minute for LLM training material. | ||
| ▲ | hollow-moe 2 hours ago | parent | prev | next [-] | |
There's this and that. "How I [i.e. an individual human looking for myself] access your site is irrelevant." and "How I [i.e. an AI company DDOSing (which is illegal in some places btw) trying to maximize profit and offloading cost to you] access your site is irrelevant." When you get paid big buck to make the world worse for everyone it's really simple forgetting "little details". | ||
| ▲ | Analemma_ 42 minutes ago | parent | prev [-] | |
To me that's the one of the most depressing developments about AI (which is chock-full of depressing developments): that its mere existence is eroding long-held ethics, not even necessarily out of a lack of commitment but out of practical necessity. The tech people are all turning against scraping, independent artists are now clamoring for brutal IP crackdowns and Disney-style copyright maximalism (which I never would've predicted just 5 years ago, that crowd used to be staunchly against such things), people everywhere want more attestation and elimination of anonymity now that it's effectively free to make a swarm of convincingly-human misinformation agents, etc. It's making people worse. | ||