Ronsenshi 8 hours ago
One thing about Google is that many anti-scraping services explicitly allow access to Google and maybe a couple of other search engines. Everybody else gets to enjoy the Cloudflare captcha, even when crawling at reasonable speeds. Rules for thee but not for me.
chii 8 hours ago
> many anti-scraping services explicitly allow access to Google and maybe couple of other search engines.

Because Google (and the couple of other search engines) provide enough value to offset their crawlers' resource consumption.
ehhthing 7 hours ago
You say this like robots.txt doesn't exist.
ErroneousBosh an hour ago
Why are you scraping sites in the first place? What legitimate reason is there for you doing that?