Remix.run Logo
tardedmeme 7 hours ago

They use different user-agent strings. The crawlers obfuscate themselves and use residential proxies. The agents call themselves ChatGPT-User. Of course Cloudflare wants OpenAI to pay them for not blocking ChatGPT-User by default.

faangguyindia 7 hours ago | parent [-]

It's true, crawlers used for AI training don't say they are crawlers at all.