Remix.run Logo
mtndew4brkfst 4 hours ago

What is the specific concrete purpose of downloading millions of URLs per hour across different domains if it's "not doing anything wrong"?

decide1000 3 hours ago | parent | next [-]

Mostly ecommerce and pricing data. I work for marketplaces, brands, retail stores and even our own saas competitors. We match the EAN (gtin) to the correct SKU within seconds (Google Shopping, Amazon, etc). Part of it is our own trained ML models.

big-and-small 4 hours ago | parent | prev [-]

Might be it for scrapping content for training an LLM? Oh no only big tech allowed to do it...