Remix.run Logo
decide1000 4 hours ago

With scraper tech I mean a rust binary that is able to download and process thousands concurrent urls (millions per hour). Not to the same domain obviously. Paying more is not the issue here, its more the idea that an AI decides on what part of the spectrum I operate. Why is it opinionated? I am not doing anything wrong, why does it make me feel like I have to defend myself.

mtndew4brkfst 4 hours ago | parent [-]

What is the specific concrete purpose of downloading millions of URLs per hour across different domains if it's "not doing anything wrong"?

decide1000 3 hours ago | parent | next [-]

Mostly ecommerce and pricing data. I work for marketplaces, brands, retail stores and even our own saas competitors. We match the EAN (gtin) to the correct SKU within seconds (Google Shopping, Amazon, etc). Part of it is our own trained ML models.

big-and-small 4 hours ago | parent | prev [-]

Might be it for scrapping content for training an LLM? Oh no only big tech allowed to do it...