Remix.run Logo
robotswantdata 4 hours ago

How does that work if the scraper takes a screenshot to feed to a LLM or OCR?

yummypaint 4 hours ago | parent [-]

That seems like a very expensive way to crawl the internet

robotswantdata 2 hours ago | parent [-]

Scrape normally collect emails, if no email seen take screenshot and OCR OCR is cheap and REGEX is cheap