training models with scraped content vs scraping output from trained models is completely different. the output is not the original scraped content. it is synthesized