▲ | keiferski 2 days ago | |
What is the long term plan for data acquisition? 1. Use existing websites for training data 2. Replace search traffic with AI prompts, thereby destroying the economic incentive for websites to publish data 3. ? | ||
▲ | ethan_smith 2 days ago | parent [-] | |
Step 3 is "train on synthetic data generated by previous AI models" - creating an inevitable quality death spiral as each generation trains on increasingly derivative content. |