| ▲ | oblio 3 hours ago | |
This is super hilarious :-))) Do you think creating the orders of magnitude of content the internet produced organically and which LLM creators are stealing is cheap? If they actually have to pay for content creation while competing with content creators on the you know, content creation front via LLM-generation, the entire business model of LLMs collapses. You can't have the mountains of data needed for LLMs in the decades to come, if your LLMs put the writers and artists out of work. | ||
| ▲ | aspenmartin an hour ago | parent [-] | |
It’s literally how these models are trained today. They of course use open source data but that’s no longer the most important source, it’s high quality prompts and verifiable tests and a lot of inference compute. They also have massive flywheels from users from which they can mine good data or at the very least again good prompts which can be just as important. | ||