Remix.run Logo
jrumbut a day ago

This is a great point. I'm kind of surprised there isn't a greater proliferation of open source models to do things the public ones won't. I know such things exist, but imagine how many web browsers there would be if all the mainstream ones had the same content restrictions as LLMs.

I guess since training them does take cash that raises the bar for what people will do as a prank or on principle.

adrian_b a day ago | parent [-]

Training is time-consuming and/or expensive but it is not the main blocker.

The main problem is obtaining a big enough training data set. Now, unless you are someone like Google or Microsoft, it has become much harder to scrap data from the Internet than by the time when OpenAI and Anthropic got most of their data.