| ▲ | jqpabc123 5 hours ago | |
Dumb idea --- how about if we limit local models to specific domains --- medicine for example. Most doctors don't care much about engineering or accounting or software development or 10000 other things that big vendor models address. This area is yet to be really explored. Nvidia aims to provide the hardware to do so. | ||
| ▲ | CamperBob2 an hour ago | parent [-] | |
That's a fairly obvious idea, and unfortunately it doesn't seem to pan out. Trying to specialize an LLM in one area harms its 'cognition' in all areas. For instance, if you train a coding model without all the Shakespeare and soap operas and Wikipedia and pirated Stephen King books and ancient Roman history and whatever, you end up with a worse coding model. I'm not sure anyone really understands why. | ||