Remix.run Logo
tehjoker 3 hours ago

How is this that different from a mixture of experts in a single model? There are some differences in training etc but it’s not that different at a fundamental level. You need to solve the issue with a single model.

The multiple model concept feels to me like a consumer oriented solution, its trying to fix problems with things you can buy off the shelf. It’s not a scientific or engineering solution.