| ▲ | Kirby64 an hour ago | |||||||
Why are you representing this as such a binary here? For SLM we don’t need the Taalas stuff at all. Just run it locally on your own device if it’s truly a small model. And there’s plenty of larger models that can be run on-premise just fine. I think it’s impressive that a frontier model can achieve 750t/s. That’s all. You can get similar insane token speeds from other open weight models too. | ||||||||
| ▲ | windexh8er an hour ago | parent [-] | |||||||
The irony here is, according to you, my take is the binary one. When your response is: well, we can all just run it on our devices - we don't need any other options! You seem to be cool with a very small and gated ecosystem with whatever tech billionaires want you to have access to. I grew up in the era where compute was diverse and open. You may think this is OK, but it's not. The more options we have and the more diversified they are the better tech will move back towards. I'm not the one with the myopic view here. Enjoy your "on-device" models over in your utopia of a walled garden. | ||||||||
| ||||||||