▲ | extr 7 days ago | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nice release. Part of the problem right now with OSS models (at least for enterprise users) is the diversity of offerings in terms of: - Speed - Cost - Reliability - Feature Parity (eg: context caching) - Performance (What quant level is being used...really?) - Host region/data privacy guarantees - LTS And that's not even including the decision of what model you want to use! Realistically if you want to use an OSS model instead of the big 3, you're faced with evalutating models/providers across all these axes, which can require a fair amount of expertise to discern. You may even have to write your own custom evaluations. Meanwhile Anthropic/OAI/Google "just work" and you get what it says on the tin, to the best of their ability. Even if they're more expensive (and they're not that much more expensive), you are basically paying for the priviledge of "we'll handle everything for you". I think until providers start standardizing OSS offerings, we're going to continue to exist in this in-between world where OSS models theoretically are at performance parity with closed source, but in practice aren't really even in the running for serious large scale deployments. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | coderatlarge 7 days ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
true but ignores handing over all your prompt traffic without any real legal protections as sama has pointed out: [1] https://californiarecorder.com/sam-altman-requires-ai-privil... | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | wkat4242 7 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Gpt-oss comes only in 4.5 bit quant. This is the native model, so there's no fp16 original |