Remix clone Hacker News

new | show | ask | jobs Github

	▲	KronisLV 6 days ago
		The GPT-OSS-120B release was pretty decent and you could run it on vLLM, Ollama and a bunch of other stuff on day one, despite MXFP4, are you not entertained? I mean, it's even close to GPT-5 mini in some benchmarks: https://llm-stats.com/ As for the Chinese models, yes, there are quite a few good ones. For programming and development, my current daily driver is the Qwen3 Coder 480B model: https://qwen3lm.com/ I have it running on Cerebras: https://www.cerebras.ai/pricing Personally I think Claude still has the best results, but Qwen3 is loosely in the same ballpark and Cerebras inference is measured in thousands of tokens per second, in addition to giving me 24M tokens per day for 50 bucks a month in total. That was enough to get me to switch over. Aside from that the GLM-4.5 is pretty good: https://glm45.org/ And so is ERNIE 4.5: https://ernie.baidu.com/blog/posts/ernie4.5/ Either way, happy to see what the future holds for Mistral, it's cool to have EU options too! Either way, more competition prevents complacency and stagnation, and should be a good thing for everyone.
	▲	6 days ago \| parent [-]
		[deleted]