Remix clone Hacker News

new | show | ask | jobs Github

	▲	kurthr 2 hours ago
		Are there quantized (eg 4bit) models available yet? I assume the training was done in BF16, but it seems like most inference models are distributed in BF8 until they're quantized. edit ahh I see it on huggingface: https://huggingface.co/mlx-community/Olmo-3-1125-32B-4bit