ummzokbro 3 hours ago

This.

GLM5 also had this issue. When it was free on OpenRouter / Kilo the model was rock solid, though it did degrade gracefully after 100k tokens. Same at launch with Zai, aside from regular timeouts.

Somewhere around early to mid March, zai did something significant to GLM5, like quantizing the KV cache, the model weights, or both.

After that it's been Russian roulette. Sometimes it works flawlessly, but very often (1/4 or 1/5 of the time) thinking tokens spill into the main context, and if you don't spot it happening it can do real damage: heavily corrupting files, deleting whole directories.
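
For what it's worth, a rough guard along these lines can catch some of the spills before they touch disk. This is only a sketch and not a fix: it assumes the reasoning is delimited by `<think>` tags (check what your provider actually emits), and `has_leaked_thinking` and `safe_to_apply` are hypothetical helper names, not part of any tool.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think>; adjust the pattern
# to whatever marker your provider actually uses.
THINK_MARKERS = re.compile(r"</?think>", re.IGNORECASE)


def has_leaked_thinking(text: str) -> bool:
    """Return True if the visible output contains a stray reasoning marker."""
    return bool(THINK_MARKERS.search(text))


def safe_to_apply(model_output: str) -> bool:
    """Gate destructive actions (file writes, deletes) on clean output."""
    return not has_leaked_thinking(model_output)
```

Call `safe_to_apply` on the response before the agent writes or deletes anything, and bail out to a manual review if it returns False. It won't catch spills that arrive without markers.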

You can see the pain by visiting the zai Discord: it's filled with reports of the issue, yet there's radio silence from zai.

Tellingly, despite the model being open source, not a single provider will sell you access to it at anything approaching the plans zai offers. The numbers just don't work, so your choice is either to pay significantly more per token and get reliability, or to put up with the bait and switch.