wolttam 7 hours ago
Deepinfra's implementation of it is not correct. Thinking is not preserved, and they're not responding to the issue I submitted about it. I also regularly see Deepinfra slow to an absolute crawl; I've actually gotten more consistent performance from Z.ai. I really liked Deepinfra, but something doesn't seem right over there at the moment.
cmrdporcupine 6 hours ago
Damn. Yeah, that sucks. I played with it again earlier and it did seem to slow down. It's frankly a bummer that there doesn't seem to be a better serving option for GLM 5.1 than Z.ai, which seems to have reliability and cost issues.