|
| ▲ | reissbaker 2 hours ago | parent | next [-] |
| I'm biased because I run an inference company, https://synthetic.new. That being said, I think we're pretty good at serving GLM-5.2 (and other models, like Kimi K2.7!), and our privacy policy is quite good: zero data retention for prompts and completions on API requests. Our average streaming TPS for GLM-5.2 (i.e., tokens per second after factoring out time-to-first-token, which varies based on geography) is 97 tps over the last 24 hours, although it's lower at peak traffic in the mornings PST, when it's 50-70 tps. We're also subscription-based, which is nicer for coding than per-token billing from, e.g., Fireworks. |
| |
| ▲ | yieldcrv 2 hours ago | parent [-] | | got a 500 error page on the site's chat, but I'll try the API | | |
| ▲ | reissbaker 43 minutes ago | parent [-] | | Interesting: I don't see anything in our error logs, but we could be missing something (personally, the chat works for me and for my unsubscribed test account). If you email us at hi@synthetic.new, though, we should be able to fix anything you're running into! |
|
|
|
| ▲ | eli 6 hours ago | parent | prev | next [-] |
| Fireworks.ai is solid. And if you care more about speed than cost, they have a "fast" variant that I think just throws more hardware at the model for about 2x the cost. |
| |
| ▲ | david-gpu 4 hours ago | parent [-] | | The privacy policy indicates that they track you and share your data with ad networks like Meta. Yikes. | | |
| ▲ | pranaybhatia an hour ago | parent [-] | | Hi, PM at Fireworks here. We have zero data retention, so we do not log any of your API requests. I realize you're talking about website activity, which is different; we'll check on that and post an update too. |
|
|
|
| ▲ | pbgcp2026 40 minutes ago | parent | prev | next [-] |
| Run it on Amazon Bedrock or GCP Vertex AI. No problems at all. |
|
| ▲ | Onavo 3 hours ago | parent | prev [-] |
| > the Chinese one won’t reply to subpoenas so thats a value add tbh |
| That's not a given. They are not quite like the Russians. A lot of the governments in Asia are overly pragmatic and will happily strong-arm their companies into throwing users under the bus for the sake of a trade deal. There's a reason why Snowden ran to the Russians and not the Chinese. Also, if they have any subsidiaries in the US, they may not have a choice in the matter. |