K2.5 was already pretty decent so I would try this. Starting at $15/month: https://www.kimi.com/membership/pricing

edit: Note that you can run it yourself with sufficient resources (e.g., companies), or access it from other providers too: https://openrouter.ai/moonshotai/kimi-k2.6/providers

▲

pbowyer 6 hours ago | parent | next [-]

What's the privacy/data security like? I can't find that on that page.

Edit: found it.

> We may use your Content to operate, maintain, improve, and develop the Services, to comply with legal obligations, to enforce our policies, and to ensure security. You may opt out of allowing your Content to be used for model improvement and research purposes by contacting us at membership@moonshot.ai. We will honor your choice in accordance with applicable law.

Section 3 of https://www.kimi.com/user/agreement/modelUse?version=v2

▲

gpm 5 hours ago | parent | next [-]

> We will honor your choice in accordance with applicable law.

So in other words only if you can point to a local law which requires them to comply with the opt out?

▲

jdasdf 4 hours ago | parent [-]

most laws enforce agreements.

	▲	gpm 4 hours ago \| parent [-]
		Yes... but the agreement only says they won't train on your data if the law is already preventing them from doing so.

▲

deaux 5 hours ago | parent | prev | next [-]

Yup, they train on your inputs and OpenRouter is complicit by claiming that Moonshot's ToS says that they don't. Contacted OpenRouter about this a while ago and was met with silence because it's bad for their business to stop lying about it.

▲

pixel_popping 5 hours ago | parent | prev [-]

You really rely on ToS from Anthropic/OpenAI to know if they use your prompts or not? It's on their servers, why wouldn't they use our data?

▲

SwellJoe 5 hours ago | parent | prev | next [-]

"sufficient resources" is going to be a lot of resources. I doubt this will run on even something like a Strix Halo or DGX Spark, even at 1-bit quantization. You'll need a 256GB or 512GB Mac Studio, or a monster GPU situation, to run it locally, I think, though quantized versions aren't showing up yet, to be sure.

	▲	4 hours ago \| parent [-]
		[deleted]

▲

wg0 6 hours ago | parent | prev [-]

How are the usage limits compared to Anthropic?

▲

greenavocado 6 hours ago | parent [-]

Anthropic has the worst usage limits in the industry

▲

andriy_koval 5 hours ago | parent [-]

gemini is worse imo

▲

deaux 5 hours ago | parent [-]

You're correct, Gemini chat limits are a joke at their chapest paid tier compared to both Claude and GPT. Especially crazy when you consider Gemini 3 Pro is more than twice as cheap as Opus 4.6 on the API. It's hard to run into pure chat limits on Claude even if you only use Opus on the cheapest tier, whereas with Gemini it's easy to hit.

Not sure about coding usage, Google being weird about these things I could see that quota being separate.

	▲	gessha an hour ago \| parent [-]
		I’m not sure what A/B test you’re part of but on Claude Code Pro, I hit every single one of my quotas without exception. If you analyze/process images it’s even worse: I hit rate limits first and if I use separate sessions, I hit my quotas too. I use up so many tokens that Jensen should hire me.