>Problem 3: "you'll hit the 200k token limit..." ... Suggestion: use 1 million context window LLMs.

Yes, if the model someone is using only has a 200k token limit, that would immediately suggest to me that it really isn't a sophisticated enough model.

Most of my coding sessions end up at about 350k tokens by the time I finish, so they wouldn't even fit in a 200k context. And that isn't counting the cache reads by subagents, etc.

It's worth spending some time with the best Opus / GPT model to at least get a sense of what the frontier is like.

jeffyaw 5 hours ago

MiniMax M3 has a 1M token context window, so I'm not sure how OP is hitting this 200k limit. Maybe it's the plan they're on, or some setting in some layer of whatever their dev tooling is using.

	bel8 3 hours ago
		Yeah, it's probably some free or entry-level LLM service. Even DeepSeek v4 Flash has a 1 million token context size.