hsaliak 2 hours ago

It takes your query, computes the complexity of the request, and tries to route it to the appropriate model. There is a /manual command, I think, to pick the right model yourself.

They mask the 429s well in Gemini CLI: if an endpoint is rate limited, they try another endpoint, or route to another model, to keep service availability up.
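That kind of 429 masking amounts to a fallback loop over endpoints. A minimal sketch, with the API call injected as a parameter (everything here is hypothetical, not Gemini CLI's real code):

```python
class RateLimited(Exception):
    """Stand-in for an HTTP 429 response from one endpoint."""

def call_with_fallback(prompt, endpoints, call):
    """Try each endpoint in order, falling through on 429s.

    `call(endpoint, prompt)` is whatever actually hits the API;
    it is injected so the fallback logic stays testable.
    """
    last_error = None
    for ep in endpoints:
        try:
            return call(ep, prompt)
        except RateLimited as err:
            last_error = err  # this endpoint is throttled; try the next
    # Every endpoint was rate limited: surface the last 429.
    raise last_error if last_error else RuntimeError("no endpoints configured")
```

From the user's point of view the 429 disappears as long as at least one endpoint or fallback model is still serving.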

Your experience with the 429s is consistent with mine - the 429s are the first thing they need to fix. Fix that and they have a solid model at a good price point.

I use my own coding agent (https://github.com/hsaliak/std_slop), and not being able to bring my (now cancelled) Google AI account to it is a bummer.

I'd still use it with the Code Assist Standard license if the Google Cloud API subscription allows for it, but I haven't found any clarification on that.

tempest_ an hour ago | parent

> It takes your query, computes the complexity of the request, and tries to route it to the appropriate model. There is a /manual command, I think, to pick the right model yourself.

That is what it should do, but no model newer than 2.5 is shown in /model, and it always picks a 2.5 model. I've enabled preview models in the Google Cloud project as well.

If I pass the 3 model as a start parameter, it shows 3 in the lower-right corner, but it is still using 2.5.

I know Google has issues dealing with paying customers, but the current state is a shit show. If you go to the gemini-cli repo, it's a deluge of issues and AI slop. It seems there is a cadre of people jumping to be the first to pump an issue into Claude and get some sort of PR clout.

It might be good, but it needs more time to cook, or they need to take a step back and evaluate what they should consider a paid product.