Remix.run Logo
csunoser 5 hours ago

Yes*. At least from my limited usage of deepseek-flash for a few billion tokens on openrouter, the cache-hit rate is >95%. And I simply used the claude code harness pointed at the openrouter anthropic compatible endpoint with no fluff.

port11 36 minutes ago | parent | next [-]

Did you get proper tool use? Some CC-driven models seem to get a bit off when it comes to MCP usage. For example: I really struggled to get Kimi to use Serena, which I think ended up costing too many tokens.

schaefer 5 hours ago | parent | prev [-]

thank you!