| ▲ | csunoser 5 hours ago | |
Yes*. At least from my limited usage of deepseek-flash for a few billion tokens on openrouter, the cache-hit rate is >95%. And I simply used the claude code harness pointed at the openrouter anthropic compatible endpoint with no fluff. | ||
| ▲ | port11 36 minutes ago | parent | next [-] | |
Did you get proper tool use? Some CC-driven models seem to get a bit off when it comes to MCP usage. For example: I really struggled to get Kimi to use Serena, which I think ended up costing too many tokens. | ||
| ▲ | schaefer 5 hours ago | parent | prev [-] | |
thank you! | ||