Remix.run Logo
Save Claude Code Tokens with Smart Routing(github.com)
11 points by FrancescoMassa 13 hours ago | 4 comments
nithiink 4 hours ago | parent | next [-]

How do you handle prompt caching? A lot of cost savings for a single model chat come from cache hits on the conversation context, and switching models invalidates that cache — the new model has to reprocess everything at full input price.

patch_dev 4 hours ago | parent | prev | next [-]

What does this solve that well used subagents doesn't solve already?

FrancescoMassa 3 hours ago | parent [-]

On our tests subagents & well used workflows are 20-30% more expensive for context & token efficiency

13 hours ago | parent | prev [-]
[deleted]