adchurch 2 hours ago
You're right, and that's why we built the router to be cache-aware! Once it starts using one model, the threshold for switching to another is higher, because the added cost of the cache miss has to be outweighed by the cost savings or quality increase. This is the key thing that other routers we've seen miss: they're stateless, so for a coding-agent use case you end up spending more money due to all the cache misses.
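To make the switching threshold concrete, here is a minimal sketch of what such a rule could look like. It is not the router's actual implementation: `Candidate`, `pick_model`, and `cache_miss_cost` are hypothetical names, and it assumes quality and cost can be expressed in the same units.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    name: str             # hypothetical model identifier
    expected_cost: float  # assumed cost of the request on a warm cache
    quality: float        # assumed quality estimate, in the same units as cost


def pick_model(candidates: List[Candidate],
               current: Optional[str],
               cache_miss_cost: float) -> str:
    """Pick the model with the best net value, charging a switch penalty."""
    def net_value(c: Candidate) -> float:
        # Staying on the current model keeps its prompt cache warm, so only
        # the other models pay the assumed cache-miss penalty.
        switching = current is not None and c.name != current
        penalty = cache_miss_cost if switching else 0.0
        return c.quality - c.expected_cost - penalty

    return max(candidates, key=net_value).name


models = [Candidate("small", 0.01, 0.50), Candidate("large", 0.05, 0.60)]
first = pick_model(models, current=None, cache_miss_cost=0.10)
sticky = pick_model(models, current="small", cache_miss_cost=0.10)
switch = pick_model(models, current="small", cache_miss_cost=0.02)
```

With these made-up numbers, the stateless choice is the larger model, but once the session is on the smaller one, the switch only happens when the cache-miss penalty is small enough to be worth it.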
alansaber 2 hours ago
That's interesting. It sounds like in practice you only end up routing between two models.
mthoms an hour ago
This is a key point. I don't know if you can still edit your submission, but I think this would be helpful to mention up front. I'm looking forward to testing this.