| ▲ | stavros 6 hours ago | |||||||||||||||||||||||||
How can you have cache hit efficiency? Isn't it just a matter of not changing the previous context? I don't understand what knobs there are to tweak on this. | ||||||||||||||||||||||||||
| ▲ | everforward 6 hours ago | parent [-] | |||||||||||||||||||||||||
> Isn't it just a matter of not changing the previous context? Yes, but a lot of harnesses change previous context. E.g. the system prompt injects the current time/date, working directory, files in the working directory, etc. Compaction also changes the whole previous context. I _think_ changing the list of tools also invalidates cache, so invoking a subagent with different tools would invalidate the cache. My vague impression is that it's in a similar vein to functional programming languages. It generally disallows doing things that lead to bugs (cache misses in this case), and presumably allows you to do those things in a way that makes it much clearer that this is likely to cause cache misses. I would guess that in this paradigm, you don't mutate your existing session, you derive a new session by mutating the prior context into a new context. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||