You are maybe confusing caching and context windows. Caching is mainly about keeping inference costs down