thierrydamiba 25 minutes ago
Excellent write-up. I’ve been thinking a lot about caching and agents, so this was right up my alley. Have you experimented with using a semantic cache on the chain of thought (what we get back from the providers anyways) and sending that to a dumb model for similar queries to “simulate” thinking?