▲ | didip 6 days ago | |||||||||||||
So… I have a number of questions: 1. How did he tell Claude to “update” based on the notebook entries? 2. Won’t he eventually ran out of context window? 3. Won’t this be expensive when using hosted solutions? For just personal hacking, why not simply use ollama + your favorite model? 4. If one were to build this locally, can Vector DB similarity search or a hybrid combined with fulltext search be used to achieve this? I can totally imagine using pgai for the notebook logs feature and local ollama + deepseek for the inference. The email idea mentioned by other commenters is brilliant. But I don’t think you need a new mailbox, just pull from Gmail and grep if sender and receiver is yourself (aka the self tag). Thank you for sharing, OP’s project is something I have been thinking for a few months now. | ||||||||||||||
▲ | simonw 6 days ago | parent [-] | |||||||||||||
> Won’t he eventually ran out of context window? The "memories" table has a date column which is used to record the data when the information is relevant. The prompt can then be fed just information for today and the next few days - which will always be tiny. It's possible to save "memories" that are always included in the prompt, but even those will add up to not a lot of tokens over time. > Won’t this be expensive when using hosted solutions? You may be under-estimating how absurdly cheap hosted LLMs are these days. Most prompts against most models cost a fraction of a single cent, even for tens of thousands of tokens. Play around with my LLM pricing calculator for an illustration of that: https://tools.simonwillison.net/llm-prices > If one were to build this locally, can Vector DB similarity search or a hybrid combined with fulltext search be used to achieve this? Geoffrey's design is so simple it doesn't even need search - all it does is dump in context that's been stamped with a date, and there are so few tokens there's no need for FTS or vector search. If you wanted to build something more sophisticated you could absolutely use those. SQLite has surprisingly capable FTS built in and there are extensions like https://github.com/asg017/sqlite-vec for doing things with vectors. | ||||||||||||||
|