| ▲ | aiordienow a day ago | |
Context engineering is where the real leverage is right now. Most people focus on model selection but the retrieval and memory layer around the model makes a bigger difference in practice. What's your approach to managing context window limits — chunking with overlap, or some kind of relevance scoring before injection? | ||
| ▲ | Kevintbt 19 hours ago | parent [-] | |
As i said in the article, i have a filter for retrieval. I dont elaborate because i want to make it simple to read. You have the good structure, filtering, score relevance for every memories and indexes to facilitate the search ! You can check Supermemory infra its a bit how that works behind on chaaaaa.com | ||