acutesoftware 5 hours ago
This highlights that all RAG systems should be using metadata embedded into each of the vector stores. Any result from the LLM needs to have a link to a document / chunk, which in turn links to a 'source file', which (should) have the file system owner's id or another method of linking to a person. If the 'source information' cannot be linked to a person in the organisation, then it doesn't really belong in the RAG document store as authoritative information.
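A minimal sketch of that chunk -> source file -> person chain, assuming nothing about any particular vector store. Every name here (`SourceFile`, `Chunk`, `owner_id`, `filter_authoritative`, `cite`) is hypothetical and only illustrates the idea; a real store would carry the same fields in whatever metadata mechanism it offers.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SourceFile:
    """Hypothetical record for the file a chunk was cut from."""
    path: str
    owner_id: Optional[str]  # e.g. file system owner's id; None if unknown


@dataclass(frozen=True)
class Chunk:
    """Hypothetical retrieved chunk carrying its provenance."""
    chunk_id: str
    text: str
    source: SourceFile


def is_authoritative(chunk: Chunk, known_people: set[str]) -> bool:
    # A chunk counts as authoritative only if its source file
    # links back to a person known to the organisation.
    return chunk.source.owner_id in known_people


def filter_authoritative(chunks: list[Chunk], known_people: set[str]) -> list[Chunk]:
    # Drop chunks whose source cannot be tied to a known person.
    return [c for c in chunks if is_authoritative(c, known_people)]


def cite(chunk: Chunk) -> str:
    # Build a citation string to attach to an LLM answer.
    return f"{chunk.chunk_id} <- {chunk.source.path} (owner: {chunk.source.owner_id})"
```

Keeping the owner on the source file rather than on each chunk means ownership is recorded once per file, and the citation can be attached to the answer after generation instead of being fed into the prompt.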
salawat 4 hours ago
But you can't do that. That would implicitly out where the knowledge came from, and we all know that the AI industry has an existential incapability to actually cope with that little turd. Might work great for data you actually own or have access to. Imagine that applied back to the latent space of LLMs though. Plus, wouldn't all of that eat through the context window like there's no tomorrow?