| ▲ | Tiberium 4 hours ago | |||||||||||||
Sounds like a hallucination unless proven otherwise, even the leading LLMs can do those from time to time, and they will always appear plausible like that. Also could be the session having a lot previous context, like 800K+, which (I think) makes hallucinations more likely. Relevant comment from the OP which makes a hallucination more likely: > There is one tool call result that includes a string that printed a pathname including minecraft.py because it was listing the files in a Python virtual environment and the Pygments package has a lexer called minecraft.py | ||||||||||||||
| ▲ | andy99 3 hours ago | parent | next [-] | |||||||||||||
I realize hallucination has no precise definition but this doesn’t sound at all like anything I’ve ever heard called hallucination. Hallucination is usually plausible wrong answers or made up info that ends up fitting the most likely response (like a manufactured citation) and comes from the way LLMs work at predicting tokens. This example demonstrates completely implausible output, it’s not something that fits with hallucination. All that said, it doesn’t require cross session leakage, it could just be training data or like those nightingale (probably the wrong bird*) data generations where they just prompt an LLM with nothing and it starts spitting out conversations. I see a bunch of downstream comments about caching, sounds like maybe there’s an error where it loads nothing instead of the cache and so starts spitting out random generations. * edit: it’s magpie. Worth looking at the concept, I’m not sure people realize they LLMs generate random conversations when prompted with nothing, this seems at least as likely as sessions leaking: https://github.com/magpie-align/magpie | ||||||||||||||
| ||||||||||||||
| ▲ | macNchz 4 hours ago | parent | prev | next [-] | |||||||||||||
The person posting this claims to have reproduced in a separate context down the thread: > Same thing just happened on a Claude Mobile session in same Enterprise account. Common theme in both is Sonnet 5, first response after more than 5 minutes (cache miss). | ||||||||||||||
| ▲ | xyzzy_plugh 4 hours ago | parent | prev | next [-] | |||||||||||||
I don't disagree but this sort of thing has to be investigated regardless. It's unfortunate that there is so little transparency that even if they deny there was a leak we will never know for certain. | ||||||||||||||
| ▲ | alserio 3 hours ago | parent | prev | next [-] | |||||||||||||
Why? what does make it more likely? | ||||||||||||||
| ▲ | paulddraper 2 hours ago | parent | prev | next [-] | |||||||||||||
Exactly. If you've never had an LLM (all models) suddenly start spouting nonsense in a completely different language...you haven't been using LLMs that much. They will go absolutely insane some % of the time. | ||||||||||||||
| ||||||||||||||
| ▲ | prima-facie 2 hours ago | parent | prev [-] | |||||||||||||
[dead] | ||||||||||||||