| ▲ | devmor 8 hours ago | |||||||||||||||||||||||||||||||||||||
AI detectors that use text as a basis are not real. It is fundamentally impossible for them to exist. | ||||||||||||||||||||||||||||||||||||||
| ▲ | HarHarVeryFunny 7 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||
Huh? LLM output doesn't have the variety of human output, since they operate in fixed fashion - statistical inference followed by formulaic sampling. Additionally, the statistics used by LLMs are going be be similar across different LLMs since at scale its just "the statistics of the internet". Human output has much more variety, partly because we're individuals with our own reading/writing histories (which we're drawing upon when writing), and partly because we're not so formulaic in the way we generate. Individuals have their own writing styles and vocabulary, and one can identify specific authors to a reasonable degree of accuracy based on this. It's a bit like detecting cheating in a chess tournament. If an unusually high percentage of a player's moves are optimal computer moves, then there is a high likelihood that they were computer generated. Computers and humans don't pick moves in the same way, and humans don't have the computational power to always find "optimal" moves. Similarly with the "AI detectors" used to detect if kids are using AI to write their homework essays, or to detect if blog posts are AI generated ... if an unusually high percentage of words are predictable by what came before (the way LLMs work), and if those statistics match that of an LLM, then there is an extremely high chance that it was written by an LLM. Can you ever be 100% sure? Maybe not, but in reality human written text is never going to have such statistical regularity, and such an LLM statistical signature, that an AI detector gives it more than a 10-20% confidence of being AI, so when the detector says it's 80%+ confident something was AI generated, that effectively means 100%. There is of course also content that is part human part AI (human used LLM to fix up their writing), which may score somewhere in the middle. | ||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||
| ▲ | watsonL1F7 8 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||
[flagged] | ||||||||||||||||||||||||||||||||||||||