▲ | lumost 3 days ago | |
Human's are able to quickly converge on a pattern. While I doubt that I could immediately catch all LLMs, I can certainly catch a good portion by having simply worked with them for a time. On an infinite horizon Turing test, where I have the option to state that Chair A is a machine at any time - I would certainly expect to detect LLMs simply by virtue of their limited conversational range. | ||
▲ | tough 3 days ago | parent [-] | |
if anything i would do differently, i'd try things only machines can reliably do. unless the llm and the design for it is necessarily adversarial, not even going into red teaming or jailbreaks. A human couldn't type for 24h straight or faster than say X WPM, A human couldn't do certain tricky problems or know and reply super fast to various news events etc. Search/training date seems important factor too to tie in. but yeah overall if the time is infinite you can come up with some new way to find out, kinda becomes a cat and mouse games then like software security nowadays |