▲ | tough 3 days ago | |||||||
that relies on the positive-aligned RLHF models most labs do. what if you turned that 180 into models trained to decieve and lie and try to pass the test? | ||||||||
▲ | lumost 3 days ago | parent | next [-] | |||||||
Human's are able to quickly converge on a pattern. While I doubt that I could immediately catch all LLMs, I can certainly catch a good portion by having simply worked with them for a time. On an infinite horizon Turing test, where I have the option to state that Chair A is a machine at any time - I would certainly expect to detect LLMs simply by virtue of their limited conversational range. | ||||||||
| ||||||||
▲ | oinfoalgo 3 days ago | parent | prev [-] | |||||||
If we had firms spending billions of dollars to pass the Turing test, it seems absurd to me to believe the current crop of models could not pass the test. Luckily, it is obvious that spending huge amounts of money to train models on how to best deceive humans with language is a terrible idea. That is also gaming the test and not in the spirit of generality that the test was trying to test for. Even playing Tic-tac-toe against GPT5 is a joke. The model knows enough of how the game works to let you play in text but doesn't even know when you won the game. The interesting part is that the model can even tell you why it sucks at tic-tac-toe "Because I’m not really thinking about the game like a human — I’m generating moves based on text patterns, not visualizing the board in the same intuitive way you do." 10 years ago it would not be conceivable we could have models that pass the Turing test but be hopeless at Tic-tac-toe and be able to tell you why they are not good at Tic-tac-toe. That right there is a total invalidation of the Turing test IMO. | ||||||||
|