▲ | og_kalu 3 days ago | |||||||||||||||||||||||||
>These models cannot even make legal chess moves. That’s incredibly basic logic, and it shows how LLMs are still completely incapable of reasoning or understanding. Yeah they can. There's a link I shared to prove it which you've conveniently ignored. LLMs learn by predicting, failing and getting a little better, rinse and repeat. Pre-training is not like reading a book. LLMs trained on chess games play chess just fine. They don't make the silly mistakes you're talking about and they very rarely make illegal moves. There's gpt-3.5-turbo-instruct which i already shared and plays at around 1800 ELO. Then there's this grandmaster level chess transformer - https://arxiv.org/abs/2402.04494. They're also a couple of models that were trained in the Eleuther AI discord that reached about 1100-1300 Elo. I don't know what the peak of LLM Chess playing looks like but this is clearly less of a 'LLMs can't do this' problem and more 'Open AI/Anthropic/Google etc don't care if their models can play Chess or not' problem. So are they capable of reasoning now or would you like to shift the posts ? | ||||||||||||||||||||||||||
▲ | int_19h 3 days ago | parent [-] | |||||||||||||||||||||||||
I think the point here is that if you have to pretrain it for every specific task, it's not artificial general intelligence, by definition. | ||||||||||||||||||||||||||
|