actsasbuffoon 3 days ago
This misses the point. LLMs will do things like move a knight by a single square as if it were a pawn. Chess is an extremely well-understood game, and the rules about how pieces move are almost certainly well represented in the training data. These models cannot even make legal chess moves. That’s incredibly basic logic, and it shows how LLMs are still completely incapable of reasoning or understanding. Many kinds of tasks are never going to be possible for LLMs unless that changes. Programming is one of those tasks.
og_kalu 3 days ago
> These models cannot even make legal chess moves. That’s incredibly basic logic, and it shows how LLMs are still completely incapable of reasoning or understanding.

Yeah, they can. I shared a link that proves it, which you've conveniently ignored.

LLMs learn by predicting, failing and getting a little better, rinse and repeat. Pre-training is not like reading a book.

LLMs trained on chess games play chess just fine. They don't make the silly mistakes you're talking about, and they very rarely make illegal moves.

There's gpt-3.5-turbo-instruct, which I already shared and which plays at around 1800 Elo. Then there's this grandmaster-level chess transformer: https://arxiv.org/abs/2402.04494. There are also a couple of models trained in the EleutherAI Discord that reached about 1100-1300 Elo.

I don't know what the peak of LLM chess playing looks like, but this is clearly less of an 'LLMs can't do this' problem and more of an 'OpenAI/Anthropic/Google etc. don't care whether their models can play chess' problem.

So are they capable of reasoning now, or would you like to shift the goalposts?
simonw 3 days ago
Saying programming is a task that is "never going to be possible" for an LLM is a big claim, given how many people have derived huge value from having LLMs write code for them over the past two years. (Unless you're arguing against the idea that LLMs are making programmers obsolete, in which case I fully agree with you.)