| ▲ | Imustaskforhelp 4 days ago | |||||||
> But there is probably already some tradeoff, as GPT 3.5 was awesome at chess and current models don't seem trained extensively on chess anymore. Wow, I am so curious. Can you point me to the source? As someone who occasionally plays chess, I am very interested in a chess benchmark for LLMs. I have thought about creating something like this, and it would be very interesting to find the best chess player that isn't stockfish/lila but a general-purpose large language model. I also agree that there might be an explosion of purpose-trained LLMs. I had this idea a year or so ago, back when there was llama and before deepseek: what if I want to write sveltekit? Models like deepseek know about sveltekit, but they are so damn big and bloated when I only want a sveltekit/svelte model. Yes, there are arguments for why we might need the whole network to get better quality, but I genuinely feel that right now the better quality is debatable thanks to all this benchmarkmaxxing. I would happily take a model trained on sveltekit, preferably at 4b-8b parameters, but even if an extremely good SOTA-ish sveltekit model were around 30-40b I would be happy, since I could buy a GPU for my PC to run it, or run it on my Mac. I think my brother, who (unlike me) actually knows what he's talking about in the AI space, also said the same thing to me a few months back. In fact, it's funny, because I had asked him to create a website comparing benchmarks of AI playing chess, with an option to make two LLMs play against each other while we watch, or to play against an LLM ourselves on an actual chess board on the web, and more. I gave him this idea a few months ago, after that talk about small LLMs, lol, and he said it was good but that he was busy right then. I think he later forgot about it, and I had forgotten about it too until now. | ||||||||
| ▲ | radarsat1 4 days ago | parent | next [-] | |||||||
Just search for "chess LLM leaderboard"; there are already several. Also check https://www.reddit.com/r/llmchess/ although admittedly it doesn't get a lot of traffic. | ||||||||
| ▲ | zurfer 3 days ago | parent | prev | next [-] | |||||||
This was the article I had in mind when writing this: https://dynomight.substack.com/p/chess | ||||||||