I wouldn’t be surprised if it’s literally zero. I’ve tried with a few LLMs, and they’re all very confident that they know how to play chess, but they just start hallucinating illegal moves immediately.
A few weeks ago, Gemini got confused when it tried to go first as black multiple times, so that’s the most immediate one I can remember. Last week, chatGPT offered to set up chess puzzles for me, but it made mistakes 3 out of 3 times.
Maybe I’ll try again. Is there a certain one you’ve seen good performance out of?
What’s ChatGPT’s rating? It’s impressive it can play chess at all, considering that’s not its core skill.
I wouldn’t be surprised if it’s literally zero. I’ve tried with a few LLMs, and they’re all very confident that they know how to play chess, but they just start hallucinating illegal moves immediately.
Immediately? When was the last time you tried? The newer models can hold a game well for 10-20 moves.
A few weeks ago, Gemini got confused when it tried to go first as black multiple times, so that’s the most immediate one I can remember. Last week, chatGPT offered to set up chess puzzles for me, but it made mistakes 3 out of 3 times.
Maybe I’ll try again. Is there a certain one you’ve seen good performance out of?