March 3, 2026Open Access

Chess-GPT: A Transformer’s Approach to Chess

Key Points

Intermediate playing strength of chess models scored 1400-1500 Elo against Stockfish levels 0-2, showcasing potential.
Models achieved a 99.5–99.65% legal move rate, demonstrating high accuracy in move validation during gameplay.
Evaluation involved three GPT-2 architectures trained on various game notations, emphasizing dataset quality.
Findings suggest pure pattern recognition shows promise but lacks expert-level reasoning for advanced chess play.

Abstract

This study investigates the capabilities of transformer-based models in chess move generation and gameplay when trained solely on human game notations. Three GPT-2 architectures (two trained on unfiltered games and one on high-Elo games (>1800)) were evaluated for legal move accuracy and playing strength. The models achieved a 99.5–99.65% legal move rate and demonstrated intermediate playing strength (1400–1500 Elo) against Stockfish levels 0–2, despite lacking hardcoded chess rules or search algorithms. The filtered model showed marginal improvement, suggesting dataset quality impacts performance. These results highlight the promise of pure pattern recognition in constrained domains while underscoring its limitations in achieving expert-level play without symbolic reasoning.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Markus Palmheden

Tim Persson

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Chess-GPT: A Transformer’s Approach to Chess

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study