Key points are not available for this paper at this time.
Large language models (LLMs) have demonstrated remarkable capabilities in learning complex tasks purely from sequential data. To explore whether such models can internalize strategic world representations, We investigate whether generative transformer models can learn structured world representations from sequential data. Using the domain of Go life-and-death problems as a controlled micro-world, we train a GPT-style generative model to predict moves from serialized board states. Focusing on localized life-and-death (tsumego) scenarios, we train the model to predict valid next moves from serialized board states without providing any explicit Go rules or strategic supervision. Probing the model’s internal activations reveals structured representations aligned with liberties, eyes, and tactical group status. To interpret these representations, we introduce the Multi-Aspect World Probe (MAWP), a modular probing framework that disentangles tactical concepts into orthogonal dimensions. We further apply interventional techniques to manipulate internal representations and causally evaluate their impact on model predictions. Our results show that the proposed model achieves 94.7% accuracy in sequence correctness and 92.1% in outcome validity on life-and-death tasks. This work extends interpretability research into spatially structured domains and offers tools for understanding decision-making in sequence models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhikai Yang
Zhigang Meng
Zhiqiang Wen
AI
Changsha University
Hunan University of Technology
Henan Provincial Institute of Cultural Heritage and Archaeology
Building similarity graph...
Analyzing shared references across papers
Loading...
Yang et al. (Thu,) studied this question.
www.synapsesocial.com/papers/6a080acea487c87a6a40ccff — DOI: https://doi.org/10.3390/ai7050170
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: