March 1, 1994

TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play

Key Points

Key points are not available for this paper at this time.

Abstract

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results, based on the TD(λ) reinforcement learning algorithm (Sutton 1988). Despite starting from random initial weights (and hence random initial strategy), TD-Gammon achieves a surprisingly strong level of play. With zero knowledge built in at the start of learning (i.e., given only a “raw” description of the board state), the network learns to play at a strong intermediate level. Furthermore, when a set of hand-crafted features is added to the network's input representation, the result is a truly staggering level of performance: the latest version of TD-Gammon is now estimated to play at a strong master level that is extremely close to the world's best human players.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Gerald Tesauro (Tue,) studied this question.

www.synapsesocial.com/papers/6a0a541e5b6facdebcb4e785 — DOI: https://doi.org/10.1162/neco.1994.6.2.215

Authors

Gerald Tesauro

Journals

Neural Computation

Actions

Institutions

IBM Research - Thomas J. Watson Research Center

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Learning to predict by the methods of temporal differences· 1988 · 2,784 citations
Learning to Predict by the Methods of Temporal Differences· 1988 · 3,943 citations
On Optimal Doubling in Backgammon· 1977 · 14 citations
Neurogammon Wins Computer Olympiad· 1989 · 70 citations
Practical issues in temporal difference learning· 1992 · 798 citations

TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider