Temporal difference learning and TD-Gammon

Gerald Tesauro

doi:10.1145/203330.203343

Abstract

Ever since the days of Shannon's proposal for a chess-playing algorithm [12] and Samuel's checkers-learning program [10] the domain of complex board games such as Go, chess, checkers, Othello, and backgammon has been widely regarded as an ideal testing ground for exploring a variety of concepts and approaches in artificial intelligence and machine learning. Such board games offer the challenge of tremendous complexity and sophistication required to play at expert level. At the same time, the problem inputs and performance measures are clear-cut and well defined, and the game environment is readily automated in that it is easy to simulate the board, the rules of legal play, and the rules regarding when the game is over and determining the outcome.

Published in

Communications of the ACM

Communications of the ACM 38 (3), 58-68, 1995-03

Association for Computing Machinery (ACM)

Details

CRID: 1361418518917877248

NII Article ID: 80008180967

DOI: 10.1145/203330.203343

ISSN: 1557-7317; 0001-0782; http://id.crossref.org/issn/00010782

Web Site: https://dl.acm.org/doi/pdf/10.1145/203330.203343

Data source

Crossref
CiNii Articles

Cited by: 29
