A Survey of Monte Carlo Tree Search Methods

被引:1843
作者
Browne, Cameron B. [1 ]
Powley, Edward [3 ]
Whitehouse, Daniel [3 ]
Lucas, Simon M. [2 ]
Cowling, Peter I. [3 ]
Rohlfshagen, Philipp [2 ]
Tavener, Stephen [1 ]
Perez, Diego [2 ]
Samothrakis, Spyridon [2 ]
Colton, Simon [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2RH, England
[2] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, Essex, England
[3] Univ Bradford, Sch Comp Informat & Media, Bradford BD7 1DP, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Artificial intelligence (AI); bandit-based methods; computer Go; game search; Monte Carlo tree search (MCTS); upper confidence bounds (UCB); upper confidence bounds for trees (UCT); GAME; INFORMATION; MODEL;
D O I
10.1109/TCIAIG.2012.2186810
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work.
引用
收藏
页码:1 / 43
页数:43
相关论文
共 239 条
[1]   EXPECTED-OUTCOME - A GENERAL-MODEL OF STATIC EVALUATION [J].
ABRAMSON, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (02) :182-193
[3]   Nested Monte-Carlo Search with AMAF Heuristic [J].
Akiyama, Haruhiko ;
Komiya, Kanako ;
Kotani, Yoshiyuki .
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, :172-176
[4]   PROOF-NUMBER SEARCH [J].
ALLIS, LV ;
VANDERMEULEN, M ;
VANDENHERIK, HJ .
ARTIFICIAL INTELLIGENCE, 1994, 66 (01) :91-124
[5]  
Althofer I., 2010, GAME SELF PLAY PURE
[6]  
Althofer I., 2008, LAZINESS MONTE CARLO
[7]  
[Anonymous], 2010, INT C COMP GAM SPRIN
[8]  
[Anonymous], 2008, P 23 AAAI C ART INT
[9]  
[Anonymous], 2009, P 12 INT IEEE C INTE
[10]  
[Anonymous], ADV COMPUTER GAMES