Can Monte-Carlo Tree Search learn to sacrifice?

被引：0

作者：

Nathan Companez

Aldeida Aleti

机构：

[1] Monash University,Faculty of Information Technology

来源：

Journal of Heuristics | 2016年 / 22卷

关键词：

Monte-Carlo Tree Search; Sacrifice moves; Artificial intelligence; Games;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

One of the most basic activities performed by an intelligent agent is deciding what to do next. The decision is usually about selecting the move with the highest expectation, or exploring new scenarios. Monte-Carlo Tree Search (MCTS), which was developed as a game playing agent, deals with this exploration–exploitation ‘dilemma’ using a multi-armed bandits strategy. The success of MCTS in a wide range of problems, such as combinatorial optimisation, reinforcement learning, and games, is due to its ability to rapidly evaluate problem states without requiring domain-specific knowledge. However, it has been acknowledged that the trade-off between exploration and exploitation is crucial for the performance of the algorithm, and affects the efficiency of the agent in learning deceptive states. One type of deception is states that give immediate rewards, but lead to a suboptimal solution in the long run. These states are known as trap states, and have been thoroughly investigated in previous research. In this work, we study the opposite of trap states, known as sacrifice states, which are deceptive moves that result in a local loss but are globally optimal, and investigate the efficiency of MCTS enhancements in identifying this type of moves.

引用

页码：783 / 813

页数：30

共 50 条

[21] Converging to a Player Model In Monte-Carlo Tree Search
Sarratt, Trevor
Pynadath, David V.
Jhala, Arnav
2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2014,
[22] A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH
Sato, Yoshikuni
Takahashi, Daisuke
Grimbergen, Reijer
ICGA JOURNAL, 2010, 33 (02) : 80 - 92
[23] AIs for Dominion Using Monte-Carlo Tree Search
Tollisen, Robin
Jansen, Jon Vegard
Goodwin, Morten
Glimsdal, Sondre
CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 43 - 52
[24] Parallel Monte-Carlo Tree Search with Simulation Servers
Kato, Hideki
Takeuchi, Ikuo
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 491 - 498
[25] Generalized Mean Estimation in Monte-Carlo Tree Search
Dam, Tuan
Klink, Pascal
D'Eramo, Carlo
Peters, Jan
Pajarinen, Joni
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2397 - 2404
[26] Automated Machine Learning with Monte-Carlo Tree Search
Rakotoarison, Herilalaina
Schoenauer, Marc
Sebag, Michele
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
[27] CROSS-ENTROPY FOR MONTE-CARLO TREE SEARCH
Chaslot, Guillaume M. J. B.
Winands, Mark H. M.
Szita, Istvan
van den Herik, H. Jaap
ICGA JOURNAL, 2008, 31 (03) : 145 - 156
[28] Monte-Carlo Tree Search Parallelisation for Computer Go
van Niekerk, Francois
Kroon, Steve
van Rooyen, Gert-Jan
Inggs, Cornelia P.
PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 129 - 138
[29] Parallel Monte-Carlo Tree Search for HPC Systems
Graf, Tobias
Lorenz, Ulf
Platzner, Marco
Schaefers, Lars
EURO-PAR 2011 PARALLEL PROCESSING, PT 2, 2011, 6853 : 365 - 376
[30] Monte-Carlo Tree Search for the Maximum Satisfiability Problem
Goffinet, Jack
Ramanujan, Raghuram
PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2016, 2016, 9892 : 251 - 267

← 1 2 3 4 5 →