Monte Carlo Tree Search with Robust Exploration

被引:0
|
作者
Imagawa, Takahisa [1 ,2 ]
Kaneko, Tomoyuki [1 ]
机构
[1] Univ Tokyo, Grad Sch Arts & Sci, Tokyo, Japan
[2] Japan Soc Promot Sci, Tokyo, Japan
来源
COMPUTERS AND GAMES, CG 2016 | 2016年 / 10068卷
关键词
D O I
10.1007/978-3-319-50935-8_4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a new Monte-Carlo tree search method that focuses on identifying the best move. UCT which minimizes the cumulative regret, has achieved remarkable success in Go and other games. However, recent studies on simple regret reveal that there are better exploration strategies. To further improve the performance, a leaf to be explored is determined not only by the mean but also by the whole reward distribution. We adopted a hybrid approach to obtain reliable distributions. A negamax-style backup of reward distributions is used in the shallower half of a search tree, and UCT is adopted in the rest of the tree. Experiments on synthetic trees show that this presented method outperformed UCT and similar methods, except for trees having uniform width and depth.
引用
收藏
页码:34 / 46
页数:13
相关论文
共 50 条
  • [1] Monte Carlo Tree Search with Boltzmann Exploration
    Painter, Michael
    Baioumy, Mohamed
    Hawes, Nick
    Lacerda, Bruno
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Path Exploration Based on Monte Carlo Tree Search for Symbolic Execution
    Yeh, Chao-Chun
    Lu, Han-Lin
    Yeh, Jia-Jun
    Huang, Shih-Kun
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 33 - 37
  • [3] Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search
    Chaslot, Guillaume
    Fiter, Christophe
    Hoock, Jean-Baptiste
    Rimmel, Arpad
    Teytaud, Olivier
    ADVANCES IN COMPUTER GAMES, 2010, 6048 : 1 - +
  • [4] A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search
    Dam, Than
    D'Eramo, Carlo
    Peters, Jan
    Pajarinen, Joni
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 81
  • [5] A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search
    Dam, Tuan
    D’Eramo, Carlo
    Peters, Jan
    Pajarinen, Joni
    Journal of Artificial Intelligence Research, 2024, 81 : 511 - 577
  • [6] Decentralised Multi-Robot Exploration using Monte Carlo Tree Search
    Bone, Sean
    Bartolomei, Luca
    Kennel-Maushart, Florian
    Chli, Margarita
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7354 - 7361
  • [7] Regulation of Exploration for Simple Regret Minimization in Monte-Carlo Tree Search
    Liu, Yun-Ching
    Tsuruoka, Yoshimasa
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2015, : 35 - 42
  • [8] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [9] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [10] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537