Monte Carlo Tree Search in the Presence of Transition Uncertainty

被引:0
|
作者
Kohankhaki, Farnaz [1 ]
Aghakasiri, Kiarash [1 ,2 ]
Zhang, Hongming [1 ]
Wei, Ting-Han [1 ]
Gao, Chao [2 ]
Mueller, Martin [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Huawei Canada, Edmonton Res Ctr, Markham, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo Tree Search (MCTS) is an immensely popular search-based framework used for decision making. It is traditionally applied to domains where a perfect simulation model of the environment is available. We study and improve MCTS in the context where the environment model is given but imperfect. We show that the discrepancy between the model and the actual environment can lead to significant performance degradation with standard MCTS. We therefore develop Uncertainty Adapted MCTS (UA-MCTS), a more robust algorithm within the MCTS framework. We estimate the transition uncertainty in the given model, and direct the search towards more certain transitions in the state space. We modify all four MCTS phases to improve the search behavior by considering these estimates. We prove, in the corrupted bandit case, that adding uncertainty information to adapt UCB leads to tighter regret bound than standard UCB. Empirically, we evaluate UA-MCTS and its individual components on the deterministic domains from the MinAtar test suite. Our results demonstrate that UA-MCTS strongly improves MCTS in the presence of model transition errors.
引用
收藏
页码:20151 / 20158
页数:8
相关论文
共 50 条
  • [1] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [2] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [3] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537
  • [4] Monte Carlo Tree Search in Hex
    Arneson, Broderick
    Hayward, Ryan B.
    Henderson, Philip
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
  • [5] Monte Carlo tree search in Kriegspiel
    Ciancarini, Paolo
    Favini, Gian Piero
    ARTIFICIAL INTELLIGENCE, 2010, 174 (11) : 670 - 684
  • [6] MONTE CARLO TREE SEARCH: A TUTORIAL
    Fu, Michael C.
    2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 222 - 236
  • [7] Monte Carlo Tree Search for Quoridor
    Respall, Victor Massague
    Brown, Joseph Alexander
    Aslam, Hamna
    19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 5 - 9
  • [8] An Analysis of Monte Carlo Tree Search
    James, Steven
    Konidaris, George
    Rosman, Benjamin
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3576 - 3582
  • [9] Approximation Methods for Monte Carlo Tree Search
    Aksenov, Kirill
    Panov, Aleksandr, I
    PROCEEDINGS OF THE FOURTH INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'19), 2020, 1156 : 68 - 74
  • [10] A TUTORIAL INTRODUCTION TO MONTE CARLO TREE SEARCH
    Fu, Michael C.
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1178 - 1193