Monte Carlo Tree Search in the Presence of Transition Uncertainty

被引:0
|
作者
Kohankhaki, Farnaz [1 ]
Aghakasiri, Kiarash [1 ,2 ]
Zhang, Hongming [1 ]
Wei, Ting-Han [1 ]
Gao, Chao [2 ]
Mueller, Martin [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Huawei Canada, Edmonton Res Ctr, Markham, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo Tree Search (MCTS) is an immensely popular search-based framework used for decision making. It is traditionally applied to domains where a perfect simulation model of the environment is available. We study and improve MCTS in the context where the environment model is given but imperfect. We show that the discrepancy between the model and the actual environment can lead to significant performance degradation with standard MCTS. We therefore develop Uncertainty Adapted MCTS (UA-MCTS), a more robust algorithm within the MCTS framework. We estimate the transition uncertainty in the given model, and direct the search towards more certain transitions in the state space. We modify all four MCTS phases to improve the search behavior by considering these estimates. We prove, in the corrupted bandit case, that adding uncertainty information to adapt UCB leads to tighter regret bound than standard UCB. Empirically, we evaluate UA-MCTS and its individual components on the deterministic domains from the MinAtar test suite. Our results demonstrate that UA-MCTS strongly improves MCTS in the presence of model transition errors.
引用
收藏
页码:20151 / 20158
页数:8
相关论文
共 50 条
  • [41] Monte-Carlo Tree Search for Policy Optimization
    Ma, Xiaobai
    Driggs-Campbell, Katherine
    Zhang, Zongzhang
    Kochenderfer, Mykel J.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3116 - 3122
  • [42] Scalability and Parallelization of Monte-Carlo Tree Search
    Bourki, Amine
    Chaslot, Guillaume
    Coulm, Matthieu
    Danjean, Vincent
    Doghmen, Hassen
    Hoock, Jean-Baptiste
    Herault, Thomas
    Rimmel, Arpad
    Teytaud, Fabien
    Teytaud, Olivier
    Vayssiere, Paul
    Yu, Ziqin
    COMPUTERS AND GAMES, 2011, 6515 : 48 - 58
  • [43] On Monte Carlo Tree Search for Weighted Vertex Coloring
    Grelier, Cyril
    Goudet, Olivier
    Hao, Jin-Kao
    EVOLUTIONARY COMPUTATION IN COMBINATORIAL OPTIMIZATION, EVOCOP 2022, 2022, 13222 : 1 - 16
  • [44] Monte Carlo Tree Search for Bayesian Reinforcement Learning
    Vien, Ngo Anh
    Ertel, Wolfgang
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 138 - 143
  • [45] Enhancing Monte Carlo Tree Search for Playing Hearthstone
    Choe, Jean Seong Bjorn
    Kim, Jong-Kook
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [46] Monte Carlo tree search for materials design and discovery
    Dieb, Thaer M.
    Ju, Shenghong
    Shiomi, Junichiro
    Tsuda, Koji
    MRS COMMUNICATIONS, 2019, 9 (02) : 532 - 536
  • [47] Monte-Carlo Tree Search for Constrained POMDPs
    Lee, Jongmin
    Kim, Geon-Hyeong
    Poupart, Pascal
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [48] Monte Carlo tree search for materials design and discovery
    Thaer M. Dieb
    Shenghong Ju
    Junichiro Shiomi
    Koji Tsuda
    MRS Communications, 2019, 9 : 532 - 536
  • [49] Monte-Carlo Tree Search: To MC or to DP?
    Feldman, Zohar
    Domshlak, Carmel
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 321 - 326
  • [50] Monte-Carlo Tree Search in Settlers of Catan
    Szita, Istvan
    Chaslot, Guillaume
    Spronck, Pieter
    ADVANCES IN COMPUTER GAMES, 2010, 6048 : 21 - +