Solving multi-stage games with hierarchical learning automata that bootstrap

被引:1
|
作者
Peeters, Maarten [1 ]
Verbeeck, Katja [2 ]
Nowe, Ann [1 ]
机构
[1] Vrije Univ Brussel, Computat Modeling Lab, Pleinlaan 2, B-1050 Brussels, Belgium
[2] Maastricht Univ, MICC IKAT, NL-6200 MD Maastricht, Netherlands
来源
关键词
D O I
10.1007/978-3-540-77949-0_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical learning automata are shown to be an excellent tool for solving multi-stage games. However, most updating schemes used by hierarchical automata expect the multi-stage game to reach an absorbing state at which point the automata are updated in a Monte Carlo way. As such, the approach is infeasible for large multi-stage games (and even for problems with an infinite horizon) and the convergence process is slow. In this paper we propose an algorithm where the rewards don't have to travel all the way up to the top of the hierarchy and in which there is no need for explicit end-stages.
引用
收藏
页码:169 / +
页数:3
相关论文
共 50 条
  • [11] Coordinated exploration in conflicting multi-stage games
    Peeters, Maarten
    Kononen, Ville
    Verbeeck, Katja
    Van Segbroeck, Sven
    Nowe, Ann
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 391 - +
  • [12] MULTI-STAGE GAMES WITH NON-PRESCRIBED TERMINAL STAGE
    BLAQUIERE, A
    GANI, N
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES SERIE A, 1969, 268 (07): : 428 - +
  • [13] A multi-stage hierarchical approach to alloy design
    P. K. Ray
    T. Brammer
    Y. Y. Ye
    M. Akinc
    M. J. Kramer
    JOM, 2010, 62 : 25 - 29
  • [14] A multi-stage hierarchical approach to alloy design
    Ray, P. K.
    Brammer, T.
    Ye, Y. Y.
    Akinc, M.
    Kramer, M. J.
    JOM, 2010, 62 (10) : 25 - 29
  • [15] A Comparison of Existing Bootstrap Algorithms for Multi-Stage Sampling Designs
    Chen, Sixia
    Haziza, David
    Mashreghi, Zeinab
    STATS, 2022, 5 (02): : 521 - 537
  • [16] As Safe As It Gets: Near-Optimal Learning in Multi-Stage Games with Imperfect Monitoring
    Kuminov, Danny
    Tennenholtz, Moshe
    ECAI 2008, PROCEEDINGS, 2008, 178 : 438 - +
  • [17] Strategic investments in multi-stage General Lotto games
    Chandan, Rahul
    Paarporn, Keith
    Alizadeh, Mahnoosh
    Marden, Jason R.
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4444 - 4448
  • [18] A Multi-stage Theory of Neurofeedback Learning
    Davelaar, Eddy J.
    AUGMENTED COGNITION. THEORETICAL AND TECHNOLOGICAL APPROACHES, AC 2020, PT I, 2020, 12196 : 118 - 128
  • [19] A Novel Multi-Stage Approach for Hierarchical Intrusion Detection
    Verkerken, Miel
    D'hooge, Laurens
    Sudyana, Didik
    Lin, Ying-Dar
    Wauters, Tim
    Volckaert, Bruno
    De Turck, Filip
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (03): : 3915 - 3929
  • [20] The Philosophy of Solving Flow Interference in Multi-stage Switch
    Tian, Yupeng
    Zhang, Xiaoping
    Li, Dehu
    Zhou, Chen
    18TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2012): GREEN AND SMART COMMUNICATIONS FOR IT INNOVATION, 2012, : 680 - 685