Solving multi-stage games with hierarchical learning automata that bootstrap

Cited by: 1

Authors:

Peeters, Maarten [1]

Verbeeck, Katja [2]

Nowe, Ann [1]

Affiliations:

[1] Vrije Univ Brussel, Computat Modeling Lab, Pleinlaan 2, B-1050 Brussels, Belgium

[2] Maastricht Univ, MICC IKAT, NL-6200 MD Maastricht, Netherlands

Source:

ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS | 2008 / Vol. 4865

Keywords:

DOI:

10.1007/978-3-540-77949-0_13

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Hierarchical learning automata are shown to be an excellent tool for solving multi-stage games. However, most updating schemes used by hierarchical automata expect the multi-stage game to reach an absorbing state at which point the automata are updated in a Monte Carlo way. As such, the approach is infeasible for large multi-stage games (and even for problems with an infinite horizon) and the convergence process is slow. In this paper we propose an algorithm where the rewards don't have to travel all the way up to the top of the hierarchy and in which there is no need for explicit end-stages.


Pages: 169 / +

Page count: 3
