Solving multi-stage games with hierarchical learning automata that bootstrap

被引:1
|
作者
Peeters, Maarten [1 ]
Verbeeck, Katja [2 ]
Nowe, Ann [1 ]
机构
[1] Vrije Univ Brussel, Computat Modeling Lab, Pleinlaan 2, B-1050 Brussels, Belgium
[2] Maastricht Univ, MICC IKAT, NL-6200 MD Maastricht, Netherlands
来源
关键词
D O I
10.1007/978-3-540-77949-0_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical learning automata are shown to be an excellent tool for solving multi-stage games. However, most updating schemes used by hierarchical automata expect the multi-stage game to reach an absorbing state at which point the automata are updated in a Monte Carlo way. As such, the approach is infeasible for large multi-stage games (and even for problems with an infinite horizon) and the convergence process is slow. In this paper we propose an algorithm where the rewards don't have to travel all the way up to the top of the hierarchy and in which there is no need for explicit end-stages.
引用
收藏
页码:169 / +
页数:3
相关论文
共 50 条
  • [41] Effect of Bio-Inspired Multi-Stage Regulations for Diagnostic Molecular Automata
    Hirabayashi, Miki
    Ohashi, Hirotada
    Kubo, Tai
    2008 THIRD INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, 2008, : 55 - +
  • [42] Design of Bio-Inspired Multi-Stage Regulations for Diagnostic Molecular Automata
    Hirabayashi, Miki
    Ohashi, Hirotada
    Kubo, Tai
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2010, 7 (05) : 831 - 839
  • [44] Solving Scheduling Problems in a Multi-stage Multiproduct Batch Pharmaceutical Industry
    Kopanos, Georgios M.
    Mendez, Carlos A.
    Puigjaner, Luis
    PRES 2010: 13TH INTERNATIONAL CONFERENCE ON PROCESS INTEGRATION, MODELLING AND OPTIMISATION FOR ENERGY SAVING AND POLLUTION REDUCTION, 2010, 21 : 511 - 516
  • [45] A novel method for solving multi-stage distribution substation expansion planning
    Kaewmamuang, Komsan
    Siritaratiwat, Apirat
    Surawanitkun, Chayada
    Khunkitti, Pirat
    Chatthaworn, Rongrit
    5TH INTERNATIONAL CONFERENCE ON POWER AND ENERGY SYSTEMS ENGINEERING (CPESE 2018), 2019, 156 : 371 - 383
  • [46] Solving multi-stage parallel machine problem with limited intermediate buffers
    Wang, Binggang
    Rao, Yunqing
    Shao, Xinyu
    Wang, Mengchang
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2009, 37 (05): : 86 - 89
  • [47] Microneedle hierarchical structure construction for promoting multi-stage wound healing
    Zhang, Rui
    Tang, Pengfei
    Chen, Zhenfeng
    Tang, Ming
    Yang, Kun
    Tang, Youhong
    Zhang, Hongping
    Wang, Qingyuan
    INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2025, 674
  • [48] Pedestrian Detection with Unsupervised Multi-Stage Feature Learning
    Sermanet, Pierre
    Kavukcuoglu, Koray
    Chintala, Soumith
    LeCun, Yann
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3626 - 3633
  • [49] Multi-Stage Feature Constraints Learning for Age Estimation
    Xia, Min
    Zhang, Xu
    Liu, Wan'an
    Weng, Liguo
    Xu, Yiqing
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 2417 - 2428
  • [50] Clustering Acoustic Segments Using Multi-Stage Agglomerative Hierarchical Clustering
    Lerato, Lerato
    Niesler, Thomas
    PLOS ONE, 2015, 10 (10):