Multi-controller fusion in multi-layered reinforcement learning

Cited by: 13
Authors
Takahashi, Y [1]
Asada, M [1]
Affiliation
[1] Osaka Univ, Adapt Machine Syst Grad Sch Engn, Suita, Osaka 5650871, Japan
DOI
10.1109/MFI.2001.1013500
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper proposes multi-controller fusion in multi-layered reinforcement learning, with which an autonomous robot learns behaviors progressively, from lower-level to higher-level ones, throughout its life. In previous work [1], we proposed a method that enables the behavior learning system to acquire several kinds of knowledge/policies, to assign sub-tasks to learning modules by itself, to organize its own hierarchical structure, and to simplify the whole system by using only one kind of learning mechanism in all learning modules. However, that method has a few drawbacks: the system cannot handle changes in the state variables, and it easily falls prey to the curse of dimensionality when the number of state variables is large. In this paper, we propose decomposing the large state space at the bottom level into several subspaces and merging those subspaces at the higher level. This allows the system to reuse previously learned policies, to learn policies for new features, and therefore to avoid the curse of dimensionality. To show the validity of the proposed method, we apply it to a simple soccer situation in the context of RoboCup and show the experimental results.
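To make the state-space argument in the abstract concrete, the following is a minimal sketch of the arithmetic only, not the authors' method. The number of state variables, the quantization levels, and the grouping into subspaces are hypothetical values chosen for illustration; the abstract does not specify them. The helper names `table_size` and `decomposed_size` are likewise assumptions.

```python
from math import prod


def table_size(levels):
    """Number of discrete states when all variables in `levels` share one table."""
    return prod(levels)


def decomposed_size(groups):
    """Total number of states across modules, each covering one subspace."""
    return sum(table_size(group) for group in groups)


# Hypothetical setup: six state variables, each quantized into 10 levels.
levels = [10] * 6

# One monolithic bottom-level state space over all six variables.
monolithic = table_size(levels)

# The same variables split into three two-variable subspaces (assumed grouping).
groups = [levels[0:2], levels[2:4], levels[4:6]]
decomposed = decomposed_size(groups)

print(monolithic, decomposed)
```

Under these assumed numbers the monolithic table grows exponentially with the number of variables, while the decomposed tables grow with the sum of the subspace sizes. How the subspaces are merged at the higher level is described in the paper itself and is not modeled here.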
Pages: 7-12
Page count: 6