Evolutionary Multiagent Transfer Learning With Model-Based Opponent Behavior Prediction

被引:12
|
作者
Hou, Yaqing [1 ]
Ong, Yew-Soon [2 ]
Tang, Jing [3 ]
Zeng, Yifeng [3 ]
机构
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116024, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] Teesside Univ, Sch Comp, Middlesbrough TS1 3BX, Cleveland, England
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2021年 / 51卷 / 10期
基金
中国国家自然科学基金;
关键词
Behavior prediction; evolutionary transfer learning (eTL); monotone submodular model selection; multiagent system (MAS);
D O I
10.1109/TSMC.2019.2958846
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article embarks a study on multiagent transfer learning (TL) for addressing the specific challenges that arise in complex multiagent systems where agents have different or even competing objectives. Specifically, beyond the essential backbone of a state-of-the-art evolutionary TL framework (eTL), this article presents the novel TL framework with prediction (eTL-P) as an upgrade over existing eTL to endow agents with abilities to interact with their opponents effectively by building candidate models and accordingly predicting their behavioral strategies. To reduce the complexity of candidate models, eTL-P constructs a monotone submodular function, which facilitates to select Top-K models from all available candidate models based on their representativeness in terms of behavioral coverage as well as reward diversity. eTL-P also integrates social selection mechanisms for agents to identify their better-performing partners, thus improving their learning performance and reducing the complexity of behavior prediction by reusing useful knowledge with respect to their partners' mind universes. Experiments based on a partner-opponent minefield navigation task (PO-MNT) have shown that eTL-P exhibits the superiority in achieving higher learning capability and efficiency of multiple agents when compared to the state-of-the-art multiagent TL approaches.
引用
收藏
页码:5962 / 5976
页数:15
相关论文
共 50 条
  • [41] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
    Wang, Tonghao
    Peng, Xingguang
    Jin, Yaochu
    Xu, Demin
    MEMETIC COMPUTING, 2022, 14 (01) : 3 - 17
  • [42] Model Learning and Model-Based Testing
    Aichernig, Bernhard K.
    Mostowski, Wojciech
    Mousavi, Mohammad Reza
    Tappler, Martin
    Taromirad, Masoumeh
    MACHINE LEARNING FOR DYNAMIC SOFTWARE ANALYSIS: POTENTIALS AND LIMITS, 2018, 11026 : 74 - 100
  • [43] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
    Tonghao Wang
    Xingguang Peng
    Yaochu Jin
    Demin Xu
    Memetic Computing, 2022, 14 : 3 - 17
  • [44] Problematic gambling behavior impacts model-based reinforcement learning performance
    Brands, Angela Mariele
    Mathar, David
    Knauth, Kilian
    Kuzmanovic, Bojana
    Tittgemeyer, Marc
    Peters, Jan
    JOURNAL OF BEHAVIORAL ADDICTIONS, 2024, 13 : 97 - 98
  • [45] Model-based evolutionary algorithms: a short survey
    Ran Cheng
    Cheng He
    Yaochu Jin
    Xin Yao
    Complex & Intelligent Systems, 2018, 4 : 283 - 292
  • [46] Model-based evolutionary algorithms: a short survey
    Cheng, Ran
    He, Cheng
    Jin, Yaochu
    Yao, Xin
    COMPLEX & INTELLIGENT SYSTEMS, 2018, 4 (04) : 283 - 292
  • [47] Model-based architecture for evolutionary intelligent systems
    Chi, SD
    Lee, JS
    TRANSACTIONS OF THE SOCIETY FOR COMPUTER SIMULATION INTERNATIONAL, 2001, 18 (01): : 2 - 8
  • [48] Using evolutionary algorithms for model-based clustering
    Andrews, Jeffrey L.
    McNicholas, Paul D.
    PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 987 - 992
  • [49] Model-Based Deep Learning
    Shlezinger, Nir
    Whang, Jay
    Eldar, Yonina C.
    Dimakis, Alexandros G.
    PROCEEDINGS OF THE IEEE, 2023, 111 (05) : 465 - 499
  • [50] Model-based machine learning
    Bishop, Christopher M.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2013, 371 (1984):