Evolutionary Multiagent Transfer Learning With Model-Based Opponent Behavior Prediction

被引：12

作者：

Hou, Yaqing ^{[1
]}

Ong, Yew-Soon ^{[2
]}

Tang, Jing ^{[3
]}

Zeng, Yifeng ^{[3
]}

机构：

[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116024, Peoples R China

[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[3] Teesside Univ, Sch Comp, Middlesbrough TS1 3BX, Cleveland, England

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2021年 / 51卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Behavior prediction; evolutionary transfer learning (eTL); monotone submodular model selection; multiagent system (MAS);

D O I：

10.1109/TSMC.2019.2958846

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article embarks a study on multiagent transfer learning (TL) for addressing the specific challenges that arise in complex multiagent systems where agents have different or even competing objectives. Specifically, beyond the essential backbone of a state-of-the-art evolutionary TL framework (eTL), this article presents the novel TL framework with prediction (eTL-P) as an upgrade over existing eTL to endow agents with abilities to interact with their opponents effectively by building candidate models and accordingly predicting their behavioral strategies. To reduce the complexity of candidate models, eTL-P constructs a monotone submodular function, which facilitates to select Top-K models from all available candidate models based on their representativeness in terms of behavioral coverage as well as reward diversity. eTL-P also integrates social selection mechanisms for agents to identify their better-performing partners, thus improving their learning performance and reducing the complexity of behavior prediction by reusing useful knowledge with respect to their partners' mind universes. Experiments based on a partner-opponent minefield navigation task (PO-MNT) have shown that eTL-P exhibits the superiority in achieving higher learning capability and efficiency of multiple agents when compared to the state-of-the-art multiagent TL approaches.

引用

页码：5962 / 5976

页数：15

共 50 条

[41] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
Wang, Tonghao
Peng, Xingguang
Jin, Yaochu
Xu, Demin
MEMETIC COMPUTING, 2022, 14 (01) : 3 - 17
[42] Model Learning and Model-Based Testing
Aichernig, Bernhard K.
Mostowski, Wojciech
Mousavi, Mohammad Reza
Tappler, Martin
Taromirad, Masoumeh
MACHINE LEARNING FOR DYNAMIC SOFTWARE ANALYSIS: POTENTIALS AND LIMITS, 2018, 11026 : 74 - 100
[43] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
Tonghao Wang
Xingguang Peng
Yaochu Jin
Demin Xu
Memetic Computing, 2022, 14 : 3 - 17
[44] Problematic gambling behavior impacts model-based reinforcement learning performance
Brands, Angela Mariele
Mathar, David
Knauth, Kilian
Kuzmanovic, Bojana
Tittgemeyer, Marc
Peters, Jan
JOURNAL OF BEHAVIORAL ADDICTIONS, 2024, 13 : 97 - 98
[45] Model-based evolutionary algorithms: a short survey
Ran Cheng
Cheng He
Yaochu Jin
Xin Yao
Complex & Intelligent Systems, 2018, 4 : 283 - 292
[46] Model-based evolutionary algorithms: a short survey
Cheng, Ran
He, Cheng
Jin, Yaochu
Yao, Xin
COMPLEX & INTELLIGENT SYSTEMS, 2018, 4 (04) : 283 - 292
[47] Model-based architecture for evolutionary intelligent systems
Chi, SD
Lee, JS
TRANSACTIONS OF THE SOCIETY FOR COMPUTER SIMULATION INTERNATIONAL, 2001, 18 (01): : 2 - 8
[48] Using evolutionary algorithms for model-based clustering
Andrews, Jeffrey L.
McNicholas, Paul D.
PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 987 - 992
[49] Model-Based Deep Learning
Shlezinger, Nir
Whang, Jay
Eldar, Yonina C.
Dimakis, Alexandros G.
PROCEEDINGS OF THE IEEE, 2023, 111 (05) : 465 - 499
[50] Model-based machine learning
Bishop, Christopher M.
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2013, 371 (1984):

← 1 2 3 4 5 →