Differential Advising in Multiagent Reinforcement Learning

被引:18
|
作者
Ye, Dayong [1 ,2 ]
Zhu, Tianqing [4 ]
Cheng, Zishuo [1 ,2 ]
Zhou, Wanlei [3 ]
Yu, Philip S. [5 ]
机构
[1] Univ Technol Sydney, Ctr Cyber Secur & Privacy, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia
[3] City Univ Macau, Macau, Peoples R China
[4] China Univ Geosci, Sch Comp Sci, Wuhan 430000, Peoples R China
[5] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Robots; Differential privacy; Privacy; Reinforcement learning; Task analysis; Sensitivity; Computer science; Agent advising; differential privacy; multiagent reinforcement learning (MARL);
D O I
10.1109/TCYB.2020.3034424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Agent advising is one of the main approaches to improve agent learning performance by enabling agents to share advice. Existing advising methods have a common limitation that an adviser agent can offer advice to an advisee agent only if the advice is created in the same state as the advisee's state. However, in complex environments, it is a very strong requirement that two states are the same, because a state may consist of multiple dimensions and two states being the same means that all these dimensions in the two states are correspondingly identical. Therefore, this requirement may limit the applicability of existing advising methods to complex environments. In this article, inspired by the differential privacy scheme, we propose a differential advising method that relaxes this requirement by enabling agents to use advice in a state even if the advice is created in a slightly different state. Compared with the existing methods, agents using the proposed method have more opportunity to take advice from others. This article is the first to adopt the concept of differential privacy on advising to improve agent learning performance instead of addressing security issues. The experimental results demonstrate that the proposed method is more efficient in complex environments than the existing methods.
引用
收藏
页码:5508 / 5521
页数:14
相关论文
共 50 条
  • [21] A comprehensive survey of multiagent reinforcement learning
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
  • [22] Multiagent Reinforcement Learning in Traffic and Transportation
    Bazzan, Ana
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN VEHICLES AND TRANSPORTATION SYSTEMS (CIVTS), 2014, : VII - VII
  • [23] WToE: Learning When to Explore in Multiagent Reinforcement Learning
    Dong, Shaokang
    Mao, Hangyu
    Yang, Shangdong
    Zhu, Shengyu
    Li, Wenbin
    Hao, Jianye
    Gao, Yang
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (08) : 4789 - 4801
  • [24] A survey on transfer learning for multiagent reinforcement learning systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703
  • [25] An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
    Yang, Tianpei
    Wang, Weixun
    Tang, Hongyao
    Hao, Jianye
    Meng, Zhaopeng
    Mao, Hangyu
    Li, Dong
    Liu, Wulong
    Zhang, Chengwei
    Hu, Yujing
    Chen, Yingfeng
    Fan, Changjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [26] Accelerating Multiagent Reinforcement Learning through Transfer Learning
    da Silva, Felipe Leno
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5034 - 5035
  • [27] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
    Dai, Pengcheng
    Liu, Hongzhe
    Yu, Wenwu
    Wang, He
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060
  • [28] A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 645 - 703
  • [29] Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
    Kamal, M.
    Murata, Junichi
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2007, 11 (03) : 181 - 191
  • [30] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
    Hao, Jianye
    Leung, Ho-Fung
    Ming, Zhong
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)