A Study on Cooperative Action Selection Considering Unfairness in Decentralized Multiagent Reinforcement Learning

被引:0
|
作者
Matsui, Toshihiro [1 ]
Matsuo, Hiroshi [1 ]
机构
[1] Nagoya Inst Technol, Showa Ku, Gokisyo Cho, Nagoya, Aichi 4668555, Japan
关键词
Multiagent System; Reinforcement Learning; Distributed Constraint Optimization; Unfairness; Leximin;
D O I
10.5220/0006203800880095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has been studied for cooperative learning and optimization methods in multiagent systems. In several frameworks of multiagent reinforcement learning, the system's whole problem is decomposed into local problems for agents. To choose an appropriate cooperative action, the agents perform an optimization method that can be performed in a distributed manner. While the conventional goal of the learning is the maximization of the total rewards among agents, in practical resource allocation problems, unfairness among agents is critical. In several recent studies of decentralized optimization methods, unfairness was considered a criterion. We address an action selection method based on leximin criteria, which reduces the unfairness among agents, in decentralized reinforcement learning. We experimentally evaluated the effects and influences of the proposed approach on classes of sensor network problems.
引用
收藏
页码:88 / 95
页数:8
相关论文
共 50 条
  • [21] CTDS: Centralized Teacher With Decentralized Student for Multiagent Reinforcement Learning
    Zhao, Jian
    Hu, Xunhan
    Yang, Mingyu
    Zhou, Wengang
    Zhu, Jiangcheng
    Li, Houqiang
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (01) : 140 - 150
  • [22] Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous
    Wang, Rose E.
    Kew, J. Chase
    Lee, Dennis
    Lee, Tsang-Wei Edward
    Zhang, Tingnan
    Ichter, Brian
    Tan, Jie
    Faust, Aleksandra
    CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 711 - 725
  • [23] Cooperative Multiagent Reinforcement Learning Using Factor Graphs
    Zhang, Zhen
    Zhao, Dongbin
    PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 797 - 802
  • [24] Decentralized Cooperative Control of Multiple Energy Storage Systems in Urban Railway Based on Multiagent Deep Reinforcement Learning
    Zhu, Feiqin
    Yang, Zhongping
    Lin, Fei
    Xin, Yue
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2020, 35 (09) : 9368 - 9379
  • [25] Cooperative Multiagent Reinforcement Learning Coupled With A* Search for Ship Multicabin Equipment Layout Considering Pipe Route
    Zhang, Qiaoyu
    Lin, Yan
    JOURNAL OF SHIP PRODUCTION AND DESIGN, 2024, 40 (04): : 218 - 235
  • [26] V-Learning-A Simple, Efficient, Decentralized Algorithm for Multiagent Reinforcement Learning
    Jin, Chi
    Liu, Qinghua
    Wang, Yuanhao
    Yu, Tiancheng
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2295 - 2322
  • [27] Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure
    Kao, Hsu
    Wei, Chen-Yu
    Subramanian, Vijay
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [28] Decentralized Scheduling for Cooperative Localization With Deep Reinforcement Learning
    Peng, Bile
    Seco-Granados, Gonzalo
    Steinmetz, Erik
    Frohle, Markus
    Wymeersch, Henk
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4295 - 4305
  • [29] Cooperative channel assignment for VANETs based on multiagent reinforcement learning
    Wang, Yun-peng
    Zheng, Kun-xian
    Tian, Da-xin
    Duan, Xu-ting
    Zhou, Jian-shan
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (07) : 1047 - 1058
  • [30] The dynamics of reinforcement social learning in networked cooperative multiagent systems
    Hao, Jianye
    Huang, Dongping
    Cai, Yi
    Leung, Ho-fung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 111 - 122