Scalable Reinforcement Learning for Multiagent Networked Systems

被引:10
|
作者
Qu, Guannan [1 ]
Wierman, Adam [2 ]
Li, Na [3 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] CALTECH, Dept Comp & Math Sci, Pasadena, CA 91125 USA
[3] Harvard Univ, Sch Engn & Appl Sci, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
stochastic systems; networked systems; reinforcement learning; CSMA; COMPLEXITY; AVERAGE;
D O I
10.1287/opre.2021.2226
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We study reinforcement learning (RL) in a setting with a network of agents whose states and actions interact in a local manner where the objective is to find localized policies such that the (discounted) global reward is maximized. A fundamental challenge in this setting is that the state-action space size scales exponentially in the number of agents, rendering the problem intractable for large networks. In this paper, we propose a scalable actor critic (SAC) framework that exploits the network structure and finds a localized policy that is an O(rho(kappa+1))-approximation of a stationary point of the objective for some rho is an element of(0, 1), with complexity that scales with the local state-action space size of the largest kappa-hop neighborhood of the network. We illustrate our model and approach using examples fromwireless communication, epidemics, and traffic.
引用
收藏
页码:3601 / 3628
页数:28
相关论文
共 50 条
  • [1] The dynamics of reinforcement social learning in networked cooperative multiagent systems
    Hao, Jianye
    Huang, Dongping
    Cai, Yi
    Leung, Ho-fung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 111 - 122
  • [2] Networked Reinforcement Social Learning towards Coordination in Cooperative Multiagent Systems
    Hao, Jianye
    Huang, Dongping
    Cai, Yi
    Leung, Ho-fung
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 374 - 378
  • [3] Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward
    Qu, Guannan
    Lin, Yiheng
    Wierman, Adam
    Li, Na
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 256 - 266
  • [5] Coordination in multiagent reinforcement learning systems
    Kamal, MAS
    Murata, J
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 1197 - 1204
  • [6] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [7] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
  • [8] A survey on transfer learning for multiagent reinforcement learning systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703
  • [9] A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 645 - 703
  • [10] Networked Signal and Information Processing: Learning by multiagent systems
    Vlaski, Stefan
    Kar, Soummya
    Sayed, Ali H.
    Moura, Jose M. F.
    IEEE SIGNAL PROCESSING MAGAZINE, 2023, 40 (05) : 92 - 105