Decentralized approximate dynamic programming for dynamic networks of agents

被引:6
|
作者
Lakshmanan, Hariharan [1 ]
Pucci de Farias, Daniela [2 ]
机构
[1] MIT, Dept Civil Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ACC.2006.1656455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider control systems consisting of teams of agents operating in stochastic environments and communicating through a network with dynamic topology. An optimal centralized control policy can be derived from the Q-function associated with the problem. However, computing and storing the Q-function is intractable for systems of practical scale, and having a centralized policy may lead to prohibitive requirements on communication between agents. On the other hand, it has been shown that decentralized optimal control is NP-hard even in the case of small systems. Here we propose a general approach for decentralized control based on approximate dynamic programming. We consider approximations to the Q-function via local approximation architectures, which lead to decentralization of the task of choosing control actions and can be computed and stored efficiently. We propose and analyze an approximate dynamic programming approach for fitting the Q-function based on linear programming. We show that error bounds previously developed for cost-to-go function approximation via linear programming can be extended to the case of Q-function approximation. We then consider the problem of decentralizing the task of approximating the Q-function and show that it can be viewed as a resource allocation problem. Motivated by this observation, we propose a decentralized gradient-based algorithm for solving a class of resource allocation problems. Convergence of the algorithm is established and its convergence rate, measured in terms of the number of iterations required for magnitude of the gradient to approach zero, is shown to be O(n(2.5)), where n is the number of agents in the network.
引用
收藏
页码:1648 / +
页数:2
相关论文
共 50 条
  • [31] On approximate dynamic programming in switching systems
    Rantzer, Anders
    2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 1391 - 1396
  • [32] Approximate Dynamic Programming for Ambulance Redeployment
    Maxwell, Matthew S.
    Restrepo, Mateo
    Henderson, Shane G.
    Topaloglu, Huseyin
    INFORMS JOURNAL ON COMPUTING, 2010, 22 (02) : 266 - 281
  • [33] Stochastic Transactive Control for Electric Vehicle Aggregators Coordination: A Decentralized Approximate Dynamic Programming Approach
    Pan, Zhenning
    Yu, Tao
    Li, Jie
    Qu, Kaiping
    Chen, Lvpeng
    Yang, Bo
    Guo, Wenxin
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4261 - 4277
  • [34] Dynamic Site Layout Planning Using Approximate Dynamic Programming
    El-Rayes, Khaled
    Said, Hisham
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2009, 23 (02) : 119 - 127
  • [35] An approximate dynamic programming approach to solving dynamic oligopoly models
    Farias, Vivek
    Saure, Denis
    Weintraub, Gabriel Y.
    RAND JOURNAL OF ECONOMICS, 2012, 43 (02): : 253 - 282
  • [36] An Approximate Dynamic Programming Approach to the Dynamic Traveling Repairperson Problem
    Shin, Hyung Sik
    Lall, Sanjay
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2286 - 2291
  • [37] Approximate Dynamic Programming via Sum of Squares Programming
    Summers, Tyler H.
    Kunz, Konstantin
    Kariotoglou, Nikolaos
    Kamgarpour, Maryam
    Summers, Sean
    Lygeros, John
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 191 - 197
  • [38] Decentralized PMU Placements in a Dynamic Programming Approach
    Guo, Xian-Chang
    Liao, Chung-Shou
    Chu, Chia-Chi
    2019 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2019,
  • [39] Performance comparison of approximate dynamic programming techniques for dynamic stochastic scheduling
    Gocgun, Yasin
    INTERNATIONAL JOURNAL OF OPTIMIZATION AND CONTROL-THEORIES & APPLICATIONS-IJOCTA, 2021, 11 (02): : 178 - 185
  • [40] Decentralized Waveform Co-design for Integrated Sensing and Communications systems via Approximate Dynamic Programming
    Doly, Shammi A.
    Chiriyath, Alex R.
    Herschfelt, Andrew
    Azam, Md Ali
    Ragi, Shankarachary
    Bliss, Daniel W.
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 831 - 834