Decentralized approximate dynamic programming for dynamic networks of agents

被引:6
|
作者
Lakshmanan, Hariharan [1 ]
Pucci de Farias, Daniela [2 ]
机构
[1] MIT, Dept Civil Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ACC.2006.1656455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider control systems consisting of teams of agents operating in stochastic environments and communicating through a network with dynamic topology. An optimal centralized control policy can be derived from the Q-function associated with the problem. However, computing and storing the Q-function is intractable for systems of practical scale, and having a centralized policy may lead to prohibitive requirements on communication between agents. On the other hand, it has been shown that decentralized optimal control is NP-hard even in the case of small systems. Here we propose a general approach for decentralized control based on approximate dynamic programming. We consider approximations to the Q-function via local approximation architectures, which lead to decentralization of the task of choosing control actions and can be computed and stored efficiently. We propose and analyze an approximate dynamic programming approach for fitting the Q-function based on linear programming. We show that error bounds previously developed for cost-to-go function approximation via linear programming can be extended to the case of Q-function approximation. We then consider the problem of decentralizing the task of approximating the Q-function and show that it can be viewed as a resource allocation problem. Motivated by this observation, we propose a decentralized gradient-based algorithm for solving a class of resource allocation problems. Convergence of the algorithm is established and its convergence rate, measured in terms of the number of iterations required for magnitude of the gradient to approach zero, is shown to be O(n(2.5)), where n is the number of agents in the network.
引用
收藏
页码:1648 / +
页数:2
相关论文
共 50 条
  • [1] An approximate dynamic programming approach to decentralized control of stochastic systems
    Cogill, R
    Rotkowitz, M
    Van Roy, B
    Lall, S
    CONTROL OF UNCERTAIN SYSTEMS: MODELLING, APPROXIMATION, AND DESIGN, 2006, 329 : 243 - 256
  • [2] Decentralized Bayesian search using approximate dynamic programming methods
    Zhao, Yijia
    Patek, Stephen D.
    Beling, Peter A.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 970 - 975
  • [3] DECENTRALIZED RESOURCE ALLOCATION IN DYNAMIC NETWORKS OF AGENTS
    Lakshmanan, Hariharan
    De Farias, Daniela Pucci
    SIAM JOURNAL ON OPTIMIZATION, 2008, 19 (02) : 911 - 940
  • [4] Approximate dynamic programming and neural, networks on game hardware
    Meuth, Ryan J.
    Wunsch, DonalI C., II
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 852 - 856
  • [5] Decentralized UAV Swarm Control for Multitarget Tracking using Approximate Dynamic Programming
    Azam, Md Ali
    Dey, Shawon
    Mittelmann, Hans D.
    Ragi, Shankarachary
    2021 IEEE WORLD AI IOT CONGRESS (AIIOT), 2021, : 457 - 461
  • [6] Admission control in UMTS networks based on approximate dynamic programming
    Computer and System Science Department , University of Rome la Sapienza, via Eudossiana 18, 00184 Rome, Italy
    Eur J Control, 2008, 1 (62-75):
  • [7] Approximate dynamic programming for link scheduling in wireless mesh networks
    Papadaki, Katerina
    Friderikos, Vasilis
    COMPUTERS & OPERATIONS RESEARCH, 2008, 35 (12) : 3848 - 3859
  • [8] An approximate dynamic programming approach to admission control in WCDMA networks
    Pietrabissa, Antonio
    Borza, Daniele Anticoli
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, VOLS 1-4, 2006, : 1266 - 1271
  • [9] Resource management in CDMA networks based on approximate dynamic programming
    Papadaki, K
    Friderikos, V
    2005 14TH IEEE WORKSHOP ON LOCAL & METROPOLITAN AREA NETWORKS (LANMAN), 2005, : 148 - 153
  • [10] An Approximate Dynamic Programming Approach to Vehicle Platooning Coordination in Networks
    Xiong, Xi
    Wang, Maonan
    Sun, Dengfeng
    Jin, Li
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16536 - 16547