Decentralized approximate dynamic programming for dynamic networks of agents

Cited by: 6

Authors:

Lakshmanan, Hariharan [1]

Pucci de Farias, Daniela [2]

Affiliations:

[1] MIT, Dept Civil Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] MIT, Dept Mech Engn, Cambridge, MA 02139 USA

Source:

2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12 | 2006 / Vols. 1-12

DOI:

10.1109/ACC.2006.1656455

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

We consider control systems consisting of teams of agents operating in stochastic environments and communicating through a network with dynamic topology. An optimal centralized control policy can be derived from the Q-function associated with the problem. However, computing and storing the Q-function is intractable for systems of practical scale, and having a centralized policy may lead to prohibitive requirements on communication between agents. On the other hand, it has been shown that decentralized optimal control is NP-hard even in the case of small systems. Here we propose a general approach for decentralized control based on approximate dynamic programming. We consider approximations to the Q-function via local approximation architectures, which lead to decentralization of the task of choosing control actions and can be computed and stored efficiently. We propose and analyze an approximate dynamic programming approach for fitting the Q-function based on linear programming. We show that error bounds previously developed for cost-to-go function approximation via linear programming can be extended to the case of Q-function approximation. We then consider the problem of decentralizing the task of approximating the Q-function and show that it can be viewed as a resource allocation problem. Motivated by this observation, we propose a decentralized gradient-based algorithm for solving a class of resource allocation problems. Convergence of the algorithm is established and its convergence rate, measured in terms of the number of iterations required for the magnitude of the gradient to approach zero, is shown to be O(n^{2.5}), where n is the number of agents in the network.
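The abstract does not spell out the decentralized gradient-based algorithm, so the following is only a minimal sketch of one classic scheme in that family (a neighbor-to-neighbor gradient exchange), not the authors' method. It assumes each agent holds a concave quadratic utility, that the total resource is fixed, and that the network is a static path graph; the function name `neighbor_exchange`, the coefficients `a` and `c`, and the step size are all hypothetical choices for illustration. Nonnegativity of allocations and a changing topology are not handled.

```python
# Hypothetical sketch: NOT the algorithm from the paper, only a generic
# neighbor-exchange gradient scheme for maximizing sum_i f_i(x_i)
# subject to sum_i x_i = constant.

def neighbor_exchange(grad_fns, neighbors, x0, step=0.05, iters=2000):
    """Each agent compares its marginal utility with its neighbors' and
    shifts resource toward whichever side has the higher gradient."""
    x = list(x0)
    n = len(x)
    for _ in range(iters):
        # Local gradients; agent i only needs g[j] for j in neighbors[i].
        g = [grad_fns[i](x[i]) for i in range(n)]
        # Symmetric pairwise transfers, so the total allocation is preserved.
        x = [x[i] + step * sum(g[i] - g[j] for j in neighbors[i])
             for i in range(n)]
    return x

# Assumed utilities f_i(x) = -0.5 * a_i * (x - c_i)^2, so f_i'(x) = -a_i * (x - c_i).
a = [1.0, 2.0, 1.5, 1.0]
c = [3.0, 1.0, 2.0, 4.0]
grad_fns = [lambda xi, ai=ai, ci=ci: -ai * (xi - ci) for ai, ci in zip(a, c)]

# Assumed topology: 4 agents on a path 0 - 1 - 2 - 3 (symmetric neighbor lists).
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}

x0 = [2.0, 2.0, 2.0, 2.0]
x = neighbor_exchange(grad_fns, neighbors, x0)
final_grads = [grad_fns[i](x[i]) for i in range(len(x))]
```

At a fixed point of this update all marginal utilities in a connected network are equal, which is the optimality condition for the equality-constrained problem; the step size must be small relative to the curvature of the utilities and the node degrees for the iteration to converge.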

Pages: 1648 / +

Number of pages: 2

50 related records in total

[41] An Approximate Dynamic Programming Approach to Dynamic Pricing for Network Revenue Management
Ke, Jiannan
Zhang, Dan
Zheng, Huan
PRODUCTION AND OPERATIONS MANAGEMENT, 2019, 28 (11) : 2719 - 2737
[42] Robust Approximate Dynamic Programming and Global Stabilization with Nonlinear Dynamic Uncertainties
Jiang, Yu
Jiang, Zhong-Ping
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 115 - 120
[43] Approximate dynamic programming for dynamic capacity allocation with multiple priority levels
Erdelyi, Alexander
Topaloglu, Huseyin
IIE TRANSACTIONS, 2011, 43 (02) : 129 - 142
[44] Decentralized Robust Optimization for Real-time Dispatch of Power System Based on Approximate Dynamic Programming
Wang, Chenxi
Zhu, Jianquan
Zhu, Tao
2018 INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2018, : 1935 - 1941
[45] Automatic formation of dynamic decentralized networks
Poupyrev, P
Yoshida, S
Kuwabara, K
KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1296 - 1302
[46] Decentralized track fusion in dynamic networks
Nicholson, D
Deaves, R
SIGNAL AND DATA PROCESSING OF SMALL TARGETS 2000, 2000, 4048 : 452 - 460
[47] Dynamic decentralized packet clustering in networks
Merkle, D
Middendorf, M
Scheidler, A
APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2005, 3449 : 574 - 583
[48] Alleviating tuning sensitivity in Approximate Dynamic Programming
Beuchat, Paul
Georghiou, Angelos
Lygeros, John
2016 EUROPEAN CONTROL CONFERENCE (ECC), 2016, : 1616 - 1622
[49] Approximate dynamic programming based on expansive projections
Arruda, Edilson R.
do Val, Joao B. R.
PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5540 - +
[50] ADPTriage: Approximate Dynamic Programming for Bug Triage
Jahanshahi H.
Cevik M.
Mousavi K.
Basar A.
IEEE Transactions on Software Engineering, 2023, 49 (10) : 4594 - 4609