A Dynamic Programming Algorithm for Decentralized Markov Decision Processes with a Broadcast Structure

被引:13
|
作者
Wu, Jeff [1 ]
Lall, Sanjay [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
关键词
COMPLEXITY;
D O I
10.1109/CDC.2010.5718187
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast information structure that consists of a central node that only has access to its own state but can affect several outer nodes, while each outer node has access to both its own state and the central node's state, but cannot affect the other nodes. The solution to this problem involves a dynamic program similar to that of a centralized partially-observed Markov decision process.
引用
收藏
页码:6143 / 6148
页数:6
相关论文
共 50 条
  • [1] Dynamic programming in constrained Markov decision processes
    Piunovskiy, A. B.
    CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
  • [2] Risk-averse dynamic programming for Markov decision processes
    Ruszczynski, Andrzej
    MATHEMATICAL PROGRAMMING, 2010, 125 (02) : 235 - 261
  • [3] Risk-averse dynamic programming for Markov decision processes
    Andrzej Ruszczyński
    Mathematical Programming, 2010, 125 : 235 - 261
  • [4] Time aggregated Markov decision processes via standard dynamic programming
    Arruda, Edilson F.
    Fragoso, Marcelo D.
    OPERATIONS RESEARCH LETTERS, 2011, 39 (03) : 193 - 197
  • [5] Standard Dynamic Programming Applied to Time Aggregated Markov Decision Processes
    Arruda, Edilson F.
    Fragoso, Marcelo D.
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 2576 - 2580
  • [6] On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
    Hau, Jia Lin
    Delage, Erick
    Ghavamzadeh, Mohammad
    Petrik, Marek
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Erratum to: Risk-averse dynamic programming for Markov decision processes
    Andrzej Ruszczyński
    Mathematical Programming, 2014, 145 : 601 - 604
  • [8] Relaxation for Constrained Decentralized Markov Decision Processes
    Xu, Jie
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1313 - 1314
  • [9] The complexity of decentralized control of Markov decision processes
    Bernstein, DS
    Givan, R
    Immerman, N
    Zilberstein, S
    MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) : 819 - 840
  • [10] Quantitative Programming and Markov Decision Processes
    Todoran, Eneia Nicolae
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 117 - 124