A Dynamic Programming Algorithm for Decentralized Markov Decision Processes with a Broadcast Structure

被引:13
|
作者
Wu, Jeff [1 ]
Lall, Sanjay [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
关键词
COMPLEXITY;
D O I
10.1109/CDC.2010.5718187
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast information structure that consists of a central node that only has access to its own state but can affect several outer nodes, while each outer node has access to both its own state and the central node's state, but cannot affect the other nodes. The solution to this problem involves a dynamic program similar to that of a centralized partially-observed Markov decision process.
引用
收藏
页码:6143 / 6148
页数:6
相关论文
共 50 条
  • [31] Dynamic Regret of Online Markov Decision Processes
    Zhao, Peng
    Li, Long-Fei
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Dynamic Watermarking for Finite Markov Decision Processes
    Tang, Jiacheng
    Song, Jiguo
    Gupta, Abhishek
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2025, 4 : 41 - 52
  • [33] DYNAMIC-PROGRAMMING, LINEAR-PROGRAMMING, AND MARKOV-PROCESSES
    HOWARD, RA
    OPERATIONS RESEARCH, 1961, 9 : B34 - B35
  • [34] DYNAMIC PROGRAMMING AND MARKOV PROCESSES - GERMAN - HOWARD,RA
    DINKELBACH, W
    ZEITSCHRIFT FUR BETRIEBSWIRTSCHAFT, 1969, 39 (11): : 758 - 759
  • [35] UAV Formation Shape Control via Decentralized Markov Decision Processes
    Azam, Md Ali
    Mittelmann, Hans D.
    Ragi, Shankarachary
    ALGORITHMS, 2021, 14 (03)
  • [36] Planning in Discrete and Continuous Markov Decision Processes by Probabilistic Programming
    Nitti, Davide
    Belle, Vaishak
    de Raedt, Luc
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II, 2015, 9285 : 327 - 342
  • [37] LINEAR-PROGRAMMING FORMULATIONS OF MARKOV DECISION-PROCESSES
    NAZARETH, JL
    KULKARNI, RB
    OPERATIONS RESEARCH LETTERS, 1986, 5 (01) : 13 - 16
  • [38] Multilinear and Integer Programming for Markov Decision Processes with Imprecise Probabilities
    Shirota Filho, Ricardo
    Cozman, Fabio Gagliardi
    Trevizan, Felipe Werndl
    de Campos, Cassio Polpo
    de Barros, Leliane Nunes
    ISIPTA 07-PROCEEDINGS OF THE FIFTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY:THEORIES AND APPLICATIONS, 2007, : 395 - +
  • [39] Using Linear Programming for Bayesian Exploration in Markov Decision Processes
    Castro, Pablo Samuel
    Precup, Doina
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2436 - 2441
  • [40] Risk-averse dynamic programming for Markov decision processes (vol 125, pg 235, 2010)
    Ruszczynski, Andrzej
    MATHEMATICAL PROGRAMMING, 2014, 145 (1-2) : 601 - 604