Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms

被引:0
|
作者
de Nijs, Frits [1 ]
Walraven, Erwin [2 ]
de Weerdt, Mathijs M. [2 ]
Spaan, Matthijs T. J. [2 ]
机构
[1] Monash Univ, Fac IT, Dept Data Sci & AI, 20 Exhibit Walk, Clayton, Vic 3168, Australia
[2] Delft Univ Technol, Van Mourik Broekmanweg 6, NL-2628 XE Delft, Netherlands
关键词
OPTIMAL POLICIES; COMPLEXITY; CHAINS; AGENTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.
引用
收藏
页码:955 / 1001
页数:47
相关论文
共 50 条
  • [31] Constrained Markov decision processes with first passage criteria
    Yonghui Huang
    Qingda Wei
    Xianping Guo
    Annals of Operations Research, 2013, 206 : 197 - 219
  • [32] Stochastic approximations of constrained discounted Markov decision processes
    Dufour, Francois
    Prieto-Rumeau, Tomas
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2014, 413 (02) : 856 - 879
  • [33] Constrained Markov decision processes with first passage criteria
    Huang, Yonghui
    Wei, Qingda
    Guo, Xianping
    ANNALS OF OPERATIONS RESEARCH, 2013, 206 (01) : 197 - 219
  • [34] STOCHASTIC DOMINANCE-CONSTRAINED MARKOV DECISION PROCESSES
    Haskell, William B.
    Jain, Rahul
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (01) : 273 - 303
  • [35] Joint chance-constrained Markov decision processes
    V Varagapriya
    Vikas Vikram Singh
    Abdel Lisser
    Annals of Operations Research, 2023, 322 : 1013 - 1035
  • [36] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
  • [37] Constrained Risk-Averse Markov Decision Processes
    Ahmadi, Mohamadreza
    Rosolia, Ugo
    Ingham, Michel D.
    Murray, Richard M.
    Ames, Aaron D.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11718 - 11725
  • [38] Planning using hierarchical constrained Markov decision processes
    Feyzabadi, Seyedshams
    Carpin, Stefano
    AUTONOMOUS ROBOTS, 2017, 41 (08) : 1589 - 1607
  • [39] Semi-Infinitely Constrained Markov Decision Processes
    Zhang, Liangyu
    Peng, Yang
    Yang, Wenhao
    Zhang, Zhihua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [40] Planning using hierarchical constrained Markov decision processes
    Seyedshams Feyzabadi
    Stefano Carpin
    Autonomous Robots, 2017, 41 : 1589 - 1607