Planning using hierarchical constrained Markov decision processes

被引:6
|
作者
Feyzabadi, Seyedshams [1 ]
Carpin, Stefano [1 ]
机构
[1] Univ Calif, Sch Engn, 5200 North Lake Rd, Merced, CA 95343 USA
关键词
Constrained Markov decision processes; Planning; Uncertainty;
D O I
10.1007/s10514-017-9630-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constrained Markov decision processes offer a principled method to determine policies for sequential stochastic decision problems where multiple costs are concurrently considered. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. Among the reasons for their limited adoption is their computational complexity, since policy computation requires the solution of constrained linear programs with an extremely large number of variables. To overcome this limitation, we propose a hierarchical method to solve large problem instances. States are clustered into macro states and the parameters defining the dynamic behavior and the costs of the clustered model are determined using a Monte Carlo approach. We show that the algorithm we propose to create clustered states maintains valuable properties of the original model, like the existence of a solution for the problem. Our algorithm is validated in various planning problems in simulation and on a mobile robot platform, and we experimentally show that the clustered approach significantly outperforms the non-hierarchical solution while experiencing only moderate losses in terms of objective functions.
引用
收藏
页码:1589 / 1607
页数:19
相关论文
共 50 条
  • [31] Joint chance-constrained Markov decision processes
    Varagapriya, V.
    Singh, Vikas Vikram
    Lisser, Abdel
    ANNALS OF OPERATIONS RESEARCH, 2023, 322 (02) : 1013 - 1035
  • [32] Strict-sense constrained Markov decision processes
    Hsu, SP
    Arapostathis, A
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 194 - 199
  • [33] Constrained discounted Markov decision processes and Hamiltonian cycles
    Feinberg, EA
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2821 - 2826
  • [34] Constrained discounted Markov decision processes and Hamiltonian Cycles
    Feinberg, EA
    MATHEMATICS OF OPERATIONS RESEARCH, 2000, 25 (01) : 130 - 140
  • [35] Constrained Markov decision processes with first passage criteria
    Yonghui Huang
    Qingda Wei
    Xianping Guo
    Annals of Operations Research, 2013, 206 : 197 - 219
  • [36] Stochastic approximations of constrained discounted Markov decision processes
    Dufour, Francois
    Prieto-Rumeau, Tomas
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2014, 413 (02) : 856 - 879
  • [37] Constrained Markov decision processes with first passage criteria
    Huang, Yonghui
    Wei, Qingda
    Guo, Xianping
    ANNALS OF OPERATIONS RESEARCH, 2013, 206 (01) : 197 - 219
  • [38] STOCHASTIC DOMINANCE-CONSTRAINED MARKOV DECISION PROCESSES
    Haskell, William B.
    Jain, Rahul
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (01) : 273 - 303
  • [39] Joint chance-constrained Markov decision processes
    V Varagapriya
    Vikas Vikram Singh
    Abdel Lisser
    Annals of Operations Research, 2023, 322 : 1013 - 1035
  • [40] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244