Planning using hierarchical constrained Markov decision processes

被引:6
|
作者
Feyzabadi, Seyedshams [1 ]
Carpin, Stefano [1 ]
机构
[1] Univ Calif, Sch Engn, 5200 North Lake Rd, Merced, CA 95343 USA
关键词
Constrained Markov decision processes; Planning; Uncertainty;
D O I
10.1007/s10514-017-9630-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constrained Markov decision processes offer a principled method to determine policies for sequential stochastic decision problems where multiple costs are concurrently considered. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. Among the reasons for their limited adoption is their computational complexity, since policy computation requires the solution of constrained linear programs with an extremely large number of variables. To overcome this limitation, we propose a hierarchical method to solve large problem instances. States are clustered into macro states and the parameters defining the dynamic behavior and the costs of the clustered model are determined using a Monte Carlo approach. We show that the algorithm we propose to create clustered states maintains valuable properties of the original model, like the existence of a solution for the problem. Our algorithm is validated in various planning problems in simulation and on a mobile robot platform, and we experimentally show that the clustered approach significantly outperforms the non-hierarchical solution while experiencing only moderate losses in terms of objective functions.
引用
收藏
页码:1589 / 1607
页数:19
相关论文
共 50 条
  • [1] Planning using hierarchical constrained Markov decision processes
    Seyedshams Feyzabadi
    Stefano Carpin
    Autonomous Robots, 2017, 41 : 1589 - 1607
  • [2] HCMDP: a Hierarchical Solution to Constrained Markov Decision Processes
    Feyzabadi, Seyedshams
    Carpin, Stefano
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3971 - 3978
  • [3] Online Planning for Large Markov Decision Processes with Hierarchical Decomposition
    Bai, Aijun
    Wu, Feng
    Chen, Xiaoping
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (04)
  • [4] On constrained Markov decision processes
    Department of Econometrics, University of Sydney, Sydney, NSW 2006, Australia
    不详
    Oper Res Lett, 1 (25-28):
  • [5] On constrained Markov decision processes
    Haviv, M
    OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
  • [6] Strategic Planning under Uncertainties via Constrained Markov Decision Processes
    Ding, Xu Chu
    Pinto, Alessandro
    Surana, Amit
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4568 - 4575
  • [7] Global path planning for AUV based on hierarchical Markov decision processes
    Hong, Ye
    Wang, Hong-Jian
    Bian, Xin-Qian
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2008, 20 (09): : 2361 - 2363
  • [8] Learning in Constrained Markov Decision Processes
    Singh, Rahul
    Gupta, Abhishek
    Shroff, Ness B.
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
  • [9] Stability-constrained Markov Decision Processes using MPC
    Zanon, Mario
    Gros, Sebastien
    Palladino, Michele
    AUTOMATICA, 2022, 143
  • [10] Planning with Abstract Markov Decision Processes
    Gopalan, Nakul
    desJardins, Marie
    Littman, Michael L.
    MacGlashan, James
    Squire, Shawn
    Tellex, Stefanie
    Winder, John
    Wong, Lawson L. S.
    TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 480 - 488