Planning using hierarchical constrained Markov decision processes

被引:6
|
作者
Feyzabadi, Seyedshams [1 ]
Carpin, Stefano [1 ]
机构
[1] Univ Calif, Sch Engn, 5200 North Lake Rd, Merced, CA 95343 USA
关键词
Constrained Markov decision processes; Planning; Uncertainty;
D O I
10.1007/s10514-017-9630-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constrained Markov decision processes offer a principled method to determine policies for sequential stochastic decision problems where multiple costs are concurrently considered. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. Among the reasons for their limited adoption is their computational complexity, since policy computation requires the solution of constrained linear programs with an extremely large number of variables. To overcome this limitation, we propose a hierarchical method to solve large problem instances. States are clustered into macro states and the parameters defining the dynamic behavior and the costs of the clustered model are determined using a Monte Carlo approach. We show that the algorithm we propose to create clustered states maintains valuable properties of the original model, like the existence of a solution for the problem. Our algorithm is validated in various planning problems in simulation and on a mobile robot platform, and we experimentally show that the clustered approach significantly outperforms the non-hierarchical solution while experiencing only moderate losses in terms of objective functions.
引用
收藏
页码:1589 / 1607
页数:19
相关论文
共 50 条
  • [41] Constrained Risk-Averse Markov Decision Processes
    Ahmadi, Mohamadreza
    Rosolia, Ugo
    Ingham, Michel D.
    Murray, Richard M.
    Ames, Aaron D.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11718 - 11725
  • [42] Semi-Infinitely Constrained Markov Decision Processes
    Zhang, Liangyu
    Peng, Yang
    Yang, Wenhao
    Zhang, Zhihua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [43] Hierarchical algorithms for discounted and weighted Markov decision processes
    Abbad, M
    Daoui, C
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
  • [44] Hierarchical algorithms for discounted and weighted Markov decision processes
    M. Abbad
    C. Daoui
    Mathematical Methods of Operations Research, 2003, 58 : 237 - 245
  • [45] Robust path planning for flexible needle insertion using Markov decision processes
    Xiaoyu Tan
    Pengqian Yu
    Kah-Bin Lim
    Chee-Kong Chui
    International Journal of Computer Assisted Radiology and Surgery, 2018, 13 : 1439 - 1451
  • [46] Robust path planning for flexible needle insertion using Markov decision processes
    Tan, Xiaoyu
    Yu, Pengqian
    Lim, Kah-Bin
    Chui, Chee-Kong
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2018, 13 (09) : 1439 - 1451
  • [47] Hierarchical decision making in semiconductor fabs using multi-time scale Markov decision processes
    Panigrahi, JR
    Bhatnagar, S
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4387 - 4392
  • [48] Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
    de Nijs, Frits
    Walraven, Erwin
    de Weerdt, Mathijs M.
    Spaan, Matthijs T. J.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 955 - 1001
  • [49] Constrained Markov Decision Processes with Total Expected Cost Criteria
    Altman, Eitan
    Boularouk, Said
    Josselin, Didier
    PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 191 - 192
  • [50] Learning algorithms for finite horizon constrained markov decision processes
    Mittal, A.
    Hemachandra, N.
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2007, 3 (03) : 429 - 444