Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

被引:0
|
作者
Lacerda, Bruno [1 ]
Parker, David [1 ]
Hawes, Nick [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.
引用
收藏
页码:1511 / 1516
页数:6
相关论文
共 50 条
  • [31] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    International Journal of Automation and Computing, 2021, 18 (03) : 410 - 421
  • [32] IDENTIFICATION OF OPTIMAL POLICIES IN MARKOV DECISION PROCESSES
    Sladky, Karel
    KYBERNETIKA, 2010, 46 (03) : 558 - 570
  • [33] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    International Journal of Automation and Computing, 2021, 18 : 410 - 421
  • [34] Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications
    Wolff, Eric M.
    Topcu, Ufuk
    Murray, Richard M.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3372 - 3379
  • [35] Dynamic programming in constrained Markov decision processes
    Piunovskiy, A. B.
    CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
  • [36] Dynamic Regret of Online Markov Decision Processes
    Zhao, Peng
    Li, Long-Fei
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [37] Dynamic Watermarking for Finite Markov Decision Processes
    Tang, Jiacheng
    Song, Jiguo
    Gupta, Abhishek
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2025, 4 : 41 - 52
  • [38] Multiagent, Multitarget Path Planning in Markov Decision Processes
    Nawaz, Farhad
    Ornik, Melkior
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7560 - 7574
  • [39] Approximate planning and verification for large Markov decision processes
    Lassaigne, Richard
    Peyronnet, Sylvain
    INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER, 2015, 17 (04) : 457 - 467
  • [40] Oblivious Markov Decision Processes: Planning and Policy Execution
    Alsayegh, Murtadha
    Fuentes, Jose
    Bobadilla, Leonardo
    Shell, Dylan A.
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3850 - 3857