Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

被引:0
|
作者
Lacerda, Bruno [1 ]
Parker, David [1 ]
Hawes, Nick [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.
引用
收藏
页码:1511 / 1516
页数:6
相关论文
共 50 条
  • [41] Planning using hierarchical constrained Markov decision processes
    Seyedshams Feyzabadi
    Stefano Carpin
    Autonomous Robots, 2017, 41 : 1589 - 1607
  • [42] Planning using hierarchical constrained Markov decision processes
    Feyzabadi, Seyedshams
    Carpin, Stefano
    AUTONOMOUS ROBOTS, 2017, 41 (08) : 1589 - 1607
  • [43] Probabilistic Preference Planning Problem for Markov Decision Processes
    Li, Meilun
    Turrini, Andrea
    Hahn, Ernst Moritz
    She, Zhikun
    Zhang, Lijun
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (05) : 1545 - 1559
  • [44] Learning and Planning with Timing Information in Markov Decision Processes
    Bacon, Pierre-Luc
    Balle, Borja
    Precup, Doina
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 111 - 120
  • [45] Approximate planning and verification for large Markov decision processes
    Richard Lassaigne
    Sylvain Peyronnet
    International Journal on Software Tools for Technology Transfer, 2015, 17 : 457 - 467
  • [46] Safe Policy Improvement Approaches on Discrete Markov Decision Processes
    Scholl, Philipp
    Dietrich, Felix
    Otte, Clemens
    Udluft, Steffen
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 142 - 151
  • [47] MONOTONE OPTIMAL POLICIES FOR MARKOV DECISION-PROCESSES
    SERFOZO, RF
    MATHEMATICAL PROGRAMMING STUDY, 1976, 6 (DEC): : 202 - 215
  • [48] Monotone optimal control for a class of Markov decision processes
    Zhuang, Weifen
    Li, Michael Z. F.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 217 (02) : 342 - 350
  • [49] Model-Free Reinforcement Learning for Optimal Control of Markov Decision Processes Under Signal Temporal Logic Specifications
    Kalagarla, Krishna C.
    Jain, Rahul
    Nuzzo, Pierluigi
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2252 - 2257
  • [50] Optimal control in light traffic Markov decision processes
    Ger Koole
    Olaf Passchier
    Mathematical Methods of Operations Research, 1997, 45 : 63 - 79