Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

被引:0
|
作者
Lacerda, Bruno [1 ]
Parker, David [1 ]
Hawes, Nick [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.
引用
收藏
页码:1511 / 1516
页数:6
相关论文
共 50 条
  • [21] Optimal Control of Discounted-Reward Markov Decision Processes Under Linear Temporal Logic Specifications
    Kalagarla, Krishna C.
    Jain, Rahul
    Nuzzo, Pierluigi
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1268 - 1274
  • [22] A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
    Michael Kearns
    Yishay Mansour
    Andrew Y. Ng
    Machine Learning, 2002, 49 : 193 - 208
  • [23] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    Kearns, M
    Mansour, Y
    Ng, AY
    MACHINE LEARNING, 2002, 49 (2-3) : 193 - 208
  • [24] Approximately-Optimal Queries for Planning in Reward-Uncertain Markov Decision Processes
    Zhang, Shun
    Durfee, Edmund
    Singh, Satinder
    TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 339 - 347
  • [25] Optimal decisions for continuous time Markov decision processes over finite planning horizons
    Buchholz, Peter
    Dohndorf, Iryna
    Scheftelowitsch, Dimitri
    COMPUTERS & OPERATIONS RESEARCH, 2017, 77 : 267 - 278
  • [26] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    Kearns, M
    Mansour, Y
    Ng, AY
    IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 1324 - 1331
  • [27] On Querying for Safe Optimality in Factored Markov Decision Processes
    Zhang, Shun
    Durfee, Edmund H.
    Singh, Satinder
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 2168 - 2170
  • [28] Finding Safe Zones of Markov Decision Processes Policies
    Cohen, Lee
    Mansour, Yishay
    Moshkovitz, Michal
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] Optimal Policies for Quantum Markov Decision Processes
    Ying, Ming-Sheng
    Feng, Yuan
    Ying, Sheng-Gang
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (03) : 410 - 421
  • [30] Optimal adaptive policies for Markov decision processes
    Burnetas, AN
    Katehakis, MN
    MATHEMATICS OF OPERATIONS RESEARCH, 1997, 22 (01) : 222 - 255