Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition

被引:1
|
作者
Liu, Xuan [1 ]
Fu, Jie [1 ]
机构
[1] Worcester Polytech Inst, Dept Elect & Comp Engn, Robot Engn Program, Worcester, MA 01609 USA
基金
美国国家科学基金会;
关键词
FRAMEWORK; LTL;
D O I
10.23919/acc.2019.8814646
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In hierarchical planning for Markov decision processes (MDPs), temporal abstraction allows planning with macro-actions that take place at different time scale in form of sequential composition. In this paper, we propose a novel approach to compositional reasoning and hierarchical planning for MDPs under co-safe temporal logic constraints. In addition to sequential composition, we introduce a composition of policies based on generalized logic composition: Given sub-policies for sub-tasks and a new task expressed as logic compositions of subtasks, a semi-optimal policy, which is optimal in planning with only sub-policies, can be obtained by simply composing sub-polices. Thus, a synthesis algorithm is developed to compute optimal policies efficiently by planning with primitive actions, policies for sub-tasks, and the compositions of sub-policies, for maximizing the probability of satisfying constraints specified in the fragment of co-safe temporal logic. We demonstrate the correctness and efficiency of the proposed method in stochastic planning examples with a single agent and multiple task specifications.
引用
收藏
页码:559 / 566
页数:8
相关论文
共 50 条
  • [1] Optimal Control of Markov Decision Processes With Linear Temporal Logic Constraints
    Ding, Xuchu
    Smith, Stephen L.
    Belta, Calin
    Rus, Daniela
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (05) : 1244 - 1257
  • [2] Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications
    Wolff, Eric M.
    Topcu, Ufuk
    Murray, Richard M.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3372 - 3379
  • [3] Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints
    Savas, Yagiz
    Ornik, Melkior
    Cubuktepe, Murat
    Karabag, Mustafa O.
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (04) : 1552 - 1567
  • [4] Compositional reasoning for weighted Markov decision processes
    Deng, Yuxin
    Hennessy, Matthew
    SCIENCE OF COMPUTER PROGRAMMING, 2013, 78 (12) : 2537 - 2579
  • [5] Temporal logic control of general Markov decision processes by approximate policy refinement
    Haesaert, Sofie
    Soudjani, Sadegh
    Abate, Alessandro
    IFAC PAPERSONLINE, 2018, 51 (16): : 73 - 78
  • [6] Magnifying-lens abstraction for Markov decision processes
    de Alfaro, Luca
    Roy, Pritam
    COMPUTER AIDED VERIFICATION, PROCEEDINGS, 2007, 4590 : 325 - +
  • [7] Symbolic Magnifying Lens Abstraction in Markov Decision Processes
    Roy, Pritam
    Parker, David
    Norman, Gethin
    de Alfaro, Luca
    QUANTITATIVE EVALUATION OF SYSTEMS: QEST 2008, PROCEEDINGS, 2008, : 103 - +
  • [8] Continuous Simulation Abstraction Refinement for Markov Decision Processes
    Guo, Xu
    Yang, Zongyuan
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 779 - 784
  • [9] Game-based abstraction for Markov decision processes
    Kwiatkowska, Marta
    Norman, Gethin
    Parker, David
    QEST 2006: THIRD INTERNATIONAL CONFERENCE ON THE QUANTITATIVE EVALUATION OF SYSTEMS, 2006, : 157 - +
  • [10] Temporal concatenation for Markov decision processes
    Song, Ruiyang
    Xu, Kuang
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2022, 36 (04) : 999 - 1026