Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition

被引:1
|
作者
Liu, Xuan [1 ]
Fu, Jie [1 ]
机构
[1] Worcester Polytech Inst, Dept Elect & Comp Engn, Robot Engn Program, Worcester, MA 01609 USA
来源
2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年
基金
美国国家科学基金会;
关键词
FRAMEWORK; LTL;
D O I
10.23919/acc.2019.8814646
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In hierarchical planning for Markov decision processes (MDPs), temporal abstraction allows planning with macro-actions that take place at different time scale in form of sequential composition. In this paper, we propose a novel approach to compositional reasoning and hierarchical planning for MDPs under co-safe temporal logic constraints. In addition to sequential composition, we introduce a composition of policies based on generalized logic composition: Given sub-policies for sub-tasks and a new task expressed as logic compositions of subtasks, a semi-optimal policy, which is optimal in planning with only sub-policies, can be obtained by simply composing sub-polices. Thus, a synthesis algorithm is developed to compute optimal policies efficiently by planning with primitive actions, policies for sub-tasks, and the compositions of sub-policies, for maximizing the probability of satisfying constraints specified in the fragment of co-safe temporal logic. We demonstrate the correctness and efficiency of the proposed method in stochastic planning examples with a single agent and multiple task specifications.
引用
收藏
页码:559 / 566
页数:8
相关论文
共 50 条
  • [21] CEGAR for compositional analysis of qualitative properties in Markov decision processes
    Chatterjee, Krishnendu
    Chmelik, Martin
    Daca, Przemyslaw
    FORMAL METHODS IN SYSTEM DESIGN, 2015, 47 (02) : 230 - 264
  • [22] A Lazy Abstraction Algorithm for Markov Decision Processes Theory and Initial Evaluation
    Szekeres, Daniel
    Marussy, Kristof
    Majzik, Istvan
    ANALYTICAL AND STOCHASTIC MODELLING TECHNIQUES AND APPLICATIONS, ASMTA 2024, 2025, 14826 : 81 - 96
  • [23] A Generalized Reduced Linear Program for Markov Decision Processes
    Lakshminarayanan, Chandrashekar
    Bhatnagar, Shalabh
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2722 - 2728
  • [24] GENERALIZED SEMI-MARKOV DECISION-PROCESSES
    DOSHI, BT
    JOURNAL OF APPLIED PROBABILITY, 1979, 16 (03) : 618 - 630
  • [25] Optimal Control of Discounted-Reward Markov Decision Processes Under Linear Temporal Logic Specifications
    Kalagarla, Krishna C.
    Jain, Rahul
    Nuzzo, Pierluigi
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1268 - 1274
  • [26] Multiagent, Multitarget Path Planning in Markov Decision Processes
    Nawaz, Farhad
    Ornik, Melkior
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7560 - 7574
  • [27] Approximate planning and verification for large Markov decision processes
    Lassaigne, Richard
    Peyronnet, Sylvain
    INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER, 2015, 17 (04) : 457 - 467
  • [28] Oblivious Markov Decision Processes: Planning and Policy Execution
    Alsayegh, Murtadha
    Fuentes, Jose
    Bobadilla, Leonardo
    Shell, Dylan A.
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3850 - 3857
  • [29] From Dissipativity Theory to Compositional Construction of Finite Markov Decision Processes
    Lavaei, Abolfazl
    Soudjani, Sadegh
    Zamani, Majid
    HSCC 2018: PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS WEEK), 2018, : 21 - 30
  • [30] Planning using hierarchical constrained Markov decision processes
    Seyedshams Feyzabadi
    Stefano Carpin
    Autonomous Robots, 2017, 41 : 1589 - 1607