Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties

被引:32
|
作者
Cai, Mingyu [1 ]
Peng, Hao [2 ]
Li, Zhijun [3 ]
Kan, Zhen [3 ]
机构
[1] Univ Iowa, Dept Mech Engn, Iowa City, IA 52246 USA
[2] ApexAI Inc, Palo Alto, CA 94303 USA
[3] Univ Sci & Technol China, Dept Automat, Hefei 230052, Peoples R China
关键词
Uncertainty; Probabilistic logic; Task analysis; Planning; Learning automata; Markov processes; Autonomous agents; Linear temporal logic (LTL); Markov decision process (MDP); motion planning; reinforcement learning; MARKOV DECISION-PROCESSES; LOGIC; FRAMEWORK;
D O I
10.1109/TAC.2020.3006967
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers control synthesis of an autonomous agent with linear temporal logic (LTL) specifications subject to environment and motion uncertainties. Specifically, the probabilistic motion of the agent is modeled by a Markov decision process (MDP) with unknown transition probabilities. The operating environment is assumed to be partially known, where the desired LTL specifications might be partially infeasible. A relaxed product MDP is constructed that allows the agent to revise its motion plan without strictly following the desired LTL constraints. A utility function composed of violation cost and state rewards is developed. Rigorous analysis shows that, if there almost surely (i.e., with probability 1) exists a policy that satisfies the relaxed product MDP, any algorithm that optimizes the expected utility is guaranteed to find such a policy. A reinforcement learning-based approach is then developed to generate policies that fulfill the desired LTL specifications as much as possible by optimizing the expected discount utility of the relaxed product MDP.
引用
收藏
页码:2386 / 2392
页数:7
相关论文
共 50 条
  • [31] Visibility-based probabilistic roadmaps for motion planning
    Siméon, T
    Laumond, JP
    Nissoux, C
    ADVANCED ROBOTICS, 2000, 14 (06) : 477 - 493
  • [32] PROMPT: Probabilistic Motion Primitives based Trajectory Planning
    Löw, Tobias
    Bandyopadhyay, Tirthankar
    Williams, Jason
    Borges, Paulo V K.
    Robotics: Science and Systems, 2021,
  • [33] PROMPT: Probabilistic Motion Primitives based Trajectory Planning
    Loew, Tobias
    Bandyopadhyay, Tirthankar
    Williams, Jason
    Borges, Paulo V. K.
    ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [34] Learning-based Axial Video Motion Magnification
    Byung-Ki, Kwon
    Hyun-Bin, Oh
    Jun-Seong, Kim
    Ha, Hyunwoo
    Oh, Tae-Hyun
    COMPUTER VISION - ECCV 2024, PT LIV, 2025, 15112 : 179 - 195
  • [35] Learning-based inverse dynamics of human motion
    Zell, Petrissa
    Rosenhahn, Bodo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 842 - 850
  • [36] Probabilistic Uncertainty Modeling of Obstacle Motion for Robot Motion Planning
    Miura, Jun
    Shirai, Yoshiaki
    Journal of Robotics and Mechatronics, 2002, 14 (04) : 349 - 356
  • [37] Multi-Robot Motion Planning: A Learning-Based Artificial Potential Field Solution
    Zhang, Dengyu
    Zhu, Guobin
    Zhang, Qingrui
    2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 233 - 238
  • [38] Synergistic Task and Motion Planning With Reinforcement Learning-Based Non-Prehensile Actions
    Liu, Gaoyuan
    de Winter, Joris
    Steckelmacher, Denis
    Hota, Roshan Kumar
    Nowe, Ann
    Vanderborght, Bram
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2764 - 2771
  • [39] Reinforcement Learning-based Motion Planning of a Triangular Floating Platform under Environmental Disturbances
    Tziortziotis, Konstantinos
    Vlachos, Kostas
    Blekas, Konstandnos
    2016 24TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2016, : 1014 - 1019
  • [40] Deep learning-based optimization for motion planning of dual-arm assembly robots
    Ying, Kuo-Ching
    Pourhejazy, Pourya
    Cheng, Chen-Yang
    Cai, Zong-Ying
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 160