Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties

被引:32
|
作者
Cai, Mingyu [1 ]
Peng, Hao [2 ]
Li, Zhijun [3 ]
Kan, Zhen [3 ]
机构
[1] Univ Iowa, Dept Mech Engn, Iowa City, IA 52246 USA
[2] ApexAI Inc, Palo Alto, CA 94303 USA
[3] Univ Sci & Technol China, Dept Automat, Hefei 230052, Peoples R China
关键词
Uncertainty; Probabilistic logic; Task analysis; Planning; Learning automata; Markov processes; Autonomous agents; Linear temporal logic (LTL); Markov decision process (MDP); motion planning; reinforcement learning; MARKOV DECISION-PROCESSES; LOGIC; FRAMEWORK;
D O I
10.1109/TAC.2020.3006967
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers control synthesis of an autonomous agent with linear temporal logic (LTL) specifications subject to environment and motion uncertainties. Specifically, the probabilistic motion of the agent is modeled by a Markov decision process (MDP) with unknown transition probabilities. The operating environment is assumed to be partially known, where the desired LTL specifications might be partially infeasible. A relaxed product MDP is constructed that allows the agent to revise its motion plan without strictly following the desired LTL constraints. A utility function composed of violation cost and state rewards is developed. Rigorous analysis shows that, if there almost surely (i.e., with probability 1) exists a policy that satisfies the relaxed product MDP, any algorithm that optimizes the expected utility is guaranteed to find such a policy. A reinforcement learning-based approach is then developed to generate policies that fulfill the desired LTL specifications as much as possible by optimizing the expected discount utility of the relaxed product MDP.
引用
收藏
页码:2386 / 2392
页数:7
相关论文
共 50 条
  • [41] Learning-Based Near-Optimal Motion Planning for Intelligent Vehicles With Uncertain Dynamics
    Lu, Yang
    Zhang, Xinglong
    Xu, Xin
    Yao, Weijia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1532 - 1539
  • [42] Control of Probabilistic Diffusion in Motion Planning
    Dalibard, Sebastien
    Laumond, Jean-Paul
    ALGORITHMIC FOUNDATIONS OF ROBOTICS VIII, 2010, 57 : 467 - 481
  • [43] Probabilistic motion planning for parallel mechanisms
    Cortés, J
    Siméon, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, : 4354 - 4359
  • [44] LTL-Based Planning in Environments With Probabilistic Observations
    Kloetzer, Marius
    Mahulea, Cristian
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, 12 (04) : 1407 - 1420
  • [45] Receding Horizon Control-Based Motion Planning With Partially Infeasible LTL Constraints
    Cai, Mingyu
    Peng, Hao
    Li, Zhijun
    Gao, Hongbo
    Kan, Zhen
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (04): : 1279 - 1284
  • [46] GPU-based Motion Planning under Uncertainties using POMDP
    Lee, Taekhee
    Kim, Young J.
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4576 - 4581
  • [47] Learning-Based Safety-Critical Motion Planning with Input-to-State Barrier Certificate
    Jin, Xinze
    Jia, Qing-Shan
    Zhang, Tao
    Xia, Huaxia
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1967 - 1972
  • [48] Distributionally Robust Risk Map for Learning-Based Motion Planning and Control: A Semidefinite Programming Approach
    Hakobyan, Astghik
    Yang, Insoon
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (01) : 718 - 737
  • [49] Adaptive Hybrid Optimization Learning-Based Accurate Motion Planning of Multi-Joint Arm
    Bai, Chengchao
    Zhang, Jiawei
    Guo, Jifeng
    Yue, C. Patrick
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 5440 - 5451
  • [50] Deep Learning-Based NMPC for Local Motion Planning of Last-Mile Delivery Robot
    Imad, Muhammad
    Doukhi, Oualid
    Lee, Deok Jin
    Kim, Ji Chul
    Kim, Yeong Jae
    SENSORS, 2022, 22 (21)