Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties

Cited by: 32
Authors
Cai, Mingyu [1 ]
Peng, Hao [2 ]
Li, Zhijun [3 ]
Kan, Zhen [3 ]
Affiliations
[1] Univ Iowa, Dept Mech Engn, Iowa City, IA 52246 USA
[2] ApexAI Inc, Palo Alto, CA 94303 USA
[3] Univ Sci & Technol China, Dept Automat, Hefei 230052, Peoples R China
Keywords
Uncertainty; Probabilistic logic; Task analysis; Planning; Learning automata; Markov processes; Autonomous agents; Linear temporal logic (LTL); Markov decision process (MDP); motion planning; reinforcement learning; MARKOV DECISION-PROCESSES; LOGIC; FRAMEWORK;
DOI
10.1109/TAC.2020.3006967
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This article considers control synthesis of an autonomous agent with linear temporal logic (LTL) specifications subject to environment and motion uncertainties. Specifically, the probabilistic motion of the agent is modeled by a Markov decision process (MDP) with unknown transition probabilities. The operating environment is assumed to be partially known, where the desired LTL specifications might be partially infeasible. A relaxed product MDP is constructed that allows the agent to revise its motion plan without strictly following the desired LTL constraints. A utility function composed of violation cost and state rewards is developed. Rigorous analysis shows that, if there almost surely (i.e., with probability 1) exists a policy that satisfies the relaxed product MDP, any algorithm that optimizes the expected utility is guaranteed to find such a policy. A reinforcement learning-based approach is then developed to generate policies that fulfill the desired LTL specifications as much as possible by optimizing the expected discounted utility of the relaxed product MDP.
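The idea of trading state rewards against a violation cost can be illustrated with a minimal sketch. The code below is not the authors' algorithm or their relaxed product MDP: it is plain tabular Q-learning on a hypothetical five-state model, and every name in it (`NEXT`, `SLIP`, `REWARD`, `VIOLATION`, the weight `beta`) is an assumption made for illustration. The learner only samples transitions, so the slip probability stays unknown to it, and the utility of each step is the state reward minus `beta` times the violation cost.

```python
import random

# Hypothetical toy model (not from the paper): states 0..4, goal is state 4.
# Action 0 heads for the short route through state 1, which violates the task;
# action 1 heads for the longer route 2 -> 3, which satisfies it.
NEXT = {0: (1, 2), 1: (4, 4), 2: (3, 3), 3: (4, 4)}
GOAL, SLIP, GAMMA = 4, 0.1, 0.9
REWARD = {4: 10.0}     # state reward collected on reaching the goal
VIOLATION = {1: 1.0}   # violation cost charged on each step that ends in state 1


def step(state, action, rng):
    """Simulator with motion uncertainty; the learner never reads SLIP."""
    return state if rng.random() < SLIP else NEXT[state][action]


def utility(next_state, beta):
    """State reward minus the weighted violation cost (beta is an assumed weight)."""
    return REWARD.get(next_state, 0.0) - beta * VIOLATION.get(next_state, 0.0)


def q_learning(beta, episodes=5000, alpha=0.1, epsilon=0.2, seed=0):
    """Tabular Q-learning on the expected discounted utility."""
    rng = random.Random(seed)
    q = {s: [0.0, 0.0] for s in NEXT}
    for _ in range(episodes):
        s = 0
        while s != GOAL:
            # Epsilon-greedy exploration over the two actions.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] >= q[s][1] else 1
            s2 = step(s, a, rng)
            future = 0.0 if s2 == GOAL else max(q[s2])
            q[s][a] += alpha * (utility(s2, beta) + GAMMA * future - q[s][a])
            s = s2
    return q


def greedy_action(q, state=0):
    """Action with the highest learned value in the given state."""
    return 0 if q[state][0] >= q[state][1] else 1
```

In this toy setting, `greedy_action(q_learning(beta=0.0))` selects the short, violating route, while a sufficiently large weight such as `beta=5.0` makes the learned policy take the longer route that avoids the violation.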
Pages: 2386-2392
Page count: 7
Related papers
50 in total
  • [21] Learning-based Motion Planning in Dynamic Environments Using GNNs and Temporal Encoding
    Zhang, Ruipeng
    Yu, Chenning
    Chen, Jingkai
    Fan, Chuchu
    Gao, Sicun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [22] A Dataset Generation Tool for Deep learning-based Motion Planning in Complex Environments
    Sarwar, Muhammad Usman
    Sohail, Moman
    Ud Din, Muhayy
    Rosell, Jan
    Qazi, Wajahat M.
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [23] Learning-based Robust Motion Planning With Guaranteed Stability: A Contraction Theory Approach
    Tsukamoto, Hiroyasu
    Chung, Soon-Jo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6164 - 6171
  • [24] Cooperation in the Air A Learning-Based Approach for the Efficient Motion Planning of Aerial Manipulators
    Kim, Hyoin
    Seo, Hoseong
    Son, Clark Youngdong
    Lee, Hyeonbeom
    Kim, Suseong
    Kim, H. Jin
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2018, 25 (04) : 76 - 85
  • [25] Safe Motion Planning for Autonomous Vehicles by Quantifying Uncertainties of Deep Learning-Enabled Environment Perception
    Li, Dachuan
    Liu, Bowen
    Huang, Zijian
    Hao, Qi
    Zhao, Dezong
    Tian, Bin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01) : 2318 - 2332
  • [26] LTL Robot Motion Control based on Automata Learning of Environmental Dynamics
    Chen, Yushan
    Tumova, Jana
    Belta, Calin
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012 : 5177 - 5182
  • [27] An LTL-Based Motion and Action Dynamic Planning Method for Autonomous Robot
    Xu, Ning
    Li, Jie
    Niu, Yifeng
    Shen, Lincheng
    IFAC PAPERSONLINE, 2016, 49 (05) : 91 - 96
  • [28] Motion planning for carlike robots using a probabilistic learning approach
    Utrecht Univ, Utrecht, Netherlands
    Int J Rob Res, 2 (119-143):
  • [29] Receding Horizon Control Based Motion Planning with Partially Infeasible LTL Constrains
    Cai, Mingyu
    Peng, Hao
    Li, Zhijun
    Gao, Hongbo
    Kan, Zhen
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021,
  • [30] Motion planning for carlike robots using a probabilistic learning approach
    Svestka, P
    Overmars, MH
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1997, 16 (02) : 119 - 143