Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties

被引：32

作者：

Cai, Mingyu ^{[1
]}

Peng, Hao ^{[2
]}

Li, Zhijun ^{[3
]}

Kan, Zhen ^{[3
]}

机构：

[1] Univ Iowa, Dept Mech Engn, Iowa City, IA 52246 USA

[2] ApexAI Inc, Palo Alto, CA 94303 USA

[3] Univ Sci & Technol China, Dept Automat, Hefei 230052, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2021年 / 66卷 / 05期

关键词：

Uncertainty; Probabilistic logic; Task analysis; Planning; Learning automata; Markov processes; Autonomous agents; Linear temporal logic (LTL); Markov decision process (MDP); motion planning; reinforcement learning; MARKOV DECISION-PROCESSES; LOGIC; FRAMEWORK;

D O I：

10.1109/TAC.2020.3006967

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article considers control synthesis of an autonomous agent with linear temporal logic (LTL) specifications subject to environment and motion uncertainties. Specifically, the probabilistic motion of the agent is modeled by a Markov decision process (MDP) with unknown transition probabilities. The operating environment is assumed to be partially known, where the desired LTL specifications might be partially infeasible. A relaxed product MDP is constructed that allows the agent to revise its motion plan without strictly following the desired LTL constraints. A utility function composed of violation cost and state rewards is developed. Rigorous analysis shows that, if there almost surely (i.e., with probability 1) exists a policy that satisfies the relaxed product MDP, any algorithm that optimizes the expected utility is guaranteed to find such a policy. A reinforcement learning-based approach is then developed to generate policies that fulfill the desired LTL specifications as much as possible by optimizing the expected discount utility of the relaxed product MDP.

引用

页码：2386 / 2392

页数：7

共 50 条

[21] Learning-based Motion Planning in Dynamic Environments Using GNNs and Temporal Encoding
Zhang, Ruipeng
Yu, Chenning
Chen, Jingkai
Fan, Chuchu
Gao, Sicun
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[22] A Dataset Generation Tool for Deep learning-based Motion Planning in Complex Environments
Sarwar, Muhammad Usman
Sohail, Moman
Ud Din, Muhayy
Rosell, Jan
Qazi, Wajahat M.
2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
[23] Learning-based Robust Motion Planning With Guaranteed Stability: A Contraction Theory Approach
Tsukamoto, Hiroyasu
Chung, Soon-Jo
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6164 - 6171
[24] Cooperation in the Air A Learning-Based Approach for the Efficient Motion Planning of Aerial Manipulators
Kim, Hyoin
Seo, Hoseong
Son, Clark Youngdong
Lee, Hyeonbeom
Kim, Suseong
Kim, H. Jin
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2018, 25 (04) : 76 - 85
[25] Safe Motion Planning for Autonomous Vehicles by Quantifying Uncertainties of Deep Learning-Enabled Environment Perception
Li, Dachuan
Liu, Bowen
Huang, Zijian
Hao, Qi
Zhao, Dezong
Tian, Bin
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2318 - 2332
[26] LTL Robot Motion Control based on Automata Learning of Environmental Dynamics
Chen, Yushan
Tumova, Jana
Belta, Calin
2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 5177 - 5182
[27] An LTL-Based Motion and Action Dynamic Planning Method for Autonomous Robot
Xu, Ning
Li, Jie
Niu, Yifeng
Shen, Lincheng
IFAC PAPERSONLINE, 2016, 49 (05): : 91 - 96
[28] Motion planning for carlike robots using a probabilistic learning approach
Utrecht Univ, Utrecht, Netherlands
Int J Rob Res, 2 (119-143):
[29] Receding Horizon Control Based Motion Planning with Partially Infeasible LTL Constrains
Cai, Mingyu
Peng, Hao
Li, Zhijun
Gao, Hongbo
Kan, Zhen
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021,
[30] Motion planning for carlike robots using a probabilistic learning approach
Svestka, P
Overmars, MH
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1997, 16 (02): : 119 - 143

← 1 2 3 4 5 →