Verifying Learning-Based Robotic Navigation Systems

被引：6

作者：

Amir, Guy ^{[1
]}

Corsi, Davide ^{[2
]}

Yerushalmi, Raz ^{[1
,3
]}

Marzari, Luca ^{[2
]}

Harel, David ^{[3
]}

Farinelli, Alessandro ^{[2
]}

Katz, Guy ^{[1
]}

机构：

[1] Hebrew Univ Jerusalem, Jerusalem, Israel

[2] Univ Verona, Verona, Italy

[3] Weizmann Inst Sci, Rehovot, Israel

来源：

TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023 | 2023年 / 13993卷

基金：

以色列科学基金会;

关键词：

REINFORCEMENT; VERIFICATION;

D O I：

10.1007/978-3-031-30823-9_31

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Deep reinforcement learning (DRL) has become a dominant deep-learning paradigm for tasks where complex policies are learned within reactive systems. Unfortunately, these policies are known to be susceptible to bugs. Despite significant progress in DNN verification, there has been little work demonstrating the use of modern verification tools on real-world, DRL-controlled systems. In this case study, we attempt to begin bridging this gap, and focus on the important task of mapless robotic navigation - a classic robotics problem, in which a robot, usually controlled by a DRL agent, needs to efficiently and safely navigate through an unknown arena towards a target. We demonstrate how modern verification engines can be used for effective model selection, i.e., selecting the best available policy for the robot in question from a pool of candidate policies. Specifically, we use verification to detect and rule out policies that may demonstrate suboptimal behavior, such as collisions and infinite loops. We also apply verification to identify models with overly conservative behavior, thus allowing users to choose superior policies, which might be better at finding shorter paths to a target. To validate our work, we conducted extensive experiments on an actual robot, and confirmed that the suboptimal policies detected by our method were indeed flawed. We also demonstrate the superiority of our verification-driven approach over state-of-the-art, gradient attacks. Our work is the first to establish the usefulness of DNN verification in identifying and filtering out suboptimal DRL policies in real-world robots, and we believe that the methods presented here are applicable to a wide range of systems that incorporate deep-learning-based agents.

引用

页码：607 / 627

页数：21

共 50 条

[31] Exploring Navigation Maps for Learning-Based Motion Prediction
Sclunidt, Julian
Jordan, Julian
Gritschneder, Franz
Monninger, Thomas
Dietmayer, Klaus
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3539 - 3545
[32] Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
Liu, Boyi
Wang, Lujia
Liu, Ming
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 1688 - 1695
[33] Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
Liu, Boyi
Wang, Lujia
Liu, Ming
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4555 - 4562
[34] Learning-based Success Validation for Robotic Assembly Tasks
Laemmle, Arik
Goes, Marlies
Tenbrock, Philipp
2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
[35] Learning-Based Controller Optimization for Repetitive Robotic Tasks
Li, Xiaocong
Zhu, Haiyue
Ma, Jun
Teo, Tat Joo
Teo, Chek Sing
Tomizuka, Masayoshi
Lee, Tong Heng
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7617 - 7622
[36] Learning-Based Automation of Robotic Assembly for Smart Manufacturing
Ji, Sanghoon
Lee, Sukhan
Yoo, Sujeong
Suh, Ilhong
Kwon, Inso
Park, Frank C.
Lee, Sanghyoung
Kim, Hongseok
PROCEEDINGS OF THE IEEE, 2021, 109 (04) : 423 - 440
[37] Deep Learning-based Object Understanding for Robotic Manipulation
Moon, Jong-Sul
Jo, HyunJun
Song, Jae-Bok
2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 1 - 5
[38] Learning-Based Variable Compliance Control for Robotic Assembly
Ren, Tianyu
Dong, Yunfei
Wu, Dan
Chen, Ken
JOURNAL OF MECHANISMS AND ROBOTICS-TRANSACTIONS OF THE ASME, 2018, 10 (06):
[39] Review of Learning-Based Robotic Manipulation in Cluttered Environments
Mohammed, Marwan Qaid
Kwek, Lee Chung
Chua, Shing Chyi
Al-Dhaqm, Arafat
Nahavandi, Saeid
Eisa, Taiseer Abdalla Elfadil
Miskon, Muhammad Fahmi
Al-Mhiqani, Mohammed Nasser
Ali, Abdulalem
Abaker, Mohammed
Alandoli, Esmail Ali
SENSORS, 2022, 22 (20)
[40] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
Rezwan, Sifat
Chun, Chanjun
Choi, Wooyeol
IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732

← 1 2 3 4 5 →