Verifying Learning-Based Robotic Navigation Systems

被引:6
|
作者
Amir, Guy [1 ]
Corsi, Davide [2 ]
Yerushalmi, Raz [1 ,3 ]
Marzari, Luca [2 ]
Harel, David [3 ]
Farinelli, Alessandro [2 ]
Katz, Guy [1 ]
机构
[1] Hebrew Univ Jerusalem, Jerusalem, Israel
[2] Univ Verona, Verona, Italy
[3] Weizmann Inst Sci, Rehovot, Israel
基金
以色列科学基金会;
关键词
REINFORCEMENT; VERIFICATION;
D O I
10.1007/978-3-031-30823-9_31
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep reinforcement learning (DRL) has become a dominant deep-learning paradigm for tasks where complex policies are learned within reactive systems. Unfortunately, these policies are known to be susceptible to bugs. Despite significant progress in DNN verification, there has been little work demonstrating the use of modern verification tools on real-world, DRL-controlled systems. In this case study, we attempt to begin bridging this gap, and focus on the important task of mapless robotic navigation - a classic robotics problem, in which a robot, usually controlled by a DRL agent, needs to efficiently and safely navigate through an unknown arena towards a target. We demonstrate how modern verification engines can be used for effective model selection, i.e., selecting the best available policy for the robot in question from a pool of candidate policies. Specifically, we use verification to detect and rule out policies that may demonstrate suboptimal behavior, such as collisions and infinite loops. We also apply verification to identify models with overly conservative behavior, thus allowing users to choose superior policies, which might be better at finding shorter paths to a target. To validate our work, we conducted extensive experiments on an actual robot, and confirmed that the suboptimal policies detected by our method were indeed flawed. We also demonstrate the superiority of our verification-driven approach over state-of-the-art, gradient attacks. Our work is the first to establish the usefulness of DNN verification in identifying and filtering out suboptimal DRL policies in real-world robots, and we believe that the methods presented here are applicable to a wide range of systems that incorporate deep-learning-based agents.
引用
收藏
页码:607 / 627
页数:21
相关论文
共 50 条
  • [31] Exploring Navigation Maps for Learning-Based Motion Prediction
    Sclunidt, Julian
    Jordan, Julian
    Gritschneder, Franz
    Monninger, Thomas
    Dietmayer, Klaus
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3539 - 3545
  • [32] Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
    Liu, Boyi
    Wang, Lujia
    Liu, Ming
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 1688 - 1695
  • [33] Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
    Liu, Boyi
    Wang, Lujia
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4555 - 4562
  • [34] Learning-based Success Validation for Robotic Assembly Tasks
    Laemmle, Arik
    Goes, Marlies
    Tenbrock, Philipp
    2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
  • [35] Learning-Based Controller Optimization for Repetitive Robotic Tasks
    Li, Xiaocong
    Zhu, Haiyue
    Ma, Jun
    Teo, Tat Joo
    Teo, Chek Sing
    Tomizuka, Masayoshi
    Lee, Tong Heng
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7617 - 7622
  • [36] Learning-Based Automation of Robotic Assembly for Smart Manufacturing
    Ji, Sanghoon
    Lee, Sukhan
    Yoo, Sujeong
    Suh, Ilhong
    Kwon, Inso
    Park, Frank C.
    Lee, Sanghyoung
    Kim, Hongseok
    PROCEEDINGS OF THE IEEE, 2021, 109 (04) : 423 - 440
  • [37] Deep Learning-based Object Understanding for Robotic Manipulation
    Moon, Jong-Sul
    Jo, HyunJun
    Song, Jae-Bok
    2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 1 - 5
  • [38] Learning-Based Variable Compliance Control for Robotic Assembly
    Ren, Tianyu
    Dong, Yunfei
    Wu, Dan
    Chen, Ken
    JOURNAL OF MECHANISMS AND ROBOTICS-TRANSACTIONS OF THE ASME, 2018, 10 (06):
  • [39] Review of Learning-Based Robotic Manipulation in Cluttered Environments
    Mohammed, Marwan Qaid
    Kwek, Lee Chung
    Chua, Shing Chyi
    Al-Dhaqm, Arafat
    Nahavandi, Saeid
    Eisa, Taiseer Abdalla Elfadil
    Miskon, Muhammad Fahmi
    Al-Mhiqani, Mohammed Nasser
    Ali, Abdulalem
    Abaker, Mohammed
    Alandoli, Esmail Ali
    SENSORS, 2022, 22 (20)
  • [40] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
    Rezwan, Sifat
    Chun, Chanjun
    Choi, Wooyeol
    IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732