Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

被引:283
|
作者
Brunke, Lukas [1 ,2 ,3 ]
Greeff, Melissa [1 ,2 ,3 ]
Hall, Adam W. [1 ,2 ,3 ]
Yuan, Zhaocong [1 ,2 ,3 ]
Zhou, Siqi [1 ,2 ,3 ]
Panerati, Jacopo [1 ,2 ,3 ]
Schoellig, Angela P. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Inst Aerosp Studies, Toronto, ON, Canada
[2] Univ Toronto, Robot Inst, Toronto, ON, Canada
[3] Vector Inst Artificial Intelligence, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
safe learning; robotics; robot learning; learning-based control; safe reinforcement learning; adaptive control; robust control; model predictive control; machine learning; benchmarks; MODEL-PREDICTIVE CONTROL; BARRIER FUNCTIONS; TRACKING CONTROL; OPTIMIZATION; EXPLORATION; ROBUSTNESS; ALGORITHMS; FRAMEWORK; SYSTEMS;
D O I
10.1146/annurev-control-042920-020211
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The last half decade has seen a steep rise in the number of contributions on safe learning methods for real-world robotic deployments from both the control and reinforcement learning communities. This article provides a concise but holistic review of the recent advances made in using machine learning to achieve safe decision-making under uncertainties, with a focus on unifying the language and frameworks used in control theory and reinforcement learning research. It includes learning-based control approaches that safely improve performance by learning the uncertain dynamics, reinforcement learning approaches that encourage safety or robustness, and methods that can formally certify the safety of a learned control policy. As data- and learning-based robot control methods continue to gain traction, researchers must understand when and how to best leverage them in real-world scenarios where safety is imperative, such as when operating in close proximity to humans. We highlight some of the open challenges that will drive the field of robot learning in the coming years, and emphasize the need for realistic physics-based benchmarks to facilitate fair comparisons between control and reinforcement learning approaches.
引用
收藏
页码:411 / 444
页数:34
相关论文
共 50 条
  • [41] Safe-NORA: Safe Reinforcement Learning-based Mobile Network Resource Allocation for Diverse User Demands
    Huang, Wenzhen
    Li, Tong
    Cao, Yuting
    Lyu, Zhe
    Liang, Yanping
    Yu, Li
    Jin, Depeng
    Zhang, Junge
    Li, Yong
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 885 - 894
  • [42] Safe Learning-Based Control for Multiple UAVs Under Uncertain Disturbances
    Wei, Mingxin
    Zheng, Lanxiang
    Wu, Ying
    Liu, Han
    Cheng, Hui
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 7349 - 7362
  • [43] Safe Learning-based Tracking Control for Quadrotors under Wind Disturbances
    Zheng, Lei
    Yang, Rui
    Pan, Jiesen
    Cheng, Hui
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3638 - 3643
  • [44] What Is Acceptably Safe for Reinforcement Learning?
    Bragg, John
    Habli, Ibrahim
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2018, 2018, 11094 : 418 - 430
  • [45] A comprehensive survey on safe reinforcement learning
    García, Javier
    Fernández, Fernando
    Journal of Machine Learning Research, 2015, 16 : 1437 - 1480
  • [46] Safe Reinforcement Learning for Sepsis Treatment
    Jia, Yan
    Burden, John
    Lawton, Tom
    Habli, Ibrahim
    2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020), 2020, : 108 - 114
  • [47] Safe reinforcement learning for dynamical games
    Yang, Yongliang
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (09) : 3706 - 3726
  • [48] Lyapunov design for safe reinforcement learning
    Perkins, TJ
    Barto, AG
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 803 - 832
  • [49] Safe Reinforcement Learning With Dual Robustness
    Li, Zeyang
    Hu, Chuxiong
    Wang, Yunan
    Yang, Yujie
    Li, Shengbo Eben
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10876 - 10890
  • [50] Safe Reinforcement Learning for Legged Locomotion
    Yang, Tsung-Yen
    Zhang, Tingnan
    Luu, Linda
    Ha, Sehoon
    Tan, Jie
    Yu, Wenhao
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2454 - 2461