Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

被引：281

作者：

Brunke, Lukas ^{[1
,2
,3
]}

Greeff, Melissa ^{[1
,2
,3
]}

Hall, Adam W. ^{[1
,2
,3
]}

Yuan, Zhaocong ^{[1
,2
,3
]}

Zhou, Siqi ^{[1
,2
,3
]}

Panerati, Jacopo ^{[1
,2
,3
]}

Schoellig, Angela P. ^{[1
,2
,3
]}

机构：

[1] Univ Toronto, Inst Aerosp Studies, Toronto, ON, Canada

[2] Univ Toronto, Robot Inst, Toronto, ON, Canada

[3] Vector Inst Artificial Intelligence, Toronto, ON, Canada

来源：

ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS | 2022年 / 5卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

safe learning; robotics; robot learning; learning-based control; safe reinforcement learning; adaptive control; robust control; model predictive control; machine learning; benchmarks; MODEL-PREDICTIVE CONTROL; BARRIER FUNCTIONS; TRACKING CONTROL; OPTIMIZATION; EXPLORATION; ROBUSTNESS; ALGORITHMS; FRAMEWORK; SYSTEMS;

D O I：

10.1146/annurev-control-042920-020211

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The last half decade has seen a steep rise in the number of contributions on safe learning methods for real-world robotic deployments from both the control and reinforcement learning communities. This article provides a concise but holistic review of the recent advances made in using machine learning to achieve safe decision-making under uncertainties, with a focus on unifying the language and frameworks used in control theory and reinforcement learning research. It includes learning-based control approaches that safely improve performance by learning the uncertain dynamics, reinforcement learning approaches that encourage safety or robustness, and methods that can formally certify the safety of a learned control policy. As data- and learning-based robot control methods continue to gain traction, researchers must understand when and how to best leverage them in real-world scenarios where safety is imperative, such as when operating in close proximity to humans. We highlight some of the open challenges that will drive the field of robot learning in the coming years, and emphasize the need for realistic physics-based benchmarks to facilitate fair comparisons between control and reinforcement learning approaches.

引用

页码：411 / 444

页数：34

共 50 条

[31] Safe RAN control: A Symbolic Reinforcement Learning Approach
Nikou, Alexandros
Mujumdar, Anusha
Sundararajan, Vaishnavi
Orlic, Marin
Feljan, Aneta Vulgarakis
2022 IEEE 17TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA, 2022, : 332 - 337
[32] Safe Reinforcement Learning Control for Water Distribution Network
Val, Jorge
Wisniewski, Rafal
Kallesoe, Carsten Skovmose
5TH IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (IEEE CCTA 2021), 2021, : 1148 - 1153
[33] Reinforcement Learning for Robotic Safe Control with Force Sensing
Lin, Nan
Zhang, Linrui
Chen, Yuxuan
Zhu, Yujun
Chen, Ruoxi
Wu, Peichen
Chen, Xiaoping
2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 148 - 153
[34] Online Safe Flight Control Method Based on Constraint Reinforcement Learning
Zhao, Jiawei
Xu, Haotian
Wang, Zhaolei
Zhang, Tao
DRONES, 2024, 8 (09)
[35] Safe Reinforcement Learning-based Driving Policy Design for Autonomous Vehicles on Highways
Nguyen, Hung Duy
Han, Kyoungseok
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (12) : 4098 - 4110
[36] A safe reinforcement learning-based charging strategy for electric vehicles in residential microgrid
Zhang, Shulei
Jia, Runda
Pan, Hengxin
Cao, Yankai
APPLIED ENERGY, 2023, 348
[37] Safe Reinforcement Learning-based Driving Policy Design for Autonomous Vehicles on Highways
Hung Duy Nguyen
Kyoungseok Han
International Journal of Control, Automation and Systems, 2023, 21 : 4098 - 4110
[38] Operational Safe Control for Reinforcement-Learning-Based Robot Autonomy
Zhou, Xu
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 4091 - 4095
[39] Decentralized safe reinforcement learning for inverter-based voltage control
Cui, Wenqi
Li, Jiayi
Zhang, Baosen
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 211
[40] Safe and Accelerated Deep Reinforcement Learning-Based O-RAN Slicing: A Hybrid Transfer Learning Approach
Nagib, Ahmad M.
Abou-Zeid, Hatem
Hassanein, Hossam S.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (02) : 310 - 325

← 1 2 3 4 5 →