Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives

被引：2

作者：

Hashemi, Navid ^{[1
]}

Qin, Xin ^{[1
]}

Deshmukh, Jyotirmoy V. ^{[1
]}

Fainekos, Georgios ^{[2
]}

Hoxha, Bardh ^{[2
]}

Prokhorov, Danil ^{[2
]}

Yamaguchi, Tomoya ^{[2
]}

机构：

[1] Univ Southern Calif, Los Angeles, CA 90007 USA

[2] Toyota Motor North Amer R&D, Saline, MI USA

来源：

2023 AMERICAN CONTROL CONFERENCE, ACC | 2023年

基金：

美国国家科学基金会;

关键词：

D O I：

10.23919/ACC55779.2023.10156345

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider the problem of synthesizing a controller in the presence of uncertainty such that the resulting closed-loop system satisfies certain hard constraints while optimizing certain (soft) performance objectives. We assume that the hard constraints encoding safety or mission-critical specifications are expressed using Signal Temporal Logic (STL), while performance is quantified using standard cost functions on system trajectories. To ensure satisfaction of the STL constraints, we algorithmically obtain control barrier functions (CBFs) from the STL specifications. We model controllers as neural networks (NNs) and provide an algorithm to train the NN parameters to simultaneously optimize the performance objectives while satisfying the CBF conditions (with a user-specified robustness margin). We evaluate the risk incurred by the trade-off between the robustness margin of the system and its performance using the formalism of risk measures. We demonstrate our approach on challenging nonlinear control examples such as quadcopter motion planning and a unicycle.

引用

页码：4096 / 4103

页数：8

共 50 条

[41] Genetic algorithm design of neural network and fuzzy logic controllers
A. Hunter
K.-S. Chiu
Soft Computing, 2000, 4 (3) : 186 - 192
[42] Software Based on Logic Neural Networks for Digital Controllers Design
Pop, Emil
Leba, Monica
Pop, Maria
Sochirca, Bogdan
Badea, Alin
PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, ELECTRONICS, CONTROL & SIGNAL PROCESSING (CSECS'09), 2009, : 168 - +
[43] Neural Logic Reinforcement Learning
Jiang, Zhengyao
Luo, Shan
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[44] Robust Satisfaction of Metric Interval Temporal Logic Objectives in Adversarial Environments
Niu, Luyao
Ramasubramanian, Bhaskar
Clark, Andrew
Poovendran, Radha
GAMES, 2023, 14 (02):
[45] Programmable Logic Controllers Past Linear Temporal Logic for Monitoring Applications in Industrial Control Systems
Mao, Xia
Li, Xin
Huang, Yanhong
Shi, Jianqi
Zhang, Yueling
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (07) : 4393 - 4405
[46] Temporal binding and the neural correlates of sensory awareness
Engel, AK
Singer, W
TRENDS IN COGNITIVE SCIENCES, 2001, 5 (01) : 16 - 25
[47] Anomaly Detection Based on Temporal Behavior Monitoring in Programmable Logic Controllers
Han, Seungjae
Lee, Keonyong
Cho, Seongje
Park, Moonju
ELECTRONICS, 2021, 10 (10)
[48] Temporal Execution Behavior for Host Anomaly Detection in Programmable Logic Controllers
Formby, David
Beyah, Raheem
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 1455 - 1469
[49] Reinforcement Learning With Temporal Logic Rewards
Li, Xiao
Vasile, Cristian-Ioan
Belta, Calin
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3834 - 3839
[50] Compositional Learning and Verification of Neural Network Controllers
Ivanov, Radoslav
Jothimurugan, Kishor
Hsu, Steve
Vaidya, Shaan
Alur, Rajeev
Bastani, Osbert
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)

← 1 2 3 4 5 →