Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives

Cited by: 2
Authors
Hashemi, Navid [1]
Qin, Xin [1]
Deshmukh, Jyotirmoy V. [1]
Fainekos, Georgios [2]
Hoxha, Bardh [2]
Prokhorov, Danil [2]
Yamaguchi, Tomoya [2]
Affiliations
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Toyota Motor North Amer R&D, Saline, MI USA
Funding
National Science Foundation (USA)
DOI
10.23919/ACC55779.2023.10156345
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we consider the problem of synthesizing a controller in the presence of uncertainty such that the resulting closed-loop system satisfies certain hard constraints while optimizing certain (soft) performance objectives. We assume that the hard constraints encoding safety or mission-critical specifications are expressed using Signal Temporal Logic (STL), while performance is quantified using standard cost functions on system trajectories. To ensure satisfaction of the STL constraints, we algorithmically obtain control barrier functions (CBFs) from the STL specifications. We model controllers as neural networks (NNs) and provide an algorithm to train the NN parameters to simultaneously optimize the performance objectives while satisfying the CBF conditions (with a user-specified robustness margin). We evaluate the risk incurred by the trade-off between the robustness margin of the system and its performance using the formalism of risk measures. We demonstrate our approach on challenging nonlinear control examples such as quadcopter motion planning and a unicycle.
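As a rough illustration of the two quantities the abstract connects, a robustness margin for a temporal specification and a risk measure over that margin, the following minimal sketch may help. It is not the authors' algorithm: the specification (an "always x > threshold" property over a sampled trace), the choice of Value-at-Risk and Conditional Value-at-Risk as risk measures, the function names, and the sample traces are all assumptions made for illustration.

```python
import math


def always_robustness(signal, threshold):
    """Robustness of the hypothetical spec 'always (x > threshold)'.

    For a sampled trace this is the worst-case margin x - threshold;
    a positive value means the spec holds with that much slack.
    """
    return min(x - threshold for x in signal)


def value_at_risk(losses, beta):
    """Empirical VaR: the lower beta-quantile of the sampled losses."""
    ordered = sorted(losses)
    index = max(0, math.ceil(beta * len(ordered)) - 1)
    return ordered[index]


def conditional_value_at_risk(losses, beta):
    """Empirical CVaR via the Rockafellar-Uryasev form, evaluated at the VaR."""
    var = value_at_risk(losses, beta)
    excess = sum(max(loss - var, 0.0) for loss in losses)
    return var + excess / ((1.0 - beta) * len(losses))


# Made-up closed-loop traces of a scalar signal (assumption, not paper data).
traces = [
    [1.0, 0.8, 0.9],
    [1.2, 0.5, 0.7],
    [0.9, 0.2, 0.6],
    [1.1, -0.1, 0.4],
]

# Robustness per trace; the loss is its negation, so larger loss = less robust.
robustness = [always_robustness(trace, 0.0) for trace in traces]
losses = [-r for r in robustness]

print("robustness:", robustness)
print("VaR(0.75): ", value_at_risk(losses, 0.75))
print("CVaR(0.75):", conditional_value_at_risk(losses, 0.75))
```

Treating negated robustness as the loss means a positive risk value signals that the tail of the sampled behaviors violates the specification, even when most traces satisfy it with a comfortable margin.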
Pages: 4096-4103
Page count: 8