Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives

Cited by: 2
Authors
Hashemi, Navid [1]
Qin, Xin [1]
Deshmukh, Jyotirmoy V. [1]
Fainekos, Georgios [2]
Hoxha, Bardh [2]
Prokhorov, Danil [2]
Yamaguchi, Tomoya [2]
Affiliations
[1] University of Southern California, Los Angeles, CA 90007, USA
[2] Toyota Motor North America R&D, Saline, MI, USA
Funding
National Science Foundation (USA)
DOI
10.23919/ACC55779.2023.10156345
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we consider the problem of synthesizing a controller in the presence of uncertainty such that the resulting closed-loop system satisfies certain hard constraints while optimizing certain (soft) performance objectives. We assume that the hard constraints encoding safety or mission-critical specifications are expressed using Signal Temporal Logic (STL), while performance is quantified using standard cost functions on system trajectories. To ensure satisfaction of the STL constraints, we algorithmically obtain control barrier functions (CBFs) from the STL specifications. We model controllers as neural networks (NNs) and provide an algorithm to train the NN parameters to simultaneously optimize the performance objectives while satisfying the CBF conditions (with a user-specified robustness margin). We evaluate the risk incurred by the trade-off between the robustness margin of the system and its performance using the formalism of risk measures. We demonstrate our approach on challenging nonlinear control examples such as quadcopter motion planning and a unicycle.
Pages: 4096-4103
Page count: 8