Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives

被引:2
|
作者
Hashemi, Navid [1 ]
Qin, Xin [1 ]
Deshmukh, Jyotirmoy V. [1 ]
Fainekos, Georgios [2 ]
Hoxha, Bardh [2 ]
Prokhorov, Danil [2 ]
Yamaguchi, Tomoya [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Toyota Motor North Amer R&D, Saline, MI USA
基金
美国国家科学基金会;
关键词
D O I
10.23919/ACC55779.2023.10156345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider the problem of synthesizing a controller in the presence of uncertainty such that the resulting closed-loop system satisfies certain hard constraints while optimizing certain (soft) performance objectives. We assume that the hard constraints encoding safety or mission-critical specifications are expressed using Signal Temporal Logic (STL), while performance is quantified using standard cost functions on system trajectories. To ensure satisfaction of the STL constraints, we algorithmically obtain control barrier functions (CBFs) from the STL specifications. We model controllers as neural networks (NNs) and provide an algorithm to train the NN parameters to simultaneously optimize the performance objectives while satisfying the CBF conditions (with a user-specified robustness margin). We evaluate the risk incurred by the trade-off between the robustness margin of the system and its performance using the formalism of risk measures. We demonstrate our approach on challenging nonlinear control examples such as quadcopter motion planning and a unicycle.
引用
收藏
页码:4096 / 4103
页数:8
相关论文
共 50 条
  • [21] Exploiting prospect theory and risk-awareness to protect UAV-assisted network operation
    Vamvakas, Panagiotis
    Tsiropoulou, Eirini Eleni
    Papavassiliou, Symeon
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (01)
  • [22] LB4TL: A Smooth Semantics for Temporal Logic to Train Neural Feedback Controllers
    Hashemi, Navid
    Williams, Samuel
    Hoxha, Bardh
    Prokhorov, Danil
    Fainekos, Georgios
    Deshmukh, Jyotirmoy
    IFAC PAPERSONLINE, 2024, 58 (11): : 183 - 188
  • [23] Learning Decentralized Flocking Controllers with Spatio-Temporal Graph Neural Network
    Chen, Siji
    Su, Yanshen
    Li, Peihan
    Zhou, Lifeng
    Lu, Chang-Tien
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2596 - 2602
  • [24] Decoy Allocation Games on Graphs with Temporal Logic Objectives
    Kulkarni, Abhishek N.
    Fu, Jie
    Luo, Huan
    Kamhoua, Charles A.
    Leslie, Nandi O.
    DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2020, 2020, 12513 : 168 - 187
  • [25] Synthesis of Controllers for Co-Safe Linear Temporal Logic Specifications using Reinforcement Learning
    Ren, Xiaohua
    Yin, Xiang
    Li, Shaoyuan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 2304 - 2309
  • [26] Exploiting prospect theory and risk-awareness to protect UAV-assisted network operation
    Panagiotis Vamvakas
    Eirini Eleni Tsiropoulou
    Symeon Papavassiliou
    EURASIP Journal on Wireless Communications and Networking, 2019
  • [27] Learning Signal Temporal Logic through Neural Network for Interpretable Classification
    Li, Danyang
    Cai, Mingyu
    Vasile, Cristian-Ioan
    Tron, Roberto
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 1907 - 1914
  • [28] Counter-Example Guided Imitation Learning of Feedback Controllers from Temporal Logic Specifications
    Thao Dang
    Donze, Alexandre
    Haque, Inzemamul
    Kekatos, Nikolaos
    Saha, Indranil
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5339 - 5344
  • [29] Hybrid controllers for path planning: A temporal logic approach
    Fainekos, Georgios E.
    Kress-Gazit, Hadas
    Pappas, George J.
    2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 4885 - 4890
  • [30] Dynamic Hypergames for Synthesis of Deceptive Strategies With Temporal Logic Objectives
    Li, Lening
    Ma, Haoxiang
    Kulkarni, Abhishek N.
    Fu, Jie
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (01) : 334 - 345