Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

被引:0
|
作者
Gu, Fangda [1 ]
Yin, He [1 ]
El Ghaoui, Laurent [1 ]
Arcak, Murat [1 ]
Seiler, Peter [2 ]
Jin, Ming [3 ]
机构
[1] Univ Calif Berkeley, 2594 Hearst Ave, Berkeley, CA 94720 USA
[2] Univ Michigan, 500 S State St, Ann Arbor, MI 48109 USA
[3] Virginia Tech, 1185 Perry St 453 Whittemore 0111, Blacksburg, VA 24061 USA
关键词
REINFORCEMENT; OPTIMIZATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network controllers have become popular in control tasks thanks to their flexibility and expressivity. Stability is a crucial property for safety-critical dynamical systems, while stabilization of partially observed systems, in many cases, requires controllers to retain and process long-term memories of the past. We consider the important class of recurrent neural networks (RNN) as dynamic controllers for nonlinear uncertain partially-observed systems, and derive convex stability conditions based on integral quadratic constraints, S-lemma and sequential convexification. To ensure stability during the learning and control process, we propose a projected policy gradient method that iteratively enforces the stability conditions in the reparametrized space taking advantage of mild additional information on system dynamics. Numerical experiments show that our method learns stabilizing controllers while using fewer samples and achieving higher final performance compared with policy gradient.
引用
收藏
页码:5385 / 5394
页数:10
相关论文
共 50 条
  • [31] STABILITY ANALYSIS OF A PARTIALLY CONNECTED FEEDBACK NEURAL NETWORK
    Wang, Didi
    Zhou, Changle
    Chang, Pei-Chann
    Huang, Wei-Hsiu
    Wu, Jheng-Long
    Chang, Kai-Ting
    THIRD INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY (ICCET 2011), 2011, : 245 - +
  • [32] Efficient Verification of Control Systems with Neural Network Controllers
    Yang, Guoqing
    Qian, Guangyi
    Lv, Pan
    Li, Hong
    ICVISP 2019: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING, 2019,
  • [33] Improvement of aeroelastic vehicles performance through recurrent neural network controllers
    Claudio Brillante
    Andrea Mannarino
    Nonlinear Dynamics, 2016, 84 : 1479 - 1495
  • [34] Improvement of aeroelastic vehicles performance through recurrent neural network controllers
    Brillante, Claudio
    Mannarino, Andrea
    NONLINEAR DYNAMICS, 2016, 84 (03) : 1479 - 1495
  • [35] Memory Augmented Neural Network Adaptive Controllers: Performance and Stability
    Muthirayan, Deepan
    Khargonekar, Pramod P.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (02) : 825 - 838
  • [36] Simple recurrent neural network:: A neural network structure for control systems
    Hernández, RP
    Gallegos, JA
    Reyes, JAH
    NEUROCOMPUTING, 1998, 23 (1-3) : 277 - 289
  • [37] A modified alopex which guarantees stability for a class of closed-loop neural-network control systems
    纪军红
    麻亮
    强文义
    傅佩琛
    Journal of Harbin Institute of Technology, 1999, (04) : 3 - 5
  • [38] Stability Analysis of Recurrent-Neural-Based Controllers Using Dissipativity Domain
    Jafari, Reza
    MATHEMATICS, 2023, 11 (14)
  • [39] Ensuring Both Positivity and Stability Using Sector-Bounded Nonlinearity for Systems With Neural Network Controllers
    Hedesh, Hamidreza Montazeri
    Siami, Milad
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1685 - 1690
  • [40] A recurrent neural network for modelling dynamical systems
    Bailer-Jones, CAL
    MacKay, DJC
    Withers, PJ
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1998, 9 (04) : 531 - 547