Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

被引:0
|
作者
Gu, Fangda [1 ]
Yin, He [1 ]
El Ghaoui, Laurent [1 ]
Arcak, Murat [1 ]
Seiler, Peter [2 ]
Jin, Ming [3 ]
机构
[1] Univ Calif Berkeley, 2594 Hearst Ave, Berkeley, CA 94720 USA
[2] Univ Michigan, 500 S State St, Ann Arbor, MI 48109 USA
[3] Virginia Tech, 1185 Perry St 453 Whittemore 0111, Blacksburg, VA 24061 USA
关键词
REINFORCEMENT; OPTIMIZATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network controllers have become popular in control tasks thanks to their flexibility and expressivity. Stability is a crucial property for safety-critical dynamical systems, while stabilization of partially observed systems, in many cases, requires controllers to retain and process long-term memories of the past. We consider the important class of recurrent neural networks (RNN) as dynamic controllers for nonlinear uncertain partially-observed systems, and derive convex stability conditions based on integral quadratic constraints, S-lemma and sequential convexification. To ensure stability during the learning and control process, we propose a projected policy gradient method that iteratively enforces the stability conditions in the reparametrized space taking advantage of mild additional information on system dynamics. Numerical experiments show that our method learns stabilizing controllers while using fewer samples and achieving higher final performance compared with policy gradient.
引用
收藏
页码:5385 / 5394
页数:10
相关论文
共 50 条
  • [41] Neural network based controllers for non-linear systems
    Yan, D
    Saif, M
    CONTROL AND COMPUTERS, 1995, 23 (03): : 73 - 78
  • [42] Formal Verification of Stochastic Systems with ReLU Neural Network Controllers
    Sun, Shiqi
    Zhang, Yan
    Luo, Xusheng
    Vlantis, Panagiotis
    Pajic, Miroslav
    Zavlanos, Michael M.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6800 - 6806
  • [43] Neural network-based quality controllers for manufacturing systems
    Chinnam, RB
    Kolarik, WJ
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1997, 35 (09) : 2601 - 2620
  • [44] Learning Neural Network Controllers for Stabilizing Hybrid Dynamic Systems
    Andreichenko, D. K.
    Zhadaev, F. M.
    IZVESTIYA SARATOVSKOGO UNIVERSITETA NOVAYA SERIYA-MATEMATIKA MEKHANIKA INFORMATIKA, 2018, 18 (03): : 354 - 360
  • [45] Comparative study of neural network controllers for nonlinear dynamic systems
    Hussin, MF
    Abouelnasr, BM
    Shoukry, AA
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, 1822 : 357 - 368
  • [46] ONLINE LEARNING NEURAL-NETWORK CONTROLLERS FOR AUTOPILOT SYSTEMS
    NAPOLITANO, MR
    KINCHELOE, M
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1995, 18 (05) : 1008 - 1015
  • [47] Neural network based optimizing controllers for smart structural systems
    Damle, Rajendra
    Rao, Vittal
    Smart Materials and Structures, 1998, 7 (01): : 23 - 30
  • [48] Safety Verification of Nonlinear Systems with Bayesian Neural Network Controllers
    Zeng, Xia
    Yang, Zhengfeng
    Zhang, Li
    Tang, Xiaochao
    Zeng, Zhenbing
    Liu, Zhiming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15278 - 15286
  • [49] Interval Reachability of Nonlinear Dynamical Systems with Neural Network Controllers
    Jafarpour, Saber
    Harapanahalli, Akash
    Coogan, Samuel
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [50] Neural network based optimizing controllers for smart structural systems
    Damle, R
    Rao, V
    SMART MATERIALS & STRUCTURES, 1998, 7 (01): : 23 - 30