H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method

Cited by: 40
Authors
Jiang, He [1]
Zhang, Huaguang [1]
Luo, Yanhong [1]
Cui, Xiaohong [1]
Affiliations
[1] Northeastern University, College of Information Science and Engineering, Box 134, Shenyang 110819, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Reinforcement learning; Adaptive dynamic programming; Data-driven; Neural networks; OPTIMAL TRACKING CONTROL; DYNAMIC-PROGRAMMING ALGORITHM; DIFFERENTIAL GRAPHICAL GAMES; POLICY UPDATE ALGORITHM; ZERO-SUM GAME; FEEDBACK-CONTROL; CONTROL DESIGN; TIME-SYSTEMS; ITERATION; SYNCHRONIZATION;
DOI
10.1016/j.neucom.2016.11.041
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the H-infinity control problem for nonlinear systems with completely unknown dynamics and constrained control input using a novel data-driven reinforcement learning method. The nonlinear H-infinity control problem relies on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which is a nonlinear partial differential equation and generally cannot be solved analytically. To overcome this difficulty, we first propose a model-based simultaneous policy update algorithm that learns the solution of the HJI equation iteratively, and we provide its convergence proof. Building on this model-based method, we then develop a data-driven model-free algorithm, which requires only sampled data from the real system generated by arbitrary control inputs and external disturbances instead of an accurate system model, and we prove that the two algorithms are equivalent. To implement the model-free algorithm, three neural networks (NNs) approximate the iterative performance index function, the control policy, and the disturbance policy, respectively, and a least-squares approach minimizes the NN approximation residual errors. Finally, the proposed scheme is tested on the rotational/translational actuator nonlinear system.
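For orientation, the constrained-input H-infinity problem summarized in the abstract is commonly posed as a two-player zero-sum game. The block below is a sketch in notation that is standard in the constrained-input literature, not taken from this record: the dynamics terms f, g, k, the state cost Q, the diagonal positive-definite weight R, the input bound λ, and the attenuation level γ are all assumptions made for illustration, and the paper's own formulation may differ in detail.

```latex
% Assumed notation: state x, control u with |u_i| <= \lambda,
% disturbance w, attenuation level \gamma, R diagonal and positive definite.
\begin{align}
  % Assumed input-affine dynamics
  \dot{x} &= f(x) + g(x)\,u + k(x)\,w \\
  % Value function with a nonquadratic penalty that enforces the input bound
  V(x(t)) &= \int_t^{\infty} \bigl( Q(x) + W(u) - \gamma^2 w^{\top} w \bigr)\, d\tau,
  \qquad
  W(u) = 2 \int_0^{u} \lambda \bigl(\tanh^{-1}(v/\lambda)\bigr)^{\top} R \, dv \\
  % Saddle-point policies from the stationarity conditions of the Hamiltonian
  u^{*} &= -\lambda \tanh\!\Bigl( \tfrac{1}{2\lambda} R^{-1} g^{\top}(x)\, \nabla V^{*} \Bigr),
  \qquad
  w^{*} = \tfrac{1}{2\gamma^{2}}\, k^{\top}(x)\, \nabla V^{*} \\
  % Hamilton-Jacobi-Isaacs (HJI) equation
  0 &= Q(x) + W(u^{*}) - \gamma^{2} w^{*\top} w^{*}
       + (\nabla V^{*})^{\top} \bigl( f(x) + g(x)\,u^{*} + k(x)\,w^{*} \bigr)
\end{align}
```

Under these assumptions the tanh in the control law keeps each input component within the bound λ, and the last line is the nonlinear partial differential equation whose solution the iterative algorithms approximate.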
Pages: 226-234
Number of pages: 9
Related papers
50 in total
  • [41] Application of an Off-Policy Reinforcement Learning Algorithm for H∞ Control Design of Nonlinear Structural Systems With Completely Unknown Dynamics
    Amirmojahedi, M.
    Mojoodi, A.
    Shojaee, Saeed
    Hamzehei-Javaran, Saleh
    EARTHQUAKE ENGINEERING & STRUCTURAL DYNAMICS, 2025, 54 (04): 1210-1228
  • [42] Sparse Wide-Area Control of Power Systems using Data-driven Reinforcement Learning
    Dizche, Amirhassan Fallah
    Chakrabortty, Aranya
    Duel-Hallen, Alexandra
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019: 2867-2872
  • [43] Data-driven learning for robot control with unknown Jacobian
    Lyu, Shangke
    Cheah, Chien Chern
    AUTOMATICA, 2020, 120
  • [44] Data-Driven Robust Approximate Optimal Tracking Control for Unknown General Nonlinear Systems Using Adaptive Dynamic Programming Method
    Zhang, Huaguang
    Cui, Lili
    Zhang, Xin
    Luo, Yanhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (12): 2226-2236
  • [45] Data-Driven Control and Learning Systems
    Hou, Zhongsheng
    Gao, Huijun
    Lewis, Frank L.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (05): 4070-4075
  • [46] Data-driven consistent control with data compensation for a class of unknown nonlinear multiagent systems with constraints
    Wu, Lipu
    Li, Zhen
    Liu, Shida
    Li, Zhijun
    Sun, Dehui
    IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (18): 2402-2418
  • [47] Data-driven H∞ control of constrained systems: An application to bilateral teleoperation system
    Kucukdemiral, Ibrahim
    Yazici, Hakan
    Gormus, Bilal
    Bevan, Geraint Paul
    ISA TRANSACTIONS, 2023, 137: 23-34
  • [48] Data-Driven Iterative Learning Control for I/O Constrained LTI Systems
    Zhang Ruikun
    Hou Zhongsheng
    Chi Ronghu
    Li Zhenxuan
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016: 3166-3171
  • [49] Input-mapping based data-driven model predictive control for unknown linear systems via online learning
    Yang, Lingyi
    Li, Dewei
    Ma, Aoyun
    Xi, Yugeng
    Pu, Ye
    Tan, Ying
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022,
  • [50] Constrained data-driven optimal iterative learning control
    Chi, Ronghu
    Liu, Xiaohe
    Zhang, Ruikun
    Hou, Zhongsheng
    Huang, Biao
    JOURNAL OF PROCESS CONTROL, 2017, 55: 10-29