H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method

Cited by: 40

Authors:

Jiang, He [1]

Zhang, Huaguang [1]

Luo, Yanhong [1]

Cui, Xiaohong [1]

Affiliation:

[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China

Source:

NEUROCOMPUTING | 2017 / Vol. 237

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Adaptive dynamic programming; Data-driven; Neural networks; OPTIMAL TRACKING CONTROL; DYNAMIC-PROGRAMMING ALGORITHM; DIFFERENTIAL GRAPHICAL GAMES; POLICY UPDATE ALGORITHM; ZERO-SUM GAME; FEEDBACK-CONTROL; CONTROL DESIGN; TIME-SYSTEMS; ITERATION; SYNCHRONIZATION;

DOI:

10.1016/j.neucom.2016.11.041

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper investigates the H-infinity control problem for nonlinear systems with completely unknown dynamics and constrained control input by utilizing a novel data-driven reinforcement learning method. It is known that the nonlinear H-infinity control problem relies on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which is essentially a nonlinear partial differential equation and is generally impossible to solve analytically. To overcome this difficulty, we first propose a model-based simultaneous policy update algorithm to learn the solution of the HJI equation iteratively and provide its convergence proof. Then, based on this model-based method, we develop a data-driven model-free algorithm, which requires only real system sampling data generated by arbitrary different control inputs and external disturbances instead of accurate system models, and prove that these two algorithms are equivalent. To implement this model-free algorithm, three neural networks (NNs) are employed to approximate the iterative performance index function, control policy and disturbance policy, respectively, and the least-squares approach is used to minimize the NN approximation residual errors. Finally, the proposed scheme is tested on the rotational/translational actuator nonlinear system.
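
The least-squares step mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes a linear-in-weights approximator whose weights are fitted to sampled data in one batch; the quadratic basis `features`, the helper `fit_weights`, and the synthetic data are all hypothetical stand-ins for the NN activation functions and real system samples.

```python
import numpy as np

def features(x):
    """Hypothetical quadratic basis standing in for NN activation functions."""
    x1, x2 = x
    return np.array([x1 * x1, x1 * x2, x2 * x2])

def fit_weights(states, targets):
    """Fit weights w that minimize ||Phi w - targets||^2 over the sampled states."""
    phi = np.stack([features(x) for x in states])  # one feature row per sample
    weights, *_ = np.linalg.lstsq(phi, targets, rcond=None)
    return weights

# Synthetic data (an assumption for illustration): targets are generated
# from known weights so that the fit can be checked against them.
rng = np.random.default_rng(0)
states = rng.uniform(-1.0, 1.0, size=(50, 2))
true_weights = np.array([1.0, -0.5, 2.0])
targets = np.stack([features(x) for x in states]) @ true_weights

estimated = fit_weights(states, targets)
```

With noise-free targets and more samples than basis functions, `estimated` should match `true_weights` up to numerical precision. With real sampled data the fit would only minimize the residual, not eliminate it.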

Pages: 226-234

Number of pages: 9
