Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems

Cited by: 42

Authors:

Yang, Xiong [1,2]

He, Haibo [2]

Liu, Derong [3]

Zhu, Yuanheng [4]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

[3] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China

[4] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China

Source:

IET CONTROL THEORY AND APPLICATIONS | 2017 / Vol. 11 / Issue 14

Funding:

US National Science Foundation; National Natural Science Foundation of China;

Keywords:

dynamic programming; robust control; neurocontrollers; continuous time systems; control system synthesis; nonlinear control systems; optimal control; function approximation; Monte Carlo methods; closed loop systems; asymptotic stability; adaptive dynamic programming; robust neural control design; unknown continuous-time nonlinear systems; CT nonlinear systems; ADP-based robust neural control scheme; robust nonlinear control problem; nonlinear optimal control problem; nominal system; ADP algorithm; actor-critic dual networks; control policy approximation; value function approximation; actor neural network weights; critic NN weights; Monte Carlo integration method; closed-loop system; asymptotically stability; APPROXIMATE OPTIMAL-CONTROL; POLICY ITERATION; ALGORITHM; DESIGN;

DOI:

10.1049/iet-cta.2017.0154

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Designing robust controllers for continuous-time (CT) non-linear systems with completely unknown non-linearities is a challenging task. The inability to accurately identify the non-linearities online or offline motivates the design of robust controllers using adaptive dynamic programming (ADP). In this study, an ADP-based robust neural control scheme is developed for a class of unknown CT non-linear systems. First, the robust non-linear control problem is converted into a non-linear optimal control problem by constructing a value function for the nominal system. Then an ADP algorithm is developed to solve the non-linear optimal control problem. The ADP algorithm employs actor-critic dual networks to approximate the control policy and the value function, respectively. With this architecture, only system data are needed to update the actor neural network (NN) weights and the critic NN weights simultaneously. Moreover, using the Monte Carlo integration method removes the need for the persistence of excitation assumption. The closed-loop system with unknown non-linearities is shown to be asymptotically stable under the obtained optimal control. Finally, two examples are provided to validate the developed method.
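
The abstract names Monte Carlo integration as the tool that replaces the persistence of excitation assumption, but this record does not give the paper's update equations. The sketch below therefore illustrates only the generic technique: estimating an integral over a compact state region by averaging a function at uniformly sampled points. The function names (`mc_integrate`, `quadratic`), the test integrand, and the sampling region are assumptions chosen for illustration and are not taken from the paper.

```python
import random


def mc_integrate(f, bounds, n_samples=20000, seed=0):
    """Estimate the integral of f over an axis-aligned box by uniform sampling.

    bounds is a list of (low, high) pairs, one per state dimension.
    This is a generic Monte Carlo estimator, not the paper's algorithm.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible

    # Volume of the sampling region.
    volume = 1.0
    for low, high in bounds:
        volume *= high - low

    # Average f at uniformly drawn points, then scale by the volume.
    total = 0.0
    for _ in range(n_samples):
        x = [rng.uniform(low, high) for low, high in bounds]
        total += f(x)
    return volume * total / n_samples


def quadratic(x):
    # Hypothetical integrand: a simple quadratic in a two-dimensional state.
    return x[0] ** 2 + x[1] ** 2


# The exact integral of x1^2 + x2^2 over [-1, 1]^2 is 8/3.
estimate = mc_integrate(quadratic, [(-1.0, 1.0), (-1.0, 1.0)])
print(estimate)
```

The estimator needs only sampled states from a fixed region and no knowledge of the system dynamics, and its error shrinks roughly with the inverse square root of the number of samples.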

Pages: 2307 - 2316

Number of pages: 10

50 records in total

[31] H∞ optimal control of unknown continuous time linear periodic systems by adaptive dynamic programming with applications to magnetic attitude control
Jiang, Huaiyuan
Zhou, Bin
OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (03): 1341 - 1355
[32] Approximate Optimal Adaptive Control of Partially Unknown Linear Continuous-time Systems with State Delay
Moghadam, Rohollah
Jagannathan, Sarangapani
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019: 1985 - 1990
[33] Robust adaptive control of uncertain non-linear systems with non-linear parameterisation
Liu, Yusheng
INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2006, 1 (02): 151 - 156
[34] Adaptive robust control of mechanical systems with non-linear dynamic friction compensation
Xu, L.
Yao, B.
INTERNATIONAL JOURNAL OF CONTROL, 2008, 81 (02): 167 - 176
[35] Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
Yang, Xiong
Liu, Derong
Huang, Yuzhu
IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (17): 2037 - 2047
[36] Robust adaptive neural control for a class of uncertain non-linear time-delay systems with unknown dead-zone non-linearity
Wang, J.
Hu, J.
IET CONTROL THEORY AND APPLICATIONS, 2011, 5 (15): 1782 - 1795
[37] A robust adaptive control design for chaotic continuous-time systems
Ohmori, H
Ito, Y
Sano, A
PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996: 2968 - 2973
[38] Stability Analysis of Robust Adaptive Control of Continuous-Time Systems
Li, Yue
Zhao, Xiaohui
The Journal of China Universities of Posts and Telecommunications, 1999, (01): 3 - 5
[39] Online approximate optimal control for affine non-linear systems with unknown internal dynamics using adaptive dynamic programming
Yang, Xiong
Liu, Derong
Wei, Qinglai
IET CONTROL THEORY AND APPLICATIONS, 2014, 8 (16): 1676 - 1688
[40] Adaptive non-linear compensation control based on neural networks for non-linear systems with time delay
Ren, X. M.
Rad, A. B.
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2009, 40 (12): 1283 - 1292
