Adaptive Critic Control With Knowledge Transfer for Uncertain Nonlinear Dynamical Systems: A Reinforcement Learning Approach

被引：0

作者：

Zhang, Liangju ^{[1
,2
]}

Zhang, Kun ^{[3
]}

Xie, Xiang Peng ^{[4
]}

Chadli, Mohammed ^{[5
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Jiangsu, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll ArtificialIntelligence, Nanjing 210023, Jiangsu, Peoples R China

[3] Beihang Univ, Sch Astronaut, Beijing, Peoples R China

[4] Nanjing Univ Posts & Telecommun, Sch Internet Things, Nanjing, Peoples R China

[5] Univ Paris Saclay, IBISC Lab, F-91000 Evry, France

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2025年 / 22卷

基金：

中国国家自然科学基金;

关键词：

Adaptive dynamic programming (ADP); robust optimal control; transfer reinforcement learning; neural networks; DISTURBANCE OBSERVER;

D O I：

10.1109/TASE.2024.3453926

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an online transfer heuristic dynamic programming (THDP) control approach for a class of nonlinear discrete systems. The proposed approach integrates transfer learning with adaptive critic control. To design a robust optimal control strategy for the nonlinear discrete systems, we utilize sample data collected from a source task to acquire prior knowledge. This prior knowledge is subsequently used to guide the online control process of nonlinear systems of target tasks. To avoid negative transfer effects and conserve computational resources, we introduce a novel attenuation function with a truncation mechanism. Additionally, we develop a disturbance compensation control mechanism to address uncertainties. Furthermore, we demonstrate that the properties of the uncertain nonlinear systems under robust optimal control, as well as the weight error of neural networks, are ultimately uniformly bounded given certain conditions. Finally, two simulations are conducted to verify the performance of the proposed algorithm Note to Practitioners-Adaptive dynamic programming (ADP) is one of the main methods to solve the Hamilton-Jacobi-Bellman (HJB) equation. However, when using neural network approximation, it often requires a long time of iteration and a large amount of computational process, wasting a lot of computational resources. For this reason, we propose an ADP control scheme with enhanced detection speed: that is, by learning a class of similar tasks to obtain prior knowledge to assist in the online control of our actual system. At the same time, this paper considers system disturbances, which means that they are more universal and robust. After simulation experiments, it has been proven that this scheme has good performance.

引用

页码：6752 / 6761

页数：10

共 50 条

[31] Optimized Adaptive Nonlinear Tracking Control Using Actor-Critic Reinforcement Learning Strategy
Wen, Guoxing
Chen, C. L. Philip
Ge, Shuzhi Sam
Yang, Hongli
Liu, Xiaoguang
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (09) : 4969 - 4977
[32] Reinforcement learning based adaptive control for uncertain mechanical systems with asymptotic tracking
Liang, Xiang-long
Yao, Zhi-kai
Ge, Yao-wen
Yao, Jian-yong
DEFENCE TECHNOLOGY, 2024, 34 : 19 - 28
[33] A Comparative Analysis of Reinforcement Learning and Adaptive Control Techniques for Linear Uncertain Systems
Wafi, Moh Kamalul
Siami, Milad
2023 PROCEEDINGS OF THE CONFERENCE ON CONTROL AND ITS APPLICATIONS, CT, 2023, : 25 - 32
[34] BACKSTEPPING-BASED ADAPTIVE CRITIC DESIGN FOR UNCERTAIN NONLINEAR SYSTEMS
Yao, J.
Zhu, X.
Li, J.
Hu, F.
CONTROL AND INTELLIGENT SYSTEMS, 2010, 38 (03) : 140 - 147
[35] Reinforcement learning based adaptive control for uncertain mechanical systems with asymptotic tracking
Xiang-long Liang
Zhi-kai Yao
Yao-wen Ge
Jian-yong Yao
Defence Technology, 2024, 34 (04) : 19 - 28
[36] An adaptive iterative learning control framework for a class of uncertain nonlinear systems
Tayebi, Abdelhamid
Chien, Chiang-Ju
PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5056 - +
[37] A unified adaptive iterative learning control framework for uncertain nonlinear systems
Tayebi, Abdelhamid
Chien, Chiang-Ju
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (10) : 1907 - 1913
[38] Reinforcement-Learning-Based Disturbance Rejection Control for Uncertain Nonlinear Systems
Ran, Maopeng
Li, Juncheng
Xie, Lihua
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9621 - 9633
[39] Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation
Bai, Weiwei
Zhou, Qi
Li, Tieshan
Li, Hongyi
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (08) : 3433 - 3443
[40] An improved adaptive dynamic surface control approach for uncertain nonlinear systems
Zhang, Zhikai
Duan, Guangren
Hou, Mingzhe
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2018, 32 (05) : 713 - 728

← 1 2 3 4 5 →