Adaptive Critic Control With Knowledge Transfer for Uncertain Nonlinear Dynamical Systems: A Reinforcement Learning Approach

Cited by: 0
Authors
Zhang, Liangju [1, 2]
Zhang, Kun [3]
Xie, Xiang Peng [4]
Chadli, Mohammed [5]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210023, Jiangsu, Peoples R China
[3] Beihang Univ, Sch Astronaut, Beijing, Peoples R China
[4] Nanjing Univ Posts & Telecommun, Sch Internet Things, Nanjing, Peoples R China
[5] Univ Paris Saclay, IBISC Lab, F-91000 Evry, France
Funding
National Natural Science Foundation of China
Keywords
Adaptive dynamic programming (ADP); robust optimal control; transfer reinforcement learning; neural networks; disturbance observer
DOI
10.1109/TASE.2024.3453926
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents an online transfer heuristic dynamic programming (THDP) control approach for a class of nonlinear discrete systems. The approach integrates transfer learning with adaptive critic control. To design a robust optimal control strategy, sample data collected from a source task are used to acquire prior knowledge, which then guides the online control of the nonlinear systems in the target tasks. To avoid negative transfer and conserve computational resources, a novel attenuation function with a truncation mechanism is introduced. A disturbance compensation control mechanism is also developed to address uncertainties. Furthermore, the uncertain nonlinear systems under robust optimal control, as well as the weight errors of the neural networks, are shown to be uniformly ultimately bounded under certain conditions. Finally, two simulations verify the performance of the proposed algorithm.

Note to Practitioners: Adaptive dynamic programming (ADP) is one of the main methods for solving the Hamilton-Jacobi-Bellman (HJB) equation. With neural network approximation, however, it often requires many iterations and heavy computation, which wastes computational resources. For this reason, we propose an ADP control scheme with enhanced detection speed: prior knowledge is obtained by learning a class of similar tasks and is then used to assist the online control of the actual system. The scheme also accounts for system disturbances, which makes it more general and robust. Simulation experiments show that the scheme performs well.
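The abstract does not give the form of the attenuation function, so the following is only a minimal sketch of the general idea: a transfer weight that decays as online learning proceeds and is truncated to zero below a threshold, after which the prior knowledge is no longer evaluated. The exponential decay, the names `transfer_weight` and `blended_value`, and the values of `decay` and `cutoff` are all assumptions for illustration, not the authors' definitions.

```python
import math
from typing import Callable


def transfer_weight(k: int, decay: float = 0.05, cutoff: float = 1e-2) -> float:
    """Hypothetical attenuation function with truncation.

    Assumed form: exp(-decay * k), set to exactly 0 once it falls below
    `cutoff`. The paper's actual function may differ.
    """
    w = math.exp(-decay * k)
    return w if w >= cutoff else 0.0


def blended_value(
    k: int,
    x: float,
    prior_critic: Callable[[float], float],
    online_critic: Callable[[float], float],
    decay: float = 0.05,
    cutoff: float = 1e-2,
) -> float:
    """Blend a source-task value estimate with the online estimate at step k."""
    w = transfer_weight(k, decay, cutoff)
    if w == 0.0:
        # After truncation the prior critic is skipped entirely, which is
        # where the computational saving would come from in this sketch.
        return online_critic(x)
    return w * prior_critic(x) + (1.0 - w) * online_critic(x)
```

In this sketch the prior dominates early steps (`k = 0` gives weight 1) and is dropped entirely once the weight falls below `cutoff`, so stale source-task knowledge cannot keep influencing the target-task controller.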
Pages: 6752-6761
Number of pages: 10