Evolution-guided value iteration for optimal tracking control

Cited by: 0
Authors
Huang, Haiming
Wang, Ding [1]
Zhao, Mingming
Hu, Qinna
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Adaptive critic designs; Adaptive dynamic programming; Evolutionary computation; Intelligent control; Optimal tracking; Reinforcement learning; PARTICLE SWARM; REINFORCEMENT; CONVERGENCE; STABILITY; SYSTEMS;
DOI
10.1016/j.neucom.2024.127835
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, an evolution-guided value iteration (EGVI) algorithm is established to address optimal tracking problems for nonlinear nonaffine systems. Conventional adaptive dynamic programming algorithms rely on gradient information to improve the policy, in line with the first-order necessary condition. Nonetheless, these methods encounter limitations when gradient information is intricate or the system dynamics are not differentiable. In response to this challenge, EGVI leverages evolutionary computation to search for the optimal policy without requiring an action network. The competition within the policy population serves as the driving force for policy improvement. Therefore, EGVI can effectively handle complex and non-differentiable systems. Additionally, this method has the potential to enhance exploration efficiency and bolster the robustness of algorithms due to its population-based characteristics. Furthermore, the convergence of the algorithm and the stability of the policy are investigated based on the EGVI framework. Finally, the effectiveness of the established method is demonstrated through two simulation experiments.
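
The idea in the abstract, replacing the gradient-based policy improvement step of value iteration with a population-based search, can be illustrated with a minimal sketch. This is not the authors' EGVI implementation: the scalar plant, the quadratic utility, the discount factor, the state grid, and the simple elitist mutation scheme below are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import numpy as np

# Hypothetical scalar plant: nonaffine and non-differentiable in u (illustrative only).
def plant(x, u):
    return 0.8 * np.sin(x) + u + 0.2 * np.abs(u)

# Assumed quadratic utility: squared tracking error plus a small control penalty.
def utility(x, u, r):
    return (x - r) ** 2 + 0.01 * u ** 2

def evolve_control(x, r, grid, V, rng, u_lo=-2.0, u_hi=2.0,
                   pop_size=21, n_elite=5, generations=10, gamma=0.9):
    """Gradient-free search for the control minimizing U + gamma * V(next state)."""
    def cost(u):
        # np.interp evaluates the tabulated value function (clamped at the grid ends).
        return utility(x, u, r) + gamma * np.interp(plant(x, u), grid, V)

    pop = np.linspace(u_lo, u_hi, pop_size)  # initial population of candidate controls
    sigma = 0.3                              # mutation scale, shrunk each generation
    n_child = pop_size - n_elite
    for _ in range(generations):
        # Competition: only the lowest-cost candidates survive as elites.
        elite = pop[np.argsort(cost(pop))[:n_elite]]
        # Children are mutated copies of randomly chosen elites.
        children = rng.choice(elite, n_child) + sigma * rng.standard_normal(n_child)
        pop = np.concatenate([elite, np.clip(children, u_lo, u_hi)])
        sigma *= 0.7
    costs = cost(pop)
    best = np.argmin(costs)
    return pop[best], costs[best]

def evolution_guided_vi(r=0.5, n_iter=40, seed=0):
    """Value iteration on a state grid; the minimization uses evolutionary search."""
    rng = np.random.default_rng(seed)
    grid = np.linspace(-2.0, 2.0, 41)
    V = np.zeros_like(grid)       # V_0 = 0
    policy = np.zeros_like(grid)
    for _ in range(n_iter):
        V_new = np.empty_like(V)
        for j, x in enumerate(grid):
            policy[j], V_new[j] = evolve_control(x, r, grid, V, rng)
        V = V_new
    return grid, V, policy

def simulate(grid, policy, x0=-1.0, steps=30):
    """Roll out the tabulated policy (linearly interpolated between grid points)."""
    x = x0
    for _ in range(steps):
        x = plant(x, np.interp(x, grid, policy))
    return x
```

Calling `grid, V, policy = evolution_guided_vi()` and then `simulate(grid, policy)` rolls the learned policy out from `x0 = -1.0` toward the assumed constant reference `r = 0.5`. No action network and no derivative of `plant` is used anywhere; the control at each grid state comes only from ranking and mutating candidates, which is the property the abstract emphasizes.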
Pages: 9