Evolution-guided value iteration for optimal tracking control

被引:0
|
作者
Huang, Haiming
Wang, Ding [1 ]
Zhao, Mingming
Hu, Qinna
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Adaptive critic designs; Adaptive dynamic programming; Evolutionary computation; Intelligent control; Optimal tracking; Reinforcement learning; PARTICLE SWARM; REINFORCEMENT; CONVERGENCE; STABILITY; SYSTEMS;
D O I
10.1016/j.neucom.2024.127835
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, an evolution-guided value iteration (EGVI) algorithm is established to address optimal tracking problems for nonlinear nonaffine systems. Conventional adaptive dynamic programming algorithms rely on gradient information to improve the policy, which adheres to the first order necessity condition. Nonetheless, these methods encounter limitations when gradient information is intricate or system dynamics lack differentiability. In response to this challenge, evolutionary computation is leveraged by EGVI to search for the optimal policy without requiring an action network. The competition within the policy population serves as the driving force for policy improvement. Therefore, EGVI can effectively handle complex and non-differentiable systems. Additionally, this innovative method has the potential to enhance exploration efficiency and bolster the robustness of algorithms due to its population-based characteristics. Furthermore, the convergence of the algorithm and the stability of the policy are investigated based on the EGVI framework. Finally, the effectiveness of the established method is comprehensively demonstrated through two simulation experiments.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Robust value iteration for optimal control of discrete-time linear systems
    Lai, Jing
    Xiong, Junlin
    AUTOMATICA, 2025, 174
  • [42] Evolution-guided evaluation of the inverted terminal repeats of the synthetic transposon Sleeping Beauty
    Scheuermann, Barbara
    Diem, Tanja
    Ivics, Zoltan
    Andrade-Navarro, Miguel A.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [43] Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm
    Hao, Longyan
    Wang, Chaoli
    Zhang, Guang
    Jing, Chonglin
    Shi, Yibo
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, 53 (14) : 2845 - 2859
  • [44] General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    Qu, Qiuxia
    NEUROCOMPUTING, 2017, 245 : 114 - 123
  • [45] Evolution-guided engineering of non-heme iron enzymes involved in nogalamycin biosynthesis
    Wandi, Benjamin Nji
    Siitonen, Vilja
    Dinis, Pedro
    Vukic, Vladimir
    Salminen, Tiina A.
    Metsa-Ketela, Mikko
    FEBS JOURNAL, 2020, 287 (14) : 2998 - 3011
  • [46] Enhanced catalytic efficiency and substrate specificity of Streptomyces griseus trypsin by evolution-guided mutagenesis
    Shi, Jingcheng
    Duan, Chaofan
    Pang, Bo
    Wang, Yang
    Du, Guocheng
    Kang, Zhen
    SYSTEMS MICROBIOLOGY AND BIOMANUFACTURING, 2023, 3 (02): : 287 - 297
  • [47] Convergent Evolution-Guided Design of Antimicrobial Peptides Derived from Influenza A Virus Hemagglutinin
    Zhu, Shunyi
    Aumelas, Andre
    Gao, Bin
    JOURNAL OF MEDICINAL CHEMISTRY, 2011, 54 (04) : 1091 - 1095
  • [48] Value-iteration-based Adaptive Optimal Reagents Control for Antimony Flotation Process
    Li, Zhongmei
    Huang, Mengzhe
    Gui, Weihua
    Jiang, zhong-ping
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2244 - 2251
  • [49] Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration
    Granzotto, Mathieu
    Postoyan, Romain
    Busoniu, Lucian
    Nesic, Dragan
    Daafouz, Jamal
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6006 - 6011
  • [50] Improved value iteration for neural-network-based stochastic optimal control design
    Liang, Mingming
    Wang, Ding
    Liu, Derong
    NEURAL NETWORKS, 2020, 124 : 280 - 295