Action-Dependent Heuristic Dynamic Programming With Experience Replay for Wastewater Treatment Processes

被引:5
|
作者
Qiao, Junfei [1 ,2 ]
Zhao, Mingming [1 ,2 ]
Wang, Ding [1 ,2 ]
Li, Menghua [1 ,2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
关键词
Action-dependent heuristic dynamic programming (ADHDP); adaptive critic control; adaptive dynamic programming (ADP); tracking control; wastewater treatment applications; DISSOLVED-OXYGEN CONTROL; OPTIMAL TRACKING;
D O I
10.1109/TII.2023.3344130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The wastewater treatment process (WWTP) is beneficial for maintaining sufficient water resources and recycling wastewater. A crucial link of WWTP is to ensure that the dissolved oxygen (DO) concentration is continuously maintained at the predetermined value, which can actually be considered as a tracking problem. In this article, an experience replay-based action-dependent heuristic dynamic programming (ER-ADHDP) method is developed to design the model-free tracking controller to accomplish the tracking goal of the DO concentration. First, the online ER-ADHDP controller is regarded as a supplementary controller to conduct the model-free tracking control alongside a stabilizing controller with a priori knowledge. The online ER-ADHDP method can adaptively adjust weight parameters of critic and action networks, thereby continuously ameliorating the tracking result over time. Second, the ER technique is integrated into the critic and action networks to promote the data utilization efficiency and accelerate the learning process. Third, a rational stability result is provided to theoretically ensure the usefulness of the ER-ADHDP tracking design. Finally, simulation experiments including different reference trajectories are conducted to show the superb tracking performance and excellent adaptability of the proposed ER-ADHDP method.
引用
收藏
页码:6257 / 6265
页数:9
相关论文
共 50 条
  • [1] Action-dependent Heuristic Dynamic Programming for Level Control of Three Tanks
    Song Shaojian
    Li Jinzhi
    Lin Xiaofeng
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 3, 2008, : 468 - 473
  • [2] Emergency voltage control based on action-dependent heuristic dynamic programming
    Feng, Xiao-Feng
    Liu, Ming-Bo
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2014, 42 (04): : 19 - 25
  • [3] A Scalable Low-Power Reconfigurable Accelerator for Action-Dependent Heuristic Dynamic Programming
    Zheng, Nan
    Mazumder, Pinaki
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (06) : 1897 - 1908
  • [4] Small Leak Location for Intelligent Pipeline System via Action-Dependent Heuristic Dynamic Programming
    Hu, Xuguang
    Zhang, Huaguang
    Ma, Dazhong
    Wang, Rui
    Tu, Pengfei
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (11) : 11723 - 11732
  • [5] Online optimizing hot forming parameters for alloy parts based on action-dependent heuristic dynamic programming
    Dong-Dong Chen
    Y. C. Lin
    The International Journal of Advanced Manufacturing Technology, 2019, 104 : 3745 - 3757
  • [6] Stability Analysis of Batch Offline Action-Dependent Heuristic Dynamic Programming Using Deep Neural Networks
    Lala, Timotei
    MATHEMATICS, 2025, 13 (02)
  • [7] Online optimizing hot forming parameters for alloy parts based on action-dependent heuristic dynamic programming
    Chen, Dong-Dong
    Lin, Y. C.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2019, 104 (9-12): : 3745 - 3757
  • [8] Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming
    Liu, Xuanyu
    Xu, Sheng
    Huang, Yueyang
    ISA TRANSACTIONS, 2019, 94 : 28 - 35
  • [9] Action-dependent heuristic dynamic programming for hybrid-order systems based on dynamic event-triggered mechanism
    Li, Jun
    Li, Huaqing
    Zheng, Lifeng
    Ran, Liang
    Ji, Lianghao
    NEUROCOMPUTING, 2024, 600
  • [10] Online discrete-time LQR controller design with integral action for bulk Bucket Wheel Reclaimer operational processes via Action-Dependent Heuristic Dynamic Programming
    de Moura, Jose Pinheiro
    Moraes Rego, Patricia Helena
    da Fonseca Neto, Joao Viana
    ISA TRANSACTIONS, 2019, 90 : 294 - 310