Novel data-driven two-dimensional Q-learning for optimal tracking control of batch process with unknown dynamics

被引:18
|
作者
Wen, Xin [1 ]
Shi, Huiyuan [1 ,2 ,3 ]
Su, Chengli [1 ,4 ,7 ]
Jiang, Xueying [5 ]
Li, Ping [1 ,4 ]
Yu, Jingxian [6 ]
机构
[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
[4] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan, Peoples R China
[5] Northeastern Univ, Sch Informat Sci & Engn, Shenyang, Peoples R China
[6] Liaoning Petrochem Univ, Sch Sci, Fushun, Peoples R China
[7] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China
基金
中国国家自然科学基金;
关键词
Batchprocess; Data-driven; 2Doff-policyQ-learning; Optimaltrackingcontrol; Injectionmolding; MODEL PREDICTIVE CONTROL; FAULT-TOLERANT CONTROL; STATE DELAY; DESIGN; FEEDBACK;
D O I
10.1016/j.isatra.2021.06.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In view that the previous control methods usually rely too much on the models of batch process and have difficulty in a practical batch process with unknown dynamics, a novel data-driven twodimensional (2D) off-policy Q-learning approach for optimal tracking control (OTC) is proposed to make the batch process obtain a model-free control law. Firstly, an extended state space equation composing of the state and output error is established for ensuring tracking performance of the designed controller. Secondly, the behavior policy of generating data and the target policy of optimization as well as learning is introduced based on this extended system. Then, the Bellman equation independent of model parameters is given via analyzing the relation between 2D value function and 2D Q-function. The measured data along the batch and time directions of batch process are just taken to carry out the policy iteration, which can figure out the optimal control problem despite lacking systematic dynamic information. The unbiasedness and convergence of the designed 2D off-policy Q-learning algorithm are proved. Finally, a simulation case for injection molding process manifests that control effect and tracking effect gradually become better with the increasing number of batches.(c) 2021 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:10 / 21
页数:12
相关论文
共 50 条
  • [21] Control of batch pulping process using data-driven constrained iterative learning control
    Shibani, B.
    Ambure, Prathmesh
    Purohit, Amit
    Suratia, Preetsinh
    Bhartiya, Sharad
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 170
  • [22] Data Driven Q-Learning for Commercial HVAC Control
    Faddel, Samy
    Tian, Guanyu
    Zhou, Qun
    Aburub, Haneen
    IEEE SOUTHEASTCON 2020, 2020,
  • [23] Data-Driven Optimal Tracking Control for Discrete-Time Nonlinear Systems With Unknown Dynamics Using Deterministic ADP
    Song, Shijie
    Gong, Dawei
    Zhu, Minglei
    Zhao, Yuyang
    Huang, Cong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1184 - 1198
  • [24] Data-Driven Optimal Synchronization for Complex Networks With Unknown Dynamics
    Hu, Wenjie
    Gao, Luli
    Dong, Tao
    IEEE ACCESS, 2020, 8 : 224083 - 224091
  • [25] Data-driven adaptive optimal control of linear uncertain systems with unknown jumping dynamics
    Zhang, Meng
    Gan, Ming-Gang
    Chen, Jie
    Jiang, Zhong-Ping
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (12): : 6087 - 6105
  • [26] Data-driven learning for robot control with unknown Jacobian
    Lyu, Shangke
    Cheah, Chien Chern
    AUTOMATICA, 2020, 120
  • [27] Data-driven-based Predictive Optimal for a class of Iterative Learning Control by Q-learning method
    Li, Jinze
    Tian, Senping
    Peng, Yunjian
    Gu, Panpan
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1214 - 1220
  • [28] Distributed data-driven three-dimensional optimal formation control for underactuated AUVs with actuator saturation and unknown dynamics
    Gong, Huibin
    Er, Meng Joo
    Liu, Yi
    Fan, Gaofeng
    OCEAN ENGINEERING, 2025, 325
  • [29] Output Feedback Optimal Tracking Control Using Reinforcement Q-Learning
    Rizvi, Syed Ali Asad
    Lin, Zongli
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3423 - 3428
  • [30] Data-Driven Optimal Structured Control for Unknown Symmetric Systems
    Massenio, Paolo R.
    Rizzello, Gianluca
    Naso, David
    Lewis, Frank L.
    Davoudi, Ali
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 179 - 184