A novel sim2real reinforcement learning algorithm for process control

被引:1
|
作者
Liang, Huiping [1 ,2 ]
Xie, Junyao [2 ]
Huang, Biao [2 ]
Li, Yonggang [1 ,3 ]
Sun, Bei [1 ,3 ]
Yang, Chunhua [1 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 2V4, Canada
[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Process control; Model-plant mismatch; Fix-horizon return; Industrial roasting process;
D O I
10.1016/j.ress.2024.110639
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
While reinforcement learning (RL) has potential in advanced process control and optimization, its direct interaction with real industrial processes can pose safety concerns. Model-based pre-training of RL may alleviate such risks. However, the intricate nature of industrial processes complicates the establishment of entirely accurate simulation models. Consequently, RL-based controllers relying on simulation models can easily suffer from model-plant mismatch. On the one hand, utilizing offline data for pre-training of RL can also mitigate safety risks. However, it requires well-represented historical datasets. This is demanding because industrial processes mostly run under a regulatory mode with basic controllers. To handle these issues, this paper proposes a novel sim2real reinforcement learning algorithm. First, a state adaptor (SA) is proposed to align simulated states with real states to mitigate the model-plant mismatch. Then, a fix-horizon return is designed to replace traditional infinite-step return to provide genuine labels for the critic network, enhancing learning efficiency and stability. Finally, applying proximal policy optimization (PPO), the SA-PPO method is introduced to implement the proposed sim2real algorithm. Experimental results show that SA-PPO improves performance in MSE by 1.96% and in R by 21.64% on average for roasting process simulation. This verifies the effectiveness of the proposed method.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] ADG-Net: A Sim2Real Multimodal Learning Framework for Adaptive Dexterous Grasping
    Zhang, Hui
    Lyu, Jianzhi
    Zhou, Chuangchuang
    Liang, Hongzhuo
    Tu, Yuyang
    Sun, Fuchun
    Zhang, Jianwei
    IEEE TRANSACTIONS ON CYBERNETICS, 2025, 55 (02) : 840 - 853
  • [42] Nigel-Mechatronic Design and Robust Sim2Real Control of an Overactuated Autonomous Vehicle
    Samak, Chinmay V.
    Samak, Tanmay V.
    Velni, Javad M.
    Krovi, Venkat N.
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2785 - 2793
  • [43] Material Decomposition in Spectral CT Using Deep Learning: A Sim2Real Transfer Approach
    Abascal, Juan F. P. J.
    Ducros, Nicolas
    Pronina, Valeriya
    Rit, Simon
    Rodesch, Pierre-Antoine
    Broussaud, Thomas
    Bussod, Suzanne
    Douek, Philippe C.
    Hauptmann, Andreas
    Arridge, Simon
    Peyrin, Francoise
    IEEE ACCESS, 2021, 9 : 25632 - 25647
  • [44] Sim-to-Real in Reinforcement Learning for Everyone
    Vacaro, Juliano
    Marques, Guilherme
    Oliveira, Bruna
    Paz, Gabriel
    Paula, Thomas
    Staehler, Wagston
    Murphy, David
    2019 LATIN AMERICAN ROBOTICS SYMPOSIUM, 2019 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR) AND 2019 WORKSHOP ON ROBOTICS IN EDUCATION (LARS-SBR-WRE 2019), 2019, : 305 - 310
  • [45] Sim2real flower detection towards automated Calendula harvesting
    Vierbergen, Wout
    Willekens, Axel
    Dekeyser, Donald
    Cool, Simon
    Wyffels, Francis
    BIOSYSTEMS ENGINEERING, 2023, 234 : 125 - 139
  • [46] Learning Nonprehensile Dynamic Manipulation: Sim2real Vision-Based Policy With a Surgical Robot
    Gondokaryono, Radian
    Haiderbhai, Mustafa
    Suryadevara, Sai Aneesh
    Kahrs, Lueder A.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6763 - 6770
  • [47] OBJECTFOLDER 2.0: A Multisensory Object Dataset for Sim2Real Transfer
    Gao, Ruohan
    Si, Zilin
    Chang, Yen-Yu
    Clarke, Samuel
    Bohg, Jeannette
    Li Fei-Fei
    Yuan, Wenzhen
    Wu, Jiajun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10588 - 10598
  • [48] Exploring Generative AI for Sim2Real in Driving Data Synthesis
    Zhao, Haonan
    Wang, Yiting
    Bashford-Rogers, Thomas
    Donzella, Valentina
    Debattista, Kurt
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 3071 - 3077
  • [49] Sim2real transfer learning for 3D human pose estimation: motion to the rescue
    Doersch, Carl
    Zisserman, Andrew
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [50] Sim2Real Object-Centric Keypoint Detection and Description
    Zhong, Chengliang
    Yang, Chao
    Sun, Fuchun
    Qi, Jinshan
    Mu, Xiaodong
    Liu, Huaping
    Huang, Wenbing
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5440 - 5449