A novel sim2real reinforcement learning algorithm for process control

Cited by: 1

Authors:

Liang, Huiping [1,2]

Xie, Junyao [2]

Huang, Biao [2]

Li, Yonggang [1,3]

Sun, Bei [1,3]

Yang, Chunhua [1]

Affiliations:

[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China

[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 2V4, Canada

[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China

Source:

RELIABILITY ENGINEERING & SYSTEM SAFETY | 2025 / Volume 254

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Process control; Model-plant mismatch; Fix-horizon return; Industrial roasting process;

DOI:

10.1016/j.ress.2024.110639

Chinese Library Classification:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

While reinforcement learning (RL) has potential in advanced process control and optimization, its direct interaction with real industrial processes can pose safety concerns. Model-based pre-training of RL may alleviate such risks. However, the intricate nature of industrial processes complicates the establishment of entirely accurate simulation models. Consequently, RL-based controllers relying on simulation models can easily suffer from model-plant mismatch. On the other hand, utilizing offline data for pre-training of RL can also mitigate safety risks. However, it requires well-represented historical datasets. This is demanding because industrial processes mostly run under a regulatory mode with basic controllers. To handle these issues, this paper proposes a novel sim2real reinforcement learning algorithm. First, a state adaptor (SA) is proposed to align simulated states with real states to mitigate the model-plant mismatch. Then, a fix-horizon return is designed to replace traditional infinite-step return to provide genuine labels for the critic network, enhancing learning efficiency and stability. Finally, applying proximal policy optimization (PPO), the SA-PPO method is introduced to implement the proposed sim2real algorithm. Experimental results show that SA-PPO improves performance in MSE by 1.96% and in R by 21.64% on average for roasting process simulation. This verifies the effectiveness of the proposed method.
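The abstract does not define the fix-horizon return, so the following is only a minimal sketch of one plausible reading: a discounted sum over exactly H future rewards, computed only for time steps where all H rewards have been observed, so the critic target needs no bootstrapped estimate. The function name `fixed_horizon_returns`, the discount `gamma`, and the truncation rule are illustrative assumptions, not the paper's definition.

```python
from typing import List


def fixed_horizon_returns(
    rewards: List[float], horizon: int, gamma: float = 0.99
) -> List[float]:
    """Discounted return over exactly `horizon` steps (assumed definition).

    Returns one value per time step t for which rewards t .. t+horizon-1
    are all available; later time steps are skipped rather than
    bootstrapped from a value estimate.
    """
    if horizon <= 0:
        raise ValueError("horizon must be a positive integer")

    returns = []
    # Only time steps with a full window of observed rewards get a label.
    for t in range(len(rewards) - horizon + 1):
        g = 0.0
        for k in range(horizon):
            g += (gamma ** k) * rewards[t + k]
        returns.append(g)
    return returns
```

Under this assumed definition every label is computed purely from observed rewards, which is one way to read the abstract's phrase "genuine labels for the critic network".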


Pages: 12

50 items in total

[41] ADG-Net: A Sim2Real Multimodal Learning Framework for Adaptive Dexterous Grasping
Zhang, Hui
Lyu, Jianzhi
Zhou, Chuangchuang
Liang, Hongzhuo
Tu, Yuyang
Sun, Fuchun
Zhang, Jianwei
IEEE TRANSACTIONS ON CYBERNETICS, 2025, 55 (02) : 840 - 853
[42] Nigel-Mechatronic Design and Robust Sim2Real Control of an Overactuated Autonomous Vehicle
Samak, Chinmay V.
Samak, Tanmay V.
Velni, Javad M.
Krovi, Venkat N.
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2785 - 2793
[43] Material Decomposition in Spectral CT Using Deep Learning: A Sim2Real Transfer Approach
Abascal, Juan F. P. J.
Ducros, Nicolas
Pronina, Valeriya
Rit, Simon
Rodesch, Pierre-Antoine
Broussaud, Thomas
Bussod, Suzanne
Douek, Philippe C.
Hauptmann, Andreas
Arridge, Simon
Peyrin, Francoise
IEEE ACCESS, 2021, 9 : 25632 - 25647
[44] Sim-to-Real in Reinforcement Learning for Everyone
Vacaro, Juliano
Marques, Guilherme
Oliveira, Bruna
Paz, Gabriel
Paula, Thomas
Staehler, Wagston
Murphy, David
2019 LATIN AMERICAN ROBOTICS SYMPOSIUM, 2019 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR) AND 2019 WORKSHOP ON ROBOTICS IN EDUCATION (LARS-SBR-WRE 2019), 2019, : 305 - 310
[45] Sim2real flower detection towards automated Calendula harvesting
Vierbergen, Wout
Willekens, Axel
Dekeyser, Donald
Cool, Simon
Wyffels, Francis
BIOSYSTEMS ENGINEERING, 2023, 234 : 125 - 139
[46] Learning Nonprehensile Dynamic Manipulation: Sim2real Vision-Based Policy With a Surgical Robot
Gondokaryono, Radian
Haiderbhai, Mustafa
Suryadevara, Sai Aneesh
Kahrs, Lueder A.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6763 - 6770
[47] OBJECTFOLDER 2.0: A Multisensory Object Dataset for Sim2Real Transfer
Gao, Ruohan
Si, Zilin
Chang, Yen-Yu
Clarke, Samuel
Bohg, Jeannette
Li Fei-Fei
Yuan, Wenzhen
Wu, Jiajun
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10588 - 10598
[48] Exploring Generative AI for Sim2Real in Driving Data Synthesis
Zhao, Haonan
Wang, Yiting
Bashford-Rogers, Thomas
Donzella, Valentina
Debattista, Kurt
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 3071 - 3077
[49] Sim2real transfer learning for 3D human pose estimation: motion to the rescue
Doersch, Carl
Zisserman, Andrew
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[50] Sim2Real Object-Centric Keypoint Detection and Description
Zhong, Chengliang
Yang, Chao
Sun, Fuchun
Qi, Jinshan
Mu, Xiaodong
Liu, Huaping
Huang, Wenbing
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5440 - 5449
