Offline reinforcement learning for industrial process control: A case from steel

被引：14

作者：

Deng, Jifei ^{[1
]}

Sierla, Seppo ^{[1
]}

Sun, Jie ^{[2
]}

Vyatkin, Valeriy ^{[1
,3
]}

机构：

[1] Aalto Univ, Sch Elect Engn, Dept Elect Engn & Automat, Espoo, Finland

[2] Northeastern Univ, State Key Lab Rolling & Automat, Shenyang, Peoples R China

[3] Lulea Univ Technol, Dept Comp Sci Elect & Space Engn, Lulea, Sweden

来源：

INFORMATION SCIENCES | 2023年 / 632卷

基金：

中国国家自然科学基金;

关键词：

Offline reinforcement learning; Deep ensemble; Industrial process control; Steel industry; Strip rolling;

D O I：

10.1016/j.ins.2023.03.019

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Flatness is a crucial indicator of strip quality that presents a challenge in regulation due to the high-speed process and the nonlinear relationship between flatness and process parameters. Conventional methods for controlling flatness are based on the first principles, empirical models, and predesigned rules, which are less adaptable to changing rolling conditions. To address this limitation, this paper proposed an offline reinforcement learning (RL) based data-driven method for flatness control. Based on the data collected from a factory, the offline RL method can learn the process dynamics from data to generate a control policy. Unlike online RL methods, the proposed method does not require a simulator for training, the policy can be potentially safer and more accurate since a simulator involves simplifications that can introduce bias. To obtain a steady performance, the proposed method incorporated ensemble Q-functions into policy eval-uation to address uncertainty estimation. To address distributional shifts, based on Q-values from ensemble Q-functions, behavior cloning was added to policy improvement. Simulation and comparison results showed that the proposed method outperformed the state-of-the-art offline RL methods and achieved the best performance in producing strips with lower flatness.

引用

页码：221 / 231

页数：11

共 50 条

[1] Reinforcement learning for industrial process control: A case study in flatness control in steel industry
Deng, Jifei
Sierla, Seppo
Sun, Jie
Vyatkin, Valeriy
COMPUTERS IN INDUSTRY, 2022, 143
[2] Reinforcement Learning for Online Industrial Process Control
Govindhasamy, James J.
McLoone, Sean F.
Irwin, George W.
French, John J.
Doyle, Richard P.
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2005, 9 (01) : 23 - 30
[3] Offline Meta-Reinforcement Learning for Industrial Insertion
Zhao, Tony Z.
Luo, Jianlan
Sushkov, Oleg
Pevceviciute, Rugile
Heess, Nicolas
Scholz, Jon
Schaal, Stefan
Levine, Sergey
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6386 - 6393
[4] A review On reinforcement learning: Introduction and applications in industrial process control
Nian, Rui
Liu, Jinfeng
Huang, Biao
COMPUTERS & CHEMICAL ENGINEERING, 2020, 139 (139)
[5] Safe reinforcement learning for industrial optimal control: A case study from metallurgical industry
Zheng, Jun
Jia, Runda
Liu, Shaoning
He, Dakuo
Li, Kang
Wang, Fuli
INFORMATION SCIENCES, 2023, 649
[6] Autonomous Building Control Using Offline Reinforcement Learning
Schepers, Jorren
Eyckerman, Reinout
Elmaz, Furkan
Casteels, Wim
Latre, Steven
Hellinckx, Peter
ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 246 - 255
[7] Offline Reinforcement Learning for Adaptive Control in Manufacturing Processes: A Press Hardening Case Study
Nievas, Nuria
Espinosa-Leal, Leonardo
Pages-Bernaus, Adela
Abio, Albert
Echeverria, Lluis
Bonada, Francesc
JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (01)
[8] Offline Reinforcement Learning with Pseudometric Learning
Dadashi, Robert
Rezaeifar, Shideh
Vieillard, Nino
Hussenot, Leonard
Pietquin, Olivier
Geist, Matthieu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[9] Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect
Sacchetto, Luca
Korte, Mathias
Gronauer, Sven
Diepold, Klaus
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7539 - 7544
[10] Offline Model-Based Reinforcement Learning for Tokamak Control
Char, Ian
Abbate, Joseph
Bardoczi, Laszlo
Boyer, Mark D.
Chung, Youngseog
Conlin, Rory
Erickson, Keith
Mehta, Viraj
Richner, Nathan
Kolemen, Egemen
Schneider, Jeff
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211

← 1 2 3 4 5 →