Offline reinforcement learning for industrial process control: A case from steel

被引:14
|
作者
Deng, Jifei [1 ]
Sierla, Seppo [1 ]
Sun, Jie [2 ]
Vyatkin, Valeriy [1 ,3 ]
机构
[1] Aalto Univ, Sch Elect Engn, Dept Elect Engn & Automat, Espoo, Finland
[2] Northeastern Univ, State Key Lab Rolling & Automat, Shenyang, Peoples R China
[3] Lulea Univ Technol, Dept Comp Sci Elect & Space Engn, Lulea, Sweden
基金
中国国家自然科学基金;
关键词
Offline reinforcement learning; Deep ensemble; Industrial process control; Steel industry; Strip rolling;
D O I
10.1016/j.ins.2023.03.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flatness is a crucial indicator of strip quality that presents a challenge in regulation due to the high-speed process and the nonlinear relationship between flatness and process parameters. Conventional methods for controlling flatness are based on the first principles, empirical models, and predesigned rules, which are less adaptable to changing rolling conditions. To address this limitation, this paper proposed an offline reinforcement learning (RL) based data-driven method for flatness control. Based on the data collected from a factory, the offline RL method can learn the process dynamics from data to generate a control policy. Unlike online RL methods, the proposed method does not require a simulator for training, the policy can be potentially safer and more accurate since a simulator involves simplifications that can introduce bias. To obtain a steady performance, the proposed method incorporated ensemble Q-functions into policy eval-uation to address uncertainty estimation. To address distributional shifts, based on Q-values from ensemble Q-functions, behavior cloning was added to policy improvement. Simulation and comparison results showed that the proposed method outperformed the state-of-the-art offline RL methods and achieved the best performance in producing strips with lower flatness.
引用
收藏
页码:221 / 231
页数:11
相关论文
共 50 条
  • [1] Reinforcement learning for industrial process control: A case study in flatness control in steel industry
    Deng, Jifei
    Sierla, Seppo
    Sun, Jie
    Vyatkin, Valeriy
    COMPUTERS IN INDUSTRY, 2022, 143
  • [2] Reinforcement Learning for Online Industrial Process Control
    Govindhasamy, James J.
    McLoone, Sean F.
    Irwin, George W.
    French, John J.
    Doyle, Richard P.
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2005, 9 (01) : 23 - 30
  • [3] Offline Meta-Reinforcement Learning for Industrial Insertion
    Zhao, Tony Z.
    Luo, Jianlan
    Sushkov, Oleg
    Pevceviciute, Rugile
    Heess, Nicolas
    Scholz, Jon
    Schaal, Stefan
    Levine, Sergey
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6386 - 6393
  • [4] A review On reinforcement learning: Introduction and applications in industrial process control
    Nian, Rui
    Liu, Jinfeng
    Huang, Biao
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 139 (139)
  • [5] Safe reinforcement learning for industrial optimal control: A case study from metallurgical industry
    Zheng, Jun
    Jia, Runda
    Liu, Shaoning
    He, Dakuo
    Li, Kang
    Wang, Fuli
    INFORMATION SCIENCES, 2023, 649
  • [6] Autonomous Building Control Using Offline Reinforcement Learning
    Schepers, Jorren
    Eyckerman, Reinout
    Elmaz, Furkan
    Casteels, Wim
    Latre, Steven
    Hellinckx, Peter
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 246 - 255
  • [7] Offline Reinforcement Learning for Adaptive Control in Manufacturing Processes: A Press Hardening Case Study
    Nievas, Nuria
    Espinosa-Leal, Leonardo
    Pages-Bernaus, Adela
    Abio, Albert
    Echeverria, Lluis
    Bonada, Francesc
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (01)
  • [8] Offline Reinforcement Learning with Pseudometric Learning
    Dadashi, Robert
    Rezaeifar, Shideh
    Vieillard, Nino
    Hussenot, Leonard
    Pietquin, Olivier
    Geist, Matthieu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect
    Sacchetto, Luca
    Korte, Mathias
    Gronauer, Sven
    Diepold, Klaus
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7539 - 7544
  • [10] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211