Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

Cited by: 38
Authors
Zhu, Lingwei [1]
Cui, Yunduan [1]
Takami, Go [2]
Kanokogi, Hiroaki [2]
Matsubara, Takamitsu [1]
Affiliations
[1] Nara Inst Sci & Technol, Grad Sch Sci & Technol, Takayama Cho 8916-5, Ikoma, Nara, Japan
[2] Yokogawa Elect Corp, New Field Dev Ctr, Nakacho 2-9-32, Musashino, Tokyo, Japan
Keywords
Chemical process control; Reinforcement learning; Vinyl acetate monomer; MODEL;
DOI
10.1016/j.conengprac.2020.104331
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper explores a reinforcement learning (RL) approach to designing automatic control strategies for a large-scale chemical process, as a first step toward using RL to control real-world chemical plants. A typical chemical process contains a large number of units for reaction, feeding, and material recycling, which gives rise to high-dimensional state and action spaces; deriving a suitable control policy with RL therefore requires a vast number of samples and prohibitive computation. To tackle this problem, a novel RL algorithm, Factorial Fast-food Dynamic Policy Programming (FFDPP), is proposed. FFDPP introduces two components into Dynamic Policy Programming (DPP), which achieves stable learning even with insufficient samples: a factorial framework that efficiently factorizes the action space, and Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high-dimensional state space. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that, without any knowledge of the model, the proposed method learned a stable policy with reasonable computational resources, producing a larger amount of VAM product with performance comparable to state-of-the-art model-based control.
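The abstract names Dynamic Policy Programming (DPP) as the base learner. The following is a minimal tabular sketch of the general DPP recursion on action preferences, not the paper's FFDPP: it omits both the factorial action framework and the Fast-food kernel approximation. The two-state toy problem, the function names, and the values of `gamma` and `eta` are all hypothetical and chosen only for illustration.

```python
import math


def softmax_weights(prefs, eta):
    """Boltzmann weights over one state's action preferences."""
    top = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(eta * (p - top)) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def boltzmann_mean(prefs, eta):
    """Softmax-weighted average of the preferences (the M_eta operator)."""
    weights = softmax_weights(prefs, eta)
    return sum(w * p for w, p in zip(weights, prefs))


def dpp_sweep(psi, rewards, transitions, gamma, eta):
    """One synchronous DPP update over every (state, action) pair.

    psi[s][a]            : current action preference
    rewards[s][a]        : immediate reward
    transitions[s][a][t] : probability of moving from s to t under a
    """
    m = [boltzmann_mean(row, eta) for row in psi]
    new_psi = []
    for s, row in enumerate(psi):
        new_row = []
        for a, pref in enumerate(row):
            expected_next = sum(p * m[t] for t, p in enumerate(transitions[s][a]))
            new_row.append(pref - m[s] + rewards[s][a] + gamma * expected_next)
        new_psi.append(new_row)
    return new_psi


# Hypothetical toy problem: two states, two actions, every action keeps
# the current state; action 1 pays reward 1 and action 0 pays nothing.
rewards = [[0.0, 1.0], [0.0, 1.0]]
transitions = [
    [[1.0, 0.0], [1.0, 0.0]],
    [[0.0, 1.0], [0.0, 1.0]],
]

psi = [[0.0, 0.0], [0.0, 0.0]]
for _ in range(10):
    psi = dpp_sweep(psi, rewards, transitions, gamma=0.9, eta=1.0)

# The policy is the softmax of the learned preferences.
policy = [softmax_weights(row, 1.0) for row in psi]
```

In this toy problem both actions lead to the same next state, so the preference gap between them grows by exactly the reward difference on every sweep and the softmax policy concentrates on the rewarded action. A table like `psi` has one entry per state-action pair, which is the cost the abstract says becomes prohibitive when the state and action spaces are high-dimensional.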
Pages: 10
Related papers
50 in total
  • [21] Plant-wide optimisation and control of a multi-scale pharmaceutical process
    Patel, Mayank P.
    Shah, Nilay
    Ashe, Robert
    21ST EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2011, 29 : 713 - 717
  • [22] Plant-wide control structures and strategies
    Ng, C
    Stephanopoulos, G
    DYNAMICS & CONTROL OF PROCESS SYSTEMS 1998, VOLUMES 1 AND 2, 1999, : 1 - 16
  • [23] Profitability through plant-wide control
    Moro, TL
    CONTROL AND INSTRUMENTATION, 1999, 31 (02): : 24 - 24
  • [24] Supervisory Stability Assurance Layer for Hierarchical Plant-wide Process Control
    Tri Tran
    Bao, Jie
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 4409 - 4414
  • [25] Plantwide control system design of the benchmark vinyl acetate monomer production plant
    Seki, Hiroya
    Ogawa, Morimasa
    Itoh, Toshiaki
    Ootakara, Shigeki
    Murata, Hisashi
    Hashimoto, Yoshihiro
    Kano, Manabu
    COMPUTERS & CHEMICAL ENGINEERING, 2010, 34 (08) : 1282 - 1295
  • [26] Plantwide control structure selection methodology for the benchmark vinyl acetate monomer plant
    Psaltis, Andreas
    Kookos, Ioannis K.
    Kravaris, Costas
    COMPUTERS & CHEMICAL ENGINEERING, 2014, 62 : 108 - 116
  • [27] NONLINEAR PLANT-WIDE CONTROL - APPLICATION TO A SUPERCRITICAL FLUID EXTRACTION PROCESS
    RAMCHANDRAN, B
    RIGGS, JB
    HEICHELHEIM, HR
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1992, 31 (01) : 290 - 300
  • [28] Using process topology in plant-wide control loop performance assessment
    Yim, S. Y.
    Ananthakumar, H. G.
    Benabbas, L.
    Horch, A.
    Drathb, R.
    Thornhill, N. F.
    COMPUTERS & CHEMICAL ENGINEERING, 2006, 31 (02) : 86 - 99
  • [29] PLANT-WIDE CONTROL OF THE TENNESSEE EASTMAN PROBLEM
    LYMAN, PR
    GEORGAKIS, C
    COMPUTERS & CHEMICAL ENGINEERING, 1995, 19 (03) : 321 - 331
  • [30] Perspectives on the synthesis of plant-wide control structures
    Stephanopoulos, G
    Ng, C
    JOURNAL OF PROCESS CONTROL, 2000, 10 (2-3) : 97 - 111