Process control of mAb production using multi-actor proximal policy optimization

被引:8
|
作者
Gupta, Nikita [1 ]
Anand, Shikhar [1 ]
Joshi, Tanuja [1 ]
Kumar, Deepak [1 ]
Ramteke, Manojkumar [1 ,2 ]
Kodamana, Hariprasad [1 ,2 ]
机构
[1] IIT Delhi, Dept Chem Engn, Delhi, India
[2] IIT Delhi, Yardi Sch Artificial Intelligence, Delhi, India
来源
关键词
Monoclonal antibodies (mAb); Reinforcement learning (RL); Deep deterministic policy gradient (DDPG); Proximal policy optimization (PPO); Twin delayed deep deterministic policy; gradient (TD3); MULTIOBJECTIVE OPTIMIZATION; BATCH PROCESSES; CHO-CELLS; TEMPERATURE; PERFORMANCE; METABOLISM; CHALLENGES; EFFICIENT; PROGRESS; IMPACT;
D O I
10.1016/j.dche.2023.100108
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Monoclonal antibodies (mAb) are biopharmaceutical products that improve human immunity. In this work, we propose a multi-actor proximal policy optimization-based reinforcement learning (RL) for the control of mAb production. Here, manipulated variable is flowrate and the control variable is mAb concentration. Based on root mean square error (RMSE) values and convergence performance, it has been observed that multi-actor PPO has performed better as compared to other RL algorithms. It is observed that PPO predicts a 40 % reduction in the number of days to reach the desired concentration. Moreover, the performance of PPO is improved as the number of actors increases. PPO agent shows the best performance with three actors, but on further increasing, its performance deteriorated. These results are verified based on three case studies, namely, (i) for nominal conditions, (ii) in the presence of noise in raw materials and measurements, and (iii) in the presence of stochastic disturbance in temperature and noise in measurements. The results indicate that the proposed approach outperforms the deep deterministic policy gradient (DDPG), twin delayed deep deterministic policy gradient (TD3), and proximal policy optimization (PPO) algorithms for the control of the bioreactor system.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Cooperative multi-actor multi-criteria optimization framework for process integration
    Lechtenberg, Fabian
    Areste-Salo, Lluc
    Espuna, Antonio
    Graells, Moises
    APPLIED ENERGY, 2025, 377
  • [2] Analysis of multi-actor policy contexts using perception graphs
    Bots, Pieter W. G.
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY (IAT 2007), 2007, : 160 - 167
  • [3] Optimization of Lignocellulosic Waste Biorefinery using Multi-Actor Multi-Objective Mathematical Framework
    Ashraf, Muhammad T.
    Torres, Ana I.
    Cybulska, Iwona
    Fang, Chuanji
    Thomsen, Mette H.
    Schmidt, Jens E.
    Stephanopoulos, George
    26TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING (ESCAPE), PT B, 2016, 38B : 1317 - 1322
  • [4] Elaborating a Knowledge Management Plan: A Multi-actor Decision Process
    Tounkara, Thierno
    BEST PRACTICES FOR THE KNOWLEDGE SOCIETY: KNOWLEDGE, LEARNING, DEVELOPMENT AND TECHNOLOGY FOR ALL, 2009, 49 : 305 - 318
  • [5] Assessing Spatial Data Infrastructure Policy Strategies Using the Multi-Actor Multi-Criteria Analysis
    Geudens, Tessa
    Macharis, Cathy
    Crompvoets, Joep
    Plastria, Frank
    INTERNATIONAL JOURNAL OF SPATIAL DATA INFRASTRUCTURES RESEARCH, 2009, 4 : 265 - 297
  • [6] Self-guided deep deterministic policy gradient with multi-actor
    Chen, Hongming
    Liu, Quan
    Zhong, Shan
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (15): : 9723 - 9732
  • [7] Cooperative dual-actor proximal policy optimization algorithm for multi-robot complex control task
    Baltes, Jacky
    Akbar, Ilham
    Saeedvand, Saeed
    ADVANCED ENGINEERING INFORMATICS, 2025, 63
  • [8] Self-guided deep deterministic policy gradient with multi-actor
    Hongming Chen
    Quan Liu
    Shan Zhong
    Neural Computing and Applications, 2021, 33 : 9723 - 9732
  • [9] Multi-actor Policy Dynamics in Research Evaluation: Experts, Databases, and Academics
    Dagiene, Eleonora
    Waltman, Ludo
    Dix, Guus
    HIGHER EDUCATION POLICY, 2025,
  • [10] A multi-actor multi-criteria scenario analysis of regional sustainable resource policy
    Akgun, Aliye Ahu
    van Leeuwen, Eveline
    Nijkamp, Peter
    ECOLOGICAL ECONOMICS, 2012, 78 : 19 - 28