Model-based reinforcement learning control of reaction-diffusion problems

被引：0

作者：

Schenk, Christina ^{[1
]}

Vasudevan, Aditya ^{[1
]}

Haranczyk, Maciej ^{[1
]}

Romero, Ignacio ^{[1
,2
]}

机构：

[1] IMDEA Mat Inst, Eric Kandel 2, Madrid 28906, Spain

[2] Univ Politecn Madrid, Dept Mech Engn, Madrid, Spain

来源：

OPTIMAL CONTROL APPLICATIONS & METHODS | 2024年 / 45卷 / 06期

关键词：

disease and thermal transport; optimal control; partial differential equations; policy-gradient methods; reaction-diffusion; reinforcement learning; DYNAMICS;

D O I：

10.1002/oca.3196

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Mathematical and computational tools have proven to be reliable in decision-making processes. In recent times, in particular, machine learning-based methods are becoming increasingly popular as advanced support tools. When dealing with control problems, reinforcement learning has been applied to decision-making in several applications, most notably in games. The success of these methods in finding solutions to complex problems motivates the exploration of new areas where they can be employed to overcome current difficulties. In this article, we explore the use of automatic control strategies to initial boundary value problems in thermal and disease transport. Specifically, in this work, we adapt an existing reinforcement learning algorithm using a stochastic policy gradient method and we introduce two novel reward functions to drive the flow of the transported field. The new model-based framework exploits the interactions between a reaction-diffusion model and the modified agent. The results show that certain controls can be implemented successfully in these applications, although model simplifications had to be assumed. This paper explores reinforcement learning for control in thermal and disease transport problems, adapting a stochastic policy gradient algorithm and introducing novel reward functions. The new model-based framework leverages interactions between a reaction-diffusion model and the modified agent. Results demonstrate successful RL-based control for these applications despite necessary model simplifications. image

引用

页码：2897 / 2914

页数：18

共 50 条

[21] A survey on model-based reinforcement learning
Fan-Ming LUO
Tian XU
Hang LAI
Xiong-Hui CHEN
Weinan ZHANG
Yang YU
Science China(Information Sciences), 2024, 67 (02) : 59 - 84
[22] Nonparametric model-based reinforcement learning
Atkeson, CG
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1008 - 1014
[23] The ubiquity of model-based reinforcement learning
Doll, Bradley B.
Simon, Dylan A.
Daw, Nathaniel D.
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 1075 - 1081
[24] Multiple model-based reinforcement learning
Doya, K
Samejima, K
Katagiri, K
Kawato, M
NEURAL COMPUTATION, 2002, 14 (06) : 1347 - 1369
[25] A survey on model-based reinforcement learning
Luo, Fan-Ming
Xu, Tian
Lai, Hang
Chen, Xiong-Hui
Zhang, Weinan
Yu, Yang
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
[26] MIXING IN REACTION-DIFFUSION PROBLEMS
SOKOLOV, IM
BLUMEN, A
INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 1991, 5 (20): : 3127 - 3164
[27] Upscaling in reaction-diffusion problems
Timofte, Claudia
Numerical Analysis and Applied Mathematics, 2007, 936 : 547 - 550
[28] BIFURCATIONS IN REACTION-DIFFUSION PROBLEMS
HOWARD, LN
ADVANCES IN MATHEMATICS, 1975, 16 (02) : 246 - 258
[29] Reaction-Diffusion Model-Based Research on Formation Mechanism of Neuron Dendritic Spine Patterns
Jia, Yiqing
Zhao, Qili
Yin, Hongqiang
Guo, Shan
Sun, Mingzhu
Yang, Zhuo
Zhao, Xin
FRONTIERS IN NEUROROBOTICS, 2021, 15
[30] Hybrid control for combining model-based and model-free reinforcement learning
Pinosky, Allison
Abraham, Ian
Broad, Alexander
Argall, Brenna
Murphey, Todd D.
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 42 (06): : 337 - 355

← 1 2 3 4 5 →