Reinforcement learning-based estimation for spatio-temporal systems

Cited by: 1
Authors
Mowlavi, Saviz [1]
Benosman, Mouhacine [1]
Affiliations
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
Source
SCIENTIFIC REPORTS | 2024, Vol. 14, Issue 01
Keywords
Estimation; Filtering; Partial differential equations; Model reduction; Reinforcement learning; MODEL-REDUCTION; FLUID-FLOWS;
DOI
10.1038/s41598-024-72055-1
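Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09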
Abstract
State estimators such as Kalman filters compute an estimate of the instantaneous state of a dynamical system from sparse sensor measurements. For spatio-temporal systems, whose dynamics are governed by partial differential equations (PDEs), state estimators are typically designed based on a reduced-order model (ROM) that projects the original high-dimensional PDE onto a computationally tractable low-dimensional space. However, ROMs are prone to large errors, which negatively affects the performance of the estimator. Here, we introduce the reinforcement learning reduced-order estimator (RL-ROE), a ROM-based estimator in which the correction term that takes in the measurements is given by a nonlinear policy trained through reinforcement learning. The nonlinearity of the policy enables the RL-ROE to compensate efficiently for errors of the ROM, while still taking advantage of the imperfect knowledge of the dynamics. Using examples involving the Burgers and Navier-Stokes equations with parametric uncertainties, we show that in the limit of very few sensors, the trained RL-ROE outperforms a Kalman filter designed using the same ROM and yields accurate instantaneous estimates of high-dimensional states corresponding to unknown initial conditions and physical parameter values. The RL-ROE opens the door to lightweight real-time sensing of systems governed by parametric PDEs.
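The estimator structure described in the abstract (a ROM prediction plus a measurement-driven correction term) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the ROM is taken to be linear (`A_r`), the measurement map `C_r` acts directly on the reduced state, the Kalman-style correction uses a fixed gain `K`, and the nonlinear policy is a small tanh network whose weights would in practice come from reinforcement learning. The names `estimator_step`, `linear_correction`, `policy_correction`, and `final_error` are hypothetical.

```python
import numpy as np


def estimator_step(z_prev, y, A_r, correction):
    """One estimator update: ROM prediction plus a correction driven by the measurement y."""
    z_pred = A_r @ z_prev  # prediction from the (assumed linear) reduced-order model
    return z_pred + correction(z_pred, y)


def linear_correction(K, C_r):
    """Kalman-style correction: fixed gain K applied to the innovation y - C_r @ z_pred."""
    return lambda z_pred, y: K @ (y - C_r @ z_pred)


def policy_correction(W1, b1, W2, b2):
    """Nonlinear correction: a small tanh network acting on (z_pred, y).

    The weights are placeholders; in an RL-trained estimator they would be learned.
    """
    def policy(z_pred, y):
        obs = np.concatenate([z_pred, y])
        return W2 @ np.tanh(W1 @ obs + b1) + b2
    return policy


def final_error(z_true0, z_est0, A_r, C_r, correction, n_steps):
    """Simulate the true reduced state and the estimator; return the final estimation error norm."""
    z_true, z_est = z_true0.copy(), z_est0.copy()
    for _ in range(n_steps):
        z_true = A_r @ z_true          # true dynamics (here identical to the ROM)
        y = C_r @ z_true               # sparse measurement: one sensor
        z_est = estimator_step(z_est, y, A_r, correction)
    return np.linalg.norm(z_true - z_est)


# Toy setup (all values are illustrative assumptions).
theta = 0.3
A_r = 0.99 * np.array([[np.cos(theta), -np.sin(theta)],
                       [np.sin(theta),  np.cos(theta)]])
C_r = np.array([[1.0, 0.0]])           # only the first reduced coordinate is measured
K = np.array([[0.5], [0.1]])           # hand-picked stabilizing gain, not an optimal Kalman gain

err_linear = final_error(np.array([1.0, 0.5]), np.zeros(2), A_r, C_r,
                         linear_correction(K, C_r), n_steps=200)
print("final error with linear correction:", err_linear)
```

Swapping `linear_correction(K, C_r)` for `policy_correction(...)` changes only the correction term while leaving the ROM prediction untouched, which is the design idea the abstract attributes to the RL-ROE. In this toy case the model is exact, so the linear correction already suffices; the abstract's claim concerns the case where the ROM is inaccurate.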
Pages: 13
Related papers
50 items in total
  • [21] Spatio-Temporal Contrastive Learning-Based Adaptive Graph Augmentation for Traffic Flow Prediction
    Zhang, Dingkai
    Wang, Pengfei
    Ding, Lu
    Wang, Xiaoling
    He, Jifeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 1304 - 1318
  • [22] Automatic Spatio-Temporal Deep Learning-Based Approach for Cardiac Cine MRI Segmentation
    Ammar, Abderazzak
    Bouattane, Omar
    Youssfi, Mohamed
    NETWORKING, INTELLIGENT SYSTEMS AND SECURITY, 2022, 237 : 59 - 73
  • [23] Robust sensor location for parameter estimation in iterative learning control of spatio-temporal systems
    Kowalow, Damian
    Patan, Maciej
    2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS), 2017,
  • [24] Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling
    Sosic, Adrian
    Zoubir, Abdelhak M.
    Rueckert, Elmar
    Peters, Jan
    Koeppl, Heinz
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 19
  • [25] Spatio-temporal Trajectory Learning using Simulation Systems
    Glake, Daniel
    Panse, Fabian
    Lenfers, Ulfia
    Clemen, Thomas
    Ritter, Norbert
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 592 - 602
  • [26] Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows
    Lin, Xiaoshan
    Koochakzadeh, Abbasali
    Yazicioglu, Yasin
    Aksaray, Derya
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 8680 - 8686
  • [27] CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
    Ho, Chi-Kai
    King, Chung-Ta
    IEEE ACCESS, 2023, 11 : 26820 - 26831
  • [28] Adaptive motion estimation based on spatio-temporal correlation
    Kim, DW
    Choi, JS
    Kim, JT
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 1998, 13 (02) : 161 - 170
  • [29] Adaptive motion estimation based on spatio-temporal correlation
    Hong, Bo
    Zhuang, Jianmin
    Yu, Songyu
    2000, Shanghai Comp Soc, China (26):
  • [30] Block motion estimation based on spatio-temporal correlation
    Kim, DW
    Choi, JH
    Choi, YS
    Jeon, CH
    Ko, NY
    1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 955 - 960