Moment-Based Reinforcement Learning for Ensemble Control

被引：2

作者：

Yu, Yao-Chi ^{[1
]}

Narayanan, Vignesh ^{[2
]}

Li, Jr-Shin ^{[1
]}

机构：

[1] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63130 USA

[2] Univ South Carolina, AI Inst, Columbia, SC 29208 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 09期

基金：

美国国家科学基金会; 美国国家卫生研究院;

关键词：

Data-driven control; ensemble control systems; moment methods; reinforcement learning (RL); CONTROLLABILITY; CONVERGENCE; SYSTEMS;

D O I：

10.1109/TNNLS.2023.3264151

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Problems involving controlling the collective behavior of a population of structurally similar dynamical systems, the so-called ensemble control, arise in diverse emerging applications and pose a grand challenge in systems science and control engineering. Owing to the severely under-actuated nature and the difficulty of placing large-scale sensor networks, ensemble systems are limited to being actuated and monitored at the population level. Moreover, mathematical models describing the dynamics of ensemble systems are often elusive. Therefore, it is essential to design broadcast controls that excite the entire population in such a way that the heterogeneity in system dynamics is robustly compensated. In this article, we propose a reinforcement learning (RL)-based data-driven control framework incorporating population-level aggregated measurement data to learn a global control signal for steering a dynamic population in the desired manner. In particular, we introduce the notion of ensemble moments induced by aggregated measurements and derive the associated moment system to the original ensemble system. Then, using the moment system, we learn an approximation of optimal value functions and the associated policies in terms of ensemble moments through RL. We illustrate the feasibility and scalability of the proposed moment-based approach via numerical experiments using a population of linear, bilinear, and nonlinear dynamic ensemble systems. We report that the proposed method achieves the desired control objectives of various ensemble control tasks and obtains significantly better averaged-reward when compared with three existing methods.

引用

页码：12653 / 12664

页数：12

共 50 条

[21] On generalization in moment-based domain adaptation
Zellinger, Werner
Moser, Bernhard A.
Saminger-Platz, Susanne
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2021, 89 (3-4) : 333 - 369
[22] Third moment-based causal inference
Wiedermann W.
Behaviormetrika, 2022, 49 (2) : 303 - 328
[23] Moment-based estimation of stochastic volatility
Bregantini, Daniele
JOURNAL OF BANKING & FINANCE, 2013, 37 (12) : 4755 - 4764
[24] Moment-based metrics for mesh simplification
Tang, H.
Shu, H. Z.
Dillenseger, J. L.
Bao, X. D.
Luo, L. M.
COMPUTERS & GRAPHICS-UK, 2007, 31 (05): : 710 - 718
[25] MOMENT-BASED INFERENCE WITH STRATIFIED DATA
Tripathi, Gautam
ECONOMETRIC THEORY, 2011, 27 (01) : 47 - 73
[26] MOMENT-BASED CRITERIA FOR DETERMINING BIOEQUIVALENCE
HOLDER, DJ
HSUAN, F
BIOMETRIKA, 1993, 80 (04) : 835 - 846
[27] Moment-based tail index estimation
McElroy, Tucker
Politis, Dimitris N.
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (04) : 1389 - 1406
[28] Multiscale moment-based painterly rendering
Nehab, D
Velho, L
SIBGRAPI 2002: XV BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2002, : 244 - 251
[29] On generalization in moment-based domain adaptation
Werner Zellinger
Bernhard A. Moser
Susanne Saminger-Platz
Annals of Mathematics and Artificial Intelligence, 2021, 89 : 333 - 369
[30] Moment-based techniques for image retrieval
Di Ruberto, Cecilia
Morgera, Andrea
DEXA 2008: 19TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, : 155 - 159

← 1 2 3 4 5 →