Moment-Based Reinforcement Learning for Ensemble Control

Cited by: 2
Authors
Yu, Yao-Chi [1]
Narayanan, Vignesh [2]
Li, Jr-Shin [1]
Affiliations
[1] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63130 USA
[2] Univ South Carolina, AI Inst, Columbia, SC 29208 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA)
Keywords
Data-driven control; ensemble control systems; moment methods; reinforcement learning (RL); CONTROLLABILITY; CONVERGENCE; SYSTEMS;
DOI
10.1109/TNNLS.2023.3264151
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Ensemble control, the problem of controlling the collective behavior of a population of structurally similar dynamical systems, arises in diverse emerging applications and poses a grand challenge in systems science and control engineering. Because ensemble systems are severely underactuated and large-scale sensor networks are difficult to deploy, such systems can be actuated and monitored only at the population level. Moreover, mathematical models describing the dynamics of ensemble systems are often elusive. It is therefore essential to design broadcast controls that excite the entire population in such a way that the heterogeneity in system dynamics is robustly compensated. In this article, we propose a reinforcement learning (RL)-based data-driven control framework that incorporates population-level aggregated measurement data to learn a global control signal for steering a dynamic population in the desired manner. In particular, we introduce the notion of ensemble moments induced by aggregated measurements and derive the moment system associated with the original ensemble system. Then, using the moment system, we learn an approximation of the optimal value functions and the associated policies in terms of ensemble moments through RL. We illustrate the feasibility and scalability of the proposed moment-based approach via numerical experiments on populations of linear, bilinear, and nonlinear dynamic ensemble systems. We report that the proposed method achieves the desired control objectives in various ensemble control tasks and obtains a significantly higher average reward than three existing methods.
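To make the idea of ensemble moments concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical scalar linear ensemble dx/dt = beta*x + u with the parameter beta spread over [-1, 1], a single broadcast input u, and moments defined as population averages of the state weighted by Legendre polynomials in beta. The abstract does not specify the basis or the dynamics, so the ensemble model, the Legendre basis, the function name ensemble_moments, and all numerical values are assumptions chosen for illustration.

```python
import numpy as np
from numpy.polynomial import legendre


def ensemble_moments(betas, states, order):
    """Empirical Legendre moments m_k = mean_i P_k(beta_i) * x_i, k = 0..order.

    Assumption: the Legendre basis is an illustrative choice, not
    necessarily the one used in the paper.
    """
    # legvander returns an (N, order + 1) matrix; column k holds P_k(beta_i).
    basis = legendre.legvander(betas, order)
    return basis.T @ states / len(betas)


# Hypothetical ensemble: dx/dt = beta * x + u, with beta sampled on [-1, 1].
betas = np.linspace(-1.0, 1.0, 201)
states = np.ones_like(betas)   # every member starts at x = 1
dt, u = 0.01, 0.5              # step size and one constant broadcast input

for _ in range(100):
    # Forward-Euler step; the same input u is applied to every member.
    states = states + dt * (betas * states + u)

# A low-dimensional, population-level summary of the ensemble state.
moments = ensemble_moments(betas, states, order=3)
print(moments)
```

In this sketch the moment vector stands in for the population-level state that a learning agent would observe in place of the individual states. Here the individual states are available only because the ensemble is simulated; in the setting the abstract describes, such quantities would come from aggregated measurements.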
Pages: 12653-12664
Page count: 12