Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Cited by: 0
Authors
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
Affiliations
[1] Anhui University, School of Electrical Engineering and Automation
[2] Anhui University, Institute of Physical Science and Information Technology
[3] Jiangnan University, Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester, School of Electrical and Electronic Engineering
Source
Keywords
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs);
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
This paper investigates the online adaptive optimal control problem for a class of continuous-time Markov jump linear systems (MJLSs) with completely unknown dynamics, using a parallel reinforcement learning (RL) algorithm. Before the state and input information of the subsystems is collected and learned, exploration noise is first added to describe the actual control input. A novel parallel RL algorithm then computes the solutions of the corresponding N coupled algebraic Riccati equations (AREs) in parallel through online learning. With this algorithm, the dynamic information of the MJLSs does not need to be known. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of the algorithm are illustrated by two simulation examples.
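For orientation only, the sketch below shows what "N coupled AREs" means in the simplest possible setting. It is not the paper's algorithm: it is model-based (it uses the system parameters that the paper treats as unknown), it is restricted to scalar subsystems, and every name and number in it (`solve_coupled_scalar_are`, `are_residuals`, the values of `a`, `b`, `q`, `r`, `Pi`) is a hypothetical choice made for illustration. It assumes the standard coupled ARE form for mode i, 2*a_i*p_i + q_i - (b_i^2/r_i)*p_i^2 + sum_j Pi[i][j]*p_j = 0, and updates all modes from the previous iterate, so each mode's update is independent of the others within a sweep.

```python
import math


def solve_coupled_scalar_are(a, b, q, r, Pi, iters=200):
    """Fixed-point iteration for N coupled scalar AREs (model-based sketch).

    Mode i: 2*a[i]*p[i] + q[i] - (b[i]**2 / r[i]) * p[i]**2
            + sum_j Pi[i][j] * p[j] = 0
    Pi is the transition-rate matrix (rows sum to zero). All inputs here
    are assumed known, unlike the model-free setting of the paper.
    """
    n = len(a)
    p = [0.0] * n
    for _ in range(iters):
        new_p = []
        for i in range(n):
            # Absorb the diagonal rate Pi[i][i] into a shifted drift term.
            a_shift = a[i] + 0.5 * Pi[i][i]
            # Treat the other modes' previous solutions as extra state cost.
            q_shift = q[i] + sum(Pi[i][j] * p[j] for j in range(n) if j != i)
            g = b[i] ** 2 / r[i]
            # Positive root of the decoupled scalar Riccati equation.
            new_p.append((a_shift + math.sqrt(a_shift ** 2 + g * q_shift)) / g)
        p = new_p  # every mode used only the previous iterate
    return p


def are_residuals(p, a, b, q, r, Pi):
    """Left-hand side of each coupled ARE; near zero at a solution."""
    n = len(a)
    return [
        2 * a[i] * p[i] + q[i] - (b[i] ** 2 / r[i]) * p[i] ** 2
        + sum(Pi[i][j] * p[j] for j in range(n))
        for i in range(n)
    ]


# Hypothetical two-mode example (values chosen only for illustration).
a = [1.0, -0.5]
b = [1.0, 1.0]
q = [1.0, 2.0]
r = [1.0, 1.0]
Pi = [[-2.0, 2.0], [1.0, -1.0]]

p = solve_coupled_scalar_are(a, b, q, r, Pi)
gains = [b[i] * p[i] / r[i] for i in range(len(a))]  # feedback u = -gains[i] * x
print("p =", p)
print("gains =", gains)
print("residuals =", are_residuals(p, a, b, q, r, Pi))
```

The paper's contribution, by contrast, is to obtain such solutions from measured state and input data without knowing the system matrices.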
Pages: 14311-14320
Number of pages: 9