Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Authors
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
Affiliations
[1] Anhui University, School of Electrical Engineering and Automation
[2] Anhui University, Institute of Physical Science and Information Technology
[3] Jiangnan University, Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester, School of Electrical and Electronic Engineering
Keywords
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs)
Abstract
This paper investigates the online adaptive optimal control of a class of continuous-time Markov jump linear systems (MJLSs) with completely unknown dynamics, using a parallel reinforcement learning (RL) algorithm. Exploration noise is first added to the actual control input, before the state and input information of the subsystems is collected and learned. A novel parallel RL algorithm then solves the corresponding N coupled algebraic Riccati equations (AREs) in parallel through online learning, so no knowledge of the MJLS dynamics is required. The convergence of the proposed algorithm is proved. Finally, two simulation examples illustrate the effectiveness and applicability of the algorithm.
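For orientation, the coupled AREs mentioned in the abstract are commonly written in the following textbook form for a continuous-time MJLS. This record does not give the paper's own notation, so the symbols A_i, B_i, Q_i, R_i, P_i, and \pi_{ij} below are assumptions for illustration, not the authors' definitions.

```latex
% Assumed setting (not taken from this record): the mode r(t) takes values
% in {1, ..., N} with transition rates \pi_{ij}, where \pi_{ij} \ge 0 for
% i \ne j and \pi_{ii} = -\sum_{j \ne i} \pi_{ij}.
\begin{align}
  \dot{x}(t) &= A_{r(t)}\, x(t) + B_{r(t)}\, u(t), \\
  % One coupled ARE per mode i = 1, ..., N, with Q_i \succeq 0, R_i \succ 0:
  0 &= A_i^{\top} P_i + P_i A_i + Q_i
       - P_i B_i R_i^{-1} B_i^{\top} P_i
       + \sum_{j=1}^{N} \pi_{ij} P_j, \\
  % Mode-dependent optimal state feedback while r(t) = i:
  u(t) &= -R_i^{-1} B_i^{\top} P_i\, x(t).
\end{align}
```

The N equations are coupled through the term \sum_j \pi_{ij} P_j, so they cannot be solved one mode at a time. According to the abstract, the proposed parallel RL algorithm computes the solutions from collected state and input data rather than from the system matrices.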
Pages: 14311-14320
Page count: 9