Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

被引:0
|
作者
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
机构
[1] Anhui University,School of Electrical Engineering and Automation
[2] Anhui University,Institute of Physical Science and Information Technology
[3] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester,School of Electrical and Electronic Engineering
来源
关键词
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples.
引用
收藏
页码:14311 / 14320
页数:9
相关论文
共 50 条
  • [41] Cooperative control for swarming systems based on reinforcement learning in unknown dynamic environment
    Lan, Xuejing
    Liu, Yiwen
    Zhao, Zhijia
    NEUROCOMPUTING, 2020, 410 (410) : 410 - 418
  • [42] Adaptive H-infinity tracking control for a class of stochastic nonlinear systems with completely unknown functions
    Li X.-H.
    Liu H.
    Liu X.-P.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2019, 36 (09): : 1431 - 1441
  • [43] Asynchronous Adaptive Fault-Tolerant Control for Markov Jump Systems with Actuator Failures and Unknown Nonlinear Disturbances
    Li, Jian-Ning
    Gu, Kai-Yang
    Liu, Xiao
    Xu, Xiao-Bin
    COMPLEXITY, 2020, 2020
  • [44] Adaptive robust control of a class of dynamic delay systems with unknown uncertainty bounds
    Oucheriah, S
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2001, 15 (01) : 53 - 63
  • [45] H∞ optimal output tracking control for Markov jump systems: A reinforcement learning-based approach
    Shen, Ying
    Yao, Cai-Kang
    Chen, Bo
    Che, Wei-Wei
    Wu, Zheng-Guang
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (08) : 5149 - 5167
  • [46] Reinforcement Learning-Based Control for a Class of Nonlinear Systems with unknown control directions
    Song, Xiaoling
    Huang, Miao
    Wen, Gang
    Ma, Longhua
    Yao, Jiaqing
    Lu, Zheming
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2519 - 2524
  • [47] Continuous reinforcement learning to robust fault tolerant control for a class of unknown nonlinear systems
    Farivar, Faezeh
    Ahmadabadi, Majid Nili
    APPLIED SOFT COMPUTING, 2015, 37 : 702 - 714
  • [48] Optimal control of a class of nonlinear dynamic systems based on reinforcement learning
    Chen, Xue-Song
    Liu, Fu-Chun
    Kongzhi yu Juece/Control and Decision, 2013, 28 (12): : 1889 - 1893
  • [49] Universal adaptive stabilization for a class of multivariable Markovian jump linear systems with partially unknown transition rates
    Berdouzi, Driss
    El Hadri, Kamal
    El Bouhtouri, Abdelmoula
    IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2021, 38 (03) : 957 - 991
  • [50] Adaptive Interleaved Reinforcement Learning: Robust Stability of Affine Nonlinear Systems With Unknown Uncertainty
    Li, Jinna
    Ding, Jinliang
    Chai, Tianyou
    Lewis, Frank L.
    Jagannathan, Sarangapani
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 270 - 280