Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

被引:0
|
作者
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
机构
[1] Anhui University,School of Electrical Engineering and Automation
[2] Anhui University,Institute of Physical Science and Information Technology
[3] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester,School of Electrical and Electronic Engineering
来源
关键词
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples.
引用
收藏
页码:14311 / 14320
页数:9
相关论文
共 50 条
  • [31] Optimized tracking control based on reinforcement learning for a class of high-order unknown nonlinear dynamic systems
    Wen, Guoxing
    Niu, Ben
    INFORMATION SCIENCES, 2022, 606 : 368 - 379
  • [32] Robust control scheme for a class of uncertain nonlinear systems with completely unknown dynamics using data-driven reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Cui, Yang
    Xiao, Geyang
    NEUROCOMPUTING, 2018, 273 : 68 - 77
  • [33] Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3231 - 3242
  • [34] Input and output quantized feedback control for a class of Markov jump systems with partially unknown transition probabilities
    Sun W.
    Liu Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2019, 41 (08): : 1858 - 1864
  • [35] Adaptive Fault-Tolerant Tracking Control for Markov Jump Systems with Partly Unknown Transition Probability
    Fan, Quanyong
    Ye, Dan
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2270 - 2275
  • [36] Online Reinforcement Learning for Self-adaptive Information Systems
    Palm, Alexander
    Metzger, Andreas
    Pohl, Klaus
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2020, 2020, 12127 : 169 - 184
  • [37] Adaptive Dynamic Programming with Reinforcement Learning on Optimization of Flight Departure Scheduling
    Liu, Hong
    Li, Song
    Sun, Fang
    Fan, Wei
    Ip, Wai-Hung
    Yung, Kai-Leung
    AEROSPACE, 2024, 11 (09)
  • [38] Non-zero-sum games of discrete-time Markov jump systems with unknown dynamics: An off-policy reinforcement learning method
    Zhang, Xuewen
    Shen, Hao
    Li, Feng
    Wang, Jing
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (02) : 949 - 968
  • [39] ONLINE LEARNING AND OPTIMIZATION OF MARKOV JUMP LINEAR MODELS
    Baltaoglu, Sevi
    Tong, Lang
    Zhao, Qing
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2289 - 2293
  • [40] Adaptive Learning with Unknown Information Flows
    Gur, Yonatan
    Momeni, Ahmadreza
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31