Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems

被引:0
|
作者
Costa, OLV [1 ]
Aya, JCC [1 ]
机构
[1] Univ Sao Paulo, Dept Engn Telecomunicac & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil
关键词
TD(lambda) methods; jump systems; Markov parameters; optimal control; Monte Carlo simulations;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite horizon linear regulator problem of discrete-time Markovian jump linear systems for the case in which the transition probability matrix of the Markov chain is not known. It is well known that the optimal control of this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We trace a parallel with the theory of TD(lambda) algorithms for Markovian decision processes to develop a TD(lambda) like algorithm for the optimal control associated to the maximal solution of the CARE. Some numerical examples are also presented.
引用
收藏
页码:1183 / 1188
页数:6
相关论文
共 50 条
  • [41] Stabilization and Optimization of Discrete-Time Markovian Jump Linear Systems via Mode Feedback Control
    Zhu, Jin
    Xia, Kai
    Ling, Qiang
    Chen, Wei
    Dullerud, Geir E.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6505 - 6520
  • [42] H∞ control for discrete-time Markovian jump linear systems with partially uncertain transition probabilities
    Sun, Hui-Jie
    Zhang, Ying
    Wu, Ai-Guo
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2020, 41 (05): : 1796 - 1809
  • [43] H2-Control and the Separation Principle for Discrete-Time Markovian Jump Linear Systems
    O. L. V. Costa
    E. F. Tuesta
    Mathematics of Control, Signals and Systems, 2004, 16 : 320 - 350
  • [44] State and Mode Feedback Control for Discrete-time Markovian Jump Linear Systems With Controllable MTPM
    Zhu, Jin
    Ding, Qin
    Spiryagin, Maksym
    Xie, Wanqing
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) : 830 - 837
  • [45] H2-control and the separation principle for discrete-time Markovian jump linear systems
    Costa, OLV
    Tuesta, EF
    MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2004, 16 (04) : 320 - 350
  • [46] Model Predictive Control of Discrete-time Markovian Jump Positive Systems
    Mehrivash, Hamed
    Hadavand, Ehsan
    Shafiei, Mohammad Hosein
    Zarei, Jafar
    26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 834 - 839
  • [47] Robust H∞ control of descriptor discrete-time Markovian jump systems
    Lam, J.
    Shu, Z.
    Xu, S.
    Boukas, E. -K.
    INTERNATIONAL JOURNAL OF CONTROL, 2007, 80 (03) : 374 - 385
  • [48] Asynchronous H∞ Control for Positive Discrete-time Markovian Jump Systems
    Hui Shang
    Wenhai Qi
    Guangdeng Zong
    International Journal of Control, Automation and Systems, 2020, 18 : 431 - 438
  • [49] Asynchronous Control for Discrete-Time Markovian Jump Systems With Multiplicative Noise
    Wang, Shitong
    Wu, Zheng-Guang
    Shi, Peng
    Wu, Zhaojing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (07) : 2480 - 2484
  • [50] Impulsive H∞ control of discrete-time Markovian jump delay systems
    Zhang, Yu
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 661 - 666