Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems

被引:0
|
作者
Costa, OLV [1 ]
Aya, JCC [1 ]
机构
[1] Univ Sao Paulo, Dept Engn Telecomunicac & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil
关键词
TD(lambda) methods; jump systems; Markov parameters; optimal control; Monte Carlo simulations;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite horizon linear regulator problem of discrete-time Markovian jump linear systems for the case in which the transition probability matrix of the Markov chain is not known. It is well known that the optimal control of this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We trace a parallel with the theory of TD(lambda) algorithms for Markovian decision processes to develop a TD(lambda) like algorithm for the optimal control associated to the maximal solution of the CARE. Some numerical examples are also presented.
引用
收藏
页码:1183 / 1188
页数:6
相关论文
共 50 条
  • [21] Constrained quadratic state feedback control of discrete-time Markovian jump linear systems
    Costa, OLV
    Assumpcao, EO
    Boukas, EK
    Marques, RP
    AUTOMATICA, 1999, 35 (04) : 617 - 626
  • [22] Robust H2-control for discrete-time Markovian jump linear systems
    Costa, OLV
    Marques, RP
    PROCEEDINGS OF THE 1998 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1998, : 746 - 750
  • [23] Robust H2-control for discrete-time Markovian jump linear systems
    Costa, OLV
    Marques, RP
    INTERNATIONAL JOURNAL OF CONTROL, 2000, 73 (01) : 11 - 21
  • [24] Predictive control of linear discrete-time Markovian jump systems by learning recurrent patterns
    Han, SooJean
    Chung, Soon -Jo
    Doyle, John C.
    AUTOMATICA, 2023, 156
  • [25] Control of Discrete-Time Markovian Jump Linear Systems Subject to Partially Observed Chains
    Cerri, Joao P.
    Terra, Marco H.
    2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 1609 - 1614
  • [26] STOCHASTIC OPTIMAL TRACKING WITH PREVIEW BY STATE FEEDBACK FOR LINEAR DISCRETE-TIME MARKOVIAN JUMP SYSTEMS
    Nakura, Gou
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (01): : 15 - 27
  • [27] Optimal guaranteed cost filtering for Markovian jump discrete-time systems
    Mahmoud, MS
    Shi, P
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2004, (01) : 33 - 48
  • [28] OPTIMAL FILTERING IN DISCRETE-TIME SYSTEMS WITH TIME DELAYS AND MARKOVIAN JUMP PARAMETERS
    Han, Chunyan
    Zhang, Huanshui
    ANZIAM JOURNAL, 2009, 51 (02): : 218 - 233
  • [29] Array algorithm for filtering of discrete-time Markovian jump linear systems
    Terra, Marco H.
    Ishihara, Joao Y.
    Junior, Antonio P.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (07) : 1293 - 1296
  • [30] Array algorithm for filtering of discrete-time Markovian jump linear systems
    Terra, Marco H.
    Ishihara, Joao Y.
    Junior, Antonio P.
    2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2006, 1-12 : 982 - +