Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems

Cited by: 0

Authors:

Costa, OLV [1]

Aya, JCC [1]

Affiliations:

[1] Univ Sao Paulo, Dept Engn Telecomunicac & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil

Source:

PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5 | 2000

Keywords:

TD(lambda) methods; jump systems; Markov parameters; optimal control; Monte Carlo simulations;

DOI:

Not available

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite-horizon linear regulator problem for discrete-time Markovian jump linear systems when the transition probability matrix of the Markov chain is unknown. It is well known that the optimal control for this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We draw a parallel with the theory of TD(lambda) algorithms for Markovian decision processes to develop a TD(lambda)-like algorithm for the optimal control associated with the maximal solution of the CARE. Some numerical examples are also presented.
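
To make the role of the CARE concrete, the following is a minimal sketch of the standard fixed-point iteration for the coupled Riccati equations in the simplest setting. It is not the TD(lambda)-like algorithm of the paper: it assumes scalar modes and a known transition matrix `P`, whereas the paper addresses the case in which `P` is unknown. The function names, the scalar parameters `a`, `b`, `q`, `r`, and the example values are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: NOT the paper's TD(lambda) method.
# Assumptions: scalar modes and a KNOWN transition matrix P.


def coupling(P, x):
    """Return E_i(X) = sum_j P[i][j] * X_j for every mode i."""
    n = len(x)
    return [sum(P[i][j] * x[j] for j in range(n)) for i in range(n)]


def riccati_step(a, b, q, r, P, x):
    """Apply one step of the coupled Riccati recursion to x.

    Mode i follows x(k+1) = a[i]*x(k) + b[i]*u(k) with stage cost
    q[i]*x^2 + r[i]*u^2; P[i][j] is the probability of jumping i -> j.
    """
    e = coupling(P, x)
    return [
        q[i] + a[i] ** 2 * e[i]
        - (a[i] * b[i] * e[i]) ** 2 / (r[i] + b[i] ** 2 * e[i])
        for i in range(len(x))
    ]


def solve_scalar_care(a, b, q, r, P, iters=500):
    """Iterate the recursion from X = 0 for a fixed number of steps."""
    x = [0.0] * len(a)
    for _ in range(iters):
        x = riccati_step(a, b, q, r, P, x)
    return x


def feedback_gains(a, b, r, P, x):
    """Mode-dependent gains k[i], giving u(k) = -k[i] * x(k) in mode i."""
    e = coupling(P, x)
    return [a[i] * b[i] * e[i] / (r[i] + b[i] ** 2 * e[i]) for i in range(len(x))]


# Hypothetical two-mode example: mode 0 is open-loop unstable, mode 1 is stable.
a, b, q, r = [1.2, 0.8], [1.0, 1.0], [1.0, 1.0], [1.0, 1.0]
P = [[0.9, 0.1], [0.3, 0.7]]
x = solve_scalar_care(a, b, q, r, P)
k = feedback_gains(a, b, r, P, x)
print("CARE solution per mode:", x)
print("feedback gain per mode:", k)
```

With a single mode and `P = [[1.0]]` the recursion reduces to the ordinary scalar discrete-time Riccati equation, which gives a quick sanity check of the sketch.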

Pages: 1183-1188

Number of pages: 6
