Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems

被引:0
|
作者
Costa, OLV [1 ]
Aya, JCC [1 ]
机构
[1] Univ Sao Paulo, Dept Engn Telecomunicac & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil
关键词
TD(lambda) methods; jump systems; Markov parameters; optimal control; Monte Carlo simulations;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite horizon linear regulator problem of discrete-time Markovian jump linear systems for the case in which the transition probability matrix of the Markov chain is not known. It is well known that the optimal control of this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We trace a parallel with the theory of TD(lambda) algorithms for Markovian decision processes to develop a TD(lambda) like algorithm for the optimal control associated to the maximal solution of the CARE. Some numerical examples are also presented.
引用
收藏
页码:1183 / 1188
页数:6
相关论文
共 50 条