Receding horizon approach to Markov games for infinite horizon discounted cost

被引:0
|
作者
Chang, HS [1 ]
Marcus, SI [1 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a receding horizon approach as an approximate solution to two-person zero-sum Markov games with an infinite horizon discounted cost criterion. We first present error bounds from the optimal equilibrium value of the game when both players take "correlated" receding horizon policies that are based on exact or approximate solutions of receding finite horizon subgames. Motivated by the worst-case optimal control of queueing systems by Altman [1], we then analyze error bounds when the minimizer plays the (approximate) receding horizon control and the maximizer plays the worst case policy. We give two heuristic examples of the approximate receding horizon control. We extend "parallel rollout" and "hindsight optimization" by Chang et al. [11, 13] into the Markov game setting within the framework of the approximate receding horizon approach and analyze their performances. From the parallel rollout approach, the minimizing player seeks to combine dynamically multiple heuristic policies in a set to improve the performances of all of the heuristic policies simultaneously under the guess that the maximizing player has chosen a fixed worst-case policy. Given epsilon > 0, we give the value of the receding horizon which guarantees that the parallel rollout policy with the horizon played by the minimizer "dominates" any heuristic policy in the set by c. From the hindsight optimization approach, the minimizing player makes a decision based on his expected optimal hindsight performance over a finite horizon. We finally discuss practical implementations of the receding horizon approaches via simulation.
引用
收藏
页码:1380 / 1385
页数:6
相关论文
共 50 条
  • [1] Homotopy approach for infinite horizon discounted Markov decision processes
    White, Douglas John
    ZOR. Zeitschrift fur Operations-Research, 1996, 43 (03): : 353 - 372
  • [2] A framework for receding-horizon control in infinite-horizon aggregative games
    Fele, Filiberto
    De Paola, Antonio
    Angeli, David
    Strbac, Goran
    ANNUAL REVIEWS IN CONTROL, 2018, 45 : 191 - 204
  • [3] Zero-sum infinite-horizon discounted piecewise deterministic Markov games
    Huang, Yonghui
    Lian, Zhaotong
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2023, 97 (02) : 179 - 205
  • [4] Zero-sum infinite-horizon discounted piecewise deterministic Markov games
    Yonghui Huang
    Zhaotong Lian
    Xianping Guo
    Mathematical Methods of Operations Research, 2023, 97 : 179 - 205
  • [5] Decision roll and horizon roll processes in infinite horizon discounted Markov decision processes
    White, DJ
    MANAGEMENT SCIENCE, 1996, 42 (01) : 37 - 50
  • [6] On the Infinite Horizon Performance of Receding Horizon Controllers
    Gruene, Lars
    Rantzer, Anders
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (09) : 2100 - 2111
  • [7] Discounted Cost Infinite Time Horizon Cumulant Control
    Diersing, Ronald W.
    Won, Chang-Hee
    Sain, Michael K.
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 1040 - +
  • [8] Infinite horizon linear quadratic tracking problem: A discounted cost function approach
    Birgani, Soleiman Najafi
    Moaveni, Bijan
    Khaki-Sedigh, Ali
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2018, 39 (04): : 1549 - 1572
  • [9] Two-person zero-sum Markov games: Receding horizon approach
    Chang, HS
    Marcus, SI
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) : 1951 - 1961