TURNPIKES IN FINITE MARKOV DECISION PROCESSES AND RANDOM WALK*

被引:0
|
作者
Piunovskiy, A. B. [1 ]
机构
[1] Univ Liverpool, Dept Math Sci, Liverpool, England
关键词
Markov decision process; discounted reward; average reward; random walk; stochastic knapsack problem; turnpike;
D O I
10.1137/S0040585X97T991325
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we revise the theory of turnpikes in discounted Markov decision pro-cesses, prove the turnpike theorem for the undiscounted model, and apply the results to the specific random walk.
引用
收藏
页码:123 / 149
页数:27
相关论文
共 50 条
  • [1] Markov decision processes with random horizon
    Iida, T
    Mori, M
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1996, 39 (04) : 592 - 603
  • [2] Metrics for finite Markov decision processes
    Ferns, N
    Panangaden, P
    Precup, D
    PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 950 - 951
  • [3] MARKOV DECISION PROCESSES WITH FINITE STATE AND DECISION SPACES
    RYKOV, VV
    THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1966, 11 (02): : 302 - &
  • [4] CLASSIFICATION OF A RANDOM-WALK DEFINED ON A FINITE MARKOV CHAIN
    NEWBOULD, M
    ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1973, 26 (02): : 95 - 104
  • [5] Dynamic Watermarking for Finite Markov Decision Processes
    Tang, Jiacheng
    Song, Jiguo
    Gupta, Abhishek
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2025, 4 : 41 - 52
  • [6] A Remark on Finite Horizon Markov Decision processes
    XikUi Wang (University of Saskatchewan
    Canada)
    经济数学, 1989, (05) : 76 - 80
  • [7] Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
    Turchetta, Matteo
    Berkenkamp, Felix
    Krause, Andreas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Random Markov decision processes for sustainable infrastructure systems
    Meidani, Hadi
    Ghanem, Roger
    STRUCTURE AND INFRASTRUCTURE ENGINEERING, 2015, 11 (05) : 655 - 667
  • [9] On optimality gaps for fuzzification in finite Markov decision processes
    Kageyama, Masayuki
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2008, 11 (01) : 77 - 88
  • [10] Measuring the Distance Between Finite Markov Decision Processes
    Song, Jinhua
    Gao, Yang
    Wang, Hao
    An, Bo
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 468 - 476