An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems

被引:0
|
作者
Chang, HS [1 ]
Fu, MC [1 ]
Marcus, SI [1 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea
来源
42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS | 2003年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel algorithm, called "Simulated Annealing Multiplicative Weights", for approximately solving large (discrete-time) finite-horizon stochastic dynamic programming problems. The algorithm is "asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases. The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only. on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice.
引用
收藏
页码:3818 / 3823
页数:6
相关论文
共 50 条
  • [1] An asymptotically efficient simulation-based algorithm for finite horizon stochastic dynamic programming
    Chang, Hyeong Soo
    Fu, Michael C.
    Hu, Jiaqiao
    Marcus, Steven I.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (01) : 89 - 94
  • [2] On Subspace Decompositions of Finite Horizon Dynamic Programming Problems
    Tsakiris, Manolis C.
    Tarraf, Danielle C.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 1890 - 1895
  • [3] NEW ALGORITHM OF DYNAMIC PROGRAMMING FOR STOCHASTIC PROBLEMS SOLUTION
    Dokuchaev, A. V.
    Kotenko, A. P.
    VESTNIK SAMARSKOGO GOSUDARSTVENNOGO TEKHNICHESKOGO UNIVERSITETA-SERIYA-FIZIKO-MATEMATICHESKIYE NAUKI, 2008, (02): : 203 - 209
  • [4] An efficient algorithm for large scale stochastic nonlinear programming problems
    Shastri, Y
    Diwekar, U
    COMPUTERS & CHEMICAL ENGINEERING, 2006, 30 (05) : 864 - 877
  • [5] An Efficient Impulsive Adaptive Dynamic Programming Algorithm for Stochastic Systems
    Liang, Mingming
    Wang, Yonghua
    Liu, Derong
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (09) : 5545 - 5559
  • [6] An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems
    Zhang, Heng
    JOURNAL OF APPLIED MATHEMATICS AND COMPUTING, 2023, 69 (03) : 2741 - 2760
  • [7] An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems
    Heng Zhang
    Journal of Applied Mathematics and Computing, 2023, 69 : 2741 - 2760
  • [8] INFINITE TIME HORIZON STOCHASTIC RECURSIVE CONTROL PROBLEMS WITH JUMPS: DYNAMIC PROGRAMMING AND STOCHASTIC VERIFICATION THEOREMS
    Luo, Sheng
    Li, Xun
    Wei, Qingmeng
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2025, 63 (02) : 796 - 821
  • [9] Adaptive dynamic programming for terminally constrained finite-horizon optimal control problems
    Andrews, L.
    Klotz, J. R.
    Kamalapurkar, R.
    Dixon, W. E.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 5095 - 5100
  • [10] Storage modeling and approximate dynamic programming algorithm for stochastic dynamic economic dispatch problems
    Jian, Ganyang
    Liu, Mingbo
    Lin, Shunjiang
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2014, 34 (25): : 4333 - 4340