An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems

被引：0

作者：

Chang, HS ^{[1
]}

Fu, MC ^{[1
]}

Marcus, SI ^{[1
]}

机构：

[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea

来源：

42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS | 2003年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a novel algorithm, called "Simulated Annealing Multiplicative Weights", for approximately solving large (discrete-time) finite-horizon stochastic dynamic programming problems. The algorithm is "asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases. The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only. on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice.

引用

页码：3818 / 3823

页数：6

共 50 条

[1] An asymptotically efficient simulation-based algorithm for finite horizon stochastic dynamic programming
Chang, Hyeong Soo
Fu, Michael C.
Hu, Jiaqiao
Marcus, Steven I.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (01) : 89 - 94
[2] On Subspace Decompositions of Finite Horizon Dynamic Programming Problems
Tsakiris, Manolis C.
Tarraf, Danielle C.
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 1890 - 1895
[3] NEW ALGORITHM OF DYNAMIC PROGRAMMING FOR STOCHASTIC PROBLEMS SOLUTION
Dokuchaev, A. V.
Kotenko, A. P.
VESTNIK SAMARSKOGO GOSUDARSTVENNOGO TEKHNICHESKOGO UNIVERSITETA-SERIYA-FIZIKO-MATEMATICHESKIYE NAUKI, 2008, (02): : 203 - 209
[4] An efficient algorithm for large scale stochastic nonlinear programming problems
Shastri, Y
Diwekar, U
COMPUTERS & CHEMICAL ENGINEERING, 2006, 30 (05) : 864 - 877
[5] An Efficient Impulsive Adaptive Dynamic Programming Algorithm for Stochastic Systems
Liang, Mingming
Wang, Yonghua
Liu, Derong
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (09) : 5545 - 5559
[6] An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems
Zhang, Heng
JOURNAL OF APPLIED MATHEMATICS AND COMPUTING, 2023, 69 (03) : 2741 - 2760
[7] An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems
Heng Zhang
Journal of Applied Mathematics and Computing, 2023, 69 : 2741 - 2760
[8] INFINITE TIME HORIZON STOCHASTIC RECURSIVE CONTROL PROBLEMS WITH JUMPS: DYNAMIC PROGRAMMING AND STOCHASTIC VERIFICATION THEOREMS
Luo, Sheng
Li, Xun
Wei, Qingmeng
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2025, 63 (02) : 796 - 821
[9] Adaptive dynamic programming for terminally constrained finite-horizon optimal control problems
Andrews, L.
Klotz, J. R.
Kamalapurkar, R.
Dixon, W. E.
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 5095 - 5100
[10] Storage modeling and approximate dynamic programming algorithm for stochastic dynamic economic dispatch problems
Jian, Ganyang
Liu, Mingbo
Lin, Shunjiang
Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2014, 34 (25): : 4333 - 4340

← 1 2 3 4 5 →