Approximate Dynamic Programming via a Smoothed Linear Program

被引：34

作者：

Desai, Vijay V. ^{[1
]}

Farias, Vivek F. ^{[2
]}

Moallemi, Ciamac C. ^{[3
]}

机构：

[1] Columbia Univ, Dept Ind Engn & Operat Res, New York, NY 10027 USA

[2] MIT, Sloan Sch Management, Cambridge, MA 02139 USA

[3] Columbia Univ, Grad Sch Business, New York, NY 10027 USA

来源：

OPERATIONS RESEARCH | 2012年 / 60卷 / 03期

关键词：

CONVERGENCE; POLICIES;

D O I：

10.1287/opre.1120.1044

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP have typically relied on a natural "projection" of a well-studied linear program for exact dynamic programming. Such programs restrict attention to approximations that are lower bounds to the optimal cost-to-go function. Our program-the "smoothed approximate linear program"-is distinct from such approaches and relaxes the restriction to lower bounding approximations in an appropriate fashion while remaining computationally tractable. Doing so appears to have several advantages: First, we demonstrate bounds on the quality of approximation to the optimal cost-to-go function afforded by our approach. These bounds are, in general, no worse than those available for extant LP approaches and for specific problem instances can be shown to be arbitrarily stronger. Second, experiments with our approach on a pair of challenging problems (the game of Tetris and a queueing network control problem) show that the approach outperforms the existing LP approach (which has previously been shown to be competitive with several ADP algorithms) by a substantial margin.

引用

页码：655 / 674

页数：20

共 50 条

[31] Perspectives of approximate dynamic programming
Powell, Warren B.
ANNALS OF OPERATIONS RESEARCH, 2016, 241 (1-2) : 319 - 356
[32] Cooperative Navigation for Heterogeneous Autonomous Vehicles via Approximate Dynamic Programming
Ferrari, Silvia
Anderson, Michael
Fierro, Rafael
Lu, Wenjie
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 121 - 127
[33] A Survey of Approximate Dynamic Programming
Wang Lin
Peng Hui
Zhu Hua-yong
Shen Lin-cheng
2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 396 - 399
[34] SOLVING A GENERAL DISCOUNTED DYNAMIC PROGRAM BY LINEAR-PROGRAMMING
HEILMANN, WR
ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1979, 48 (03): : 339 - 346
[35] Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach☆
He, Kanghui
Shi, Shengling
van den Boom, Ton
De Schutter, Bart
AUTOMATICA, 2024, 160
[36] Approximate dynamic programming for stochastic linear control problems on compact state spaces
Woerner, Stefan
Laumanns, Marco
Zenklusen, Rico
Fertis, Apostolos
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 241 (01) : 85 - 98
[37] AN APPROXIMATE ALGORITHM FOR DISCRETE LINEAR PROGRAMMING
BIONDI, E
SCHMID, R
IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1969, SSC5 (01): : 65 - &
[38] An algorithm for approximate multiparametric linear programming
Filippi, C
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2004, 120 (01) : 73 - 95
[39] Robust smoothed analysis of a condition number for linear programming
Peter Bürgisser
Dennis Amelunxen
Mathematical Programming, 2012, 131 : 221 - 251
[40] AN APPROXIMATE METHOD OF INTEGER LINEAR PROGRAMMING
FINKELSHTEYN, YY
ENGINEERING CYBERNETICS, 1968, (01): : 36 - +

← 1 2 3 4 5 →