Approximate Dynamic Programming via a Smoothed Linear Program

被引:34
|
作者
Desai, Vijay V. [1 ]
Farias, Vivek F. [2 ]
Moallemi, Ciamac C. [3 ]
机构
[1] Columbia Univ, Dept Ind Engn & Operat Res, New York, NY 10027 USA
[2] MIT, Sloan Sch Management, Cambridge, MA 02139 USA
[3] Columbia Univ, Grad Sch Business, New York, NY 10027 USA
关键词
CONVERGENCE; POLICIES;
D O I
10.1287/opre.1120.1044
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP have typically relied on a natural "projection" of a well-studied linear program for exact dynamic programming. Such programs restrict attention to approximations that are lower bounds to the optimal cost-to-go function. Our program-the "smoothed approximate linear program"-is distinct from such approaches and relaxes the restriction to lower bounding approximations in an appropriate fashion while remaining computationally tractable. Doing so appears to have several advantages: First, we demonstrate bounds on the quality of approximation to the optimal cost-to-go function afforded by our approach. These bounds are, in general, no worse than those available for extant LP approaches and for specific problem instances can be shown to be arbitrarily stronger. Second, experiments with our approach on a pair of challenging problems (the game of Tetris and a queueing network control problem) show that the approach outperforms the existing LP approach (which has previously been shown to be competitive with several ADP algorithms) by a substantial margin.
引用
收藏
页码:655 / 674
页数:20
相关论文
共 50 条
  • [31] Perspectives of approximate dynamic programming
    Powell, Warren B.
    ANNALS OF OPERATIONS RESEARCH, 2016, 241 (1-2) : 319 - 356
  • [32] Cooperative Navigation for Heterogeneous Autonomous Vehicles via Approximate Dynamic Programming
    Ferrari, Silvia
    Anderson, Michael
    Fierro, Rafael
    Lu, Wenjie
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 121 - 127
  • [33] A Survey of Approximate Dynamic Programming
    Wang Lin
    Peng Hui
    Zhu Hua-yong
    Shen Lin-cheng
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 396 - 399
  • [34] SOLVING A GENERAL DISCOUNTED DYNAMIC PROGRAM BY LINEAR-PROGRAMMING
    HEILMANN, WR
    ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1979, 48 (03): : 339 - 346
  • [35] Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach☆
    He, Kanghui
    Shi, Shengling
    van den Boom, Ton
    De Schutter, Bart
    AUTOMATICA, 2024, 160
  • [36] Approximate dynamic programming for stochastic linear control problems on compact state spaces
    Woerner, Stefan
    Laumanns, Marco
    Zenklusen, Rico
    Fertis, Apostolos
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 241 (01) : 85 - 98
  • [37] AN APPROXIMATE ALGORITHM FOR DISCRETE LINEAR PROGRAMMING
    BIONDI, E
    SCHMID, R
    IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1969, SSC5 (01): : 65 - &
  • [38] An algorithm for approximate multiparametric linear programming
    Filippi, C
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2004, 120 (01) : 73 - 95
  • [39] Robust smoothed analysis of a condition number for linear programming
    Peter Bürgisser
    Dennis Amelunxen
    Mathematical Programming, 2012, 131 : 221 - 251
  • [40] AN APPROXIMATE METHOD OF INTEGER LINEAR PROGRAMMING
    FINKELSHTEYN, YY
    ENGINEERING CYBERNETICS, 1968, (01): : 36 - +