Approximate dynamic programming via linear programming

被引:0
|
作者
de Farias, DP [1 ]
Van Roy, B [1 ]
机构
[1] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The curse of dimensionality gives rise to prohibitive computational requirements that render infeasible the exact solution of large-scale stochastic control problems. We study an efficient method based on linear programming for approximating solutions to such problems. The approach "fits" a linear combination of pre-selected basis functions to the dynamic programming cost-to-go function. We develop bounds on the approximation error and present experimental results in the domain of queueing network control, providing empirical support for the methodology.
引用
收藏
页码:689 / 695
页数:7
相关论文
共 50 条
  • [1] A LINEAR PROGRAMMING METHODOLOGY FOR APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Henry
    Sala, Antonio
    Armesto, Leopoldo
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (02) : 363 - 375
  • [2] The linear programming approach to approximate dynamic programming
    De Farias, DP
    Van Roy, B
    OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
  • [3] Approximate Dynamic Programming via a Smoothed Linear Program
    Desai, Vijay V.
    Farias, Vivek F.
    Moallemi, Ciamac C.
    OPERATIONS RESEARCH, 2012, 60 (03) : 655 - 674
  • [4] Approximate Dynamic Programming via Sum of Squares Programming
    Summers, Tyler H.
    Kunz, Konstantin
    Kariotoglou, Nikolaos
    Kamgarpour, Maryam
    Summers, Sean
    Lygeros, John
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 191 - 197
  • [5] On constraint sampling in the linear programming approach to approximate dynamic programming
    de Farias, DP
    Van Roy, B
    MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) : 462 - 478
  • [6] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
    Darbha, S.
    Krishnamoorthy, K.
    Pachter, M.
    Chandler, P.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
  • [7] Data-driven approximate dynamic programming: A linear programming approach
    Sutter, Tobias
    Kamoutsi, Angeliki
    Esfahani, Peyman Mohajerin
    Lygeros, John
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [8] Choosing the Cost Vector of the Linear Programming Approach to Approximate Dynamic Programming
    de Farias, Daniela Pucci
    Weber, Theophane
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 67 - 72
  • [9] Markdown Optimization via Approximate Dynamic Programming
    Cosgun, Ozlem
    Kula, Ufuk
    Kahraman, Cengiz
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2013, 6 (01) : 64 - 78
  • [10] Approximate Dynamic Programming via Penalty Functions
    Beuchat, Paul N.
    Lygeros, John
    IFAC PAPERSONLINE, 2017, 50 (01): : 11814 - 11821