Approximating networks, dynamic programming and stochastic approximation

被引:0
|
作者
Baglietto, M [1 ]
Cervellera, C [1 ]
Parisini, T [1 ]
Sanguineti, M [1 ]
Zoppoli, R [1 ]
机构
[1] Univ Genoa, Dept Commun Comp & Syst Sci, DIST, I-16145 Genoa, Italy
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate solution of a general N-stage stochastic optimal control problem is considered. It is known that discretizing uniformly the state components in applying dynamic programming may lead this procedure to incur the "curse of dimensionality". Approximating networks, i.e., linear combinations of parametrized basis functions provided with density properties in some normed linear spaces, are then defined and used in two approximate methods (examples of such networks are neural networks with one hidden layer and linear output activation functions, radial basis functions, etc.). The brat one consists of approximating the optimal cost-to-go functions in dynamic programming (such a technique is known in literature era "neuro-dynamic programming"); the second method reduces the original functional optimization problem to a nonlinear programming one that is solved by means of stochastic approximation, Approximating networks of suitable types benefit by the property that the number of parameters to be optimized and the number of samples to be used for approximating some classes of regular functions increase only linearly (or moderately) with the dimensions of the arguments of the functions and the number of samples used to train the networks. We deem that such properties may enable us to solve N-stage stochastic optimal problems often avoiding the curse of dimensionality. The two methods are tested and compared in an example involving a 10-dimension state vector.
引用
收藏
页码:3304 / 3308
页数:5
相关论文
共 50 条
  • [21] Stochastic convexity in dynamic programming
    Atakan, AE
    ECONOMIC THEORY, 2003, 22 (02) : 447 - 455
  • [22] Stochastic Lipschitz dynamic programming
    Ahmed, Shabbir
    Cabral, Filipe Goulart
    Paulo da Costa, Bernardo Freitas
    MATHEMATICAL PROGRAMMING, 2022, 191 (02) : 755 - 793
  • [23] Stochastic programming in dynamic reliability
    Bourgeois, F
    Labeau, PE
    SAFETY AND RELIABILITY, VOLS 1 & 2, 1999, : 913 - 919
  • [24] Stochastic viability and dynamic programming
    Doyen, Luc
    De Lara, Michel
    SYSTEMS & CONTROL LETTERS, 2010, 59 (10) : 629 - 634
  • [25] Stochastic Differential Dynamic Programming
    Theodorou, Evangelos
    Tassa, Yuval
    Todorov, Emo
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 1125 - 1132
  • [26] Stochastic Approximation Proximal Method of Multipliers for Convex Stochastic Programming
    Zhang, Liwei
    Zhang, Yule
    Xiao, Xiantao
    Wu, Jia
    MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (01) : 177 - 193
  • [27] Numerical evaluation of approximation methods in stochastic programming
    Kuechler, Christian
    Vigerske, Stefan
    OPTIMIZATION, 2010, 59 (03) : 401 - 415
  • [28] APPROXIMATION OF DECISION RULES IN STOCHASTIC-PROGRAMMING
    LEPP, RE
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 1993, 31 (02) : 46 - 49
  • [29] ON THE APPROXIMATION OF STOCHASTIC CONVEX-PROGRAMMING PROBLEMS
    LEPP, R
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 81 : 427 - 434
  • [30] Approximating the solution of a dynamic, stochastic multiple knapsack problem
    Hartman, Joseph C.
    Perry, Thomas C.
    CONTROL AND CYBERNETICS, 2006, 35 (03): : 535 - 550