Approximating networks, dynamic programming and stochastic approximation

被引:0
|
作者
Baglietto, M [1 ]
Cervellera, C [1 ]
Parisini, T [1 ]
Sanguineti, M [1 ]
Zoppoli, R [1 ]
机构
[1] Univ Genoa, Dept Commun Comp & Syst Sci, DIST, I-16145 Genoa, Italy
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate solution of a general N-stage stochastic optimal control problem is considered. It is known that discretizing uniformly the state components in applying dynamic programming may lead this procedure to incur the "curse of dimensionality". Approximating networks, i.e., linear combinations of parametrized basis functions provided with density properties in some normed linear spaces, are then defined and used in two approximate methods (examples of such networks are neural networks with one hidden layer and linear output activation functions, radial basis functions, etc.). The brat one consists of approximating the optimal cost-to-go functions in dynamic programming (such a technique is known in literature era "neuro-dynamic programming"); the second method reduces the original functional optimization problem to a nonlinear programming one that is solved by means of stochastic approximation, Approximating networks of suitable types benefit by the property that the number of parameters to be optimized and the number of samples to be used for approximating some classes of regular functions increase only linearly (or moderately) with the dimensions of the arguments of the functions and the number of samples used to train the networks. We deem that such properties may enable us to solve N-stage stochastic optimal problems often avoiding the curse of dimensionality. The two methods are tested and compared in an example involving a 10-dimension state vector.
引用
收藏
页码:3304 / 3308
页数:5
相关论文
共 50 条
  • [1] Approximating hydropower systems by feasibility spaces in stochastic dual dynamic programming
    Helseth, Arild
    ELECTRIC POWER SYSTEMS RESEARCH, 2024, 234
  • [2] Approximation of Stochastic Programming problems
    Choirat, C
    Hess, C
    Seri, R
    MONTE CARLO AND QUASI-MONTE CARLO METHODS 2004, 2006, : 45 - +
  • [3] ROBUST STOCHASTIC APPROXIMATION APPROACH TO STOCHASTIC PROGRAMMING
    Nemirovski, A.
    Juditsky, A.
    Lan, G.
    Shapiro, A.
    SIAM JOURNAL ON OPTIMIZATION, 2009, 19 (04) : 1574 - 1609
  • [4] A STOCHASTIC APPROXIMATION ALGORITHM FOR STOCHASTIC SEMIDEFINITE PROGRAMMING
    Gaujal, Bruno
    Mertikopoulos, Panayotis
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2016, 30 (03) : 431 - 454
  • [5] Stochastic Dynamic Programming Model for Revenue Optimization in Social Networks
    Hosein, Patrick
    Lawrence, Trisha
    2015 IEEE 11TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB), 2015, : 378 - 383
  • [6] Approximating Stochastic Quantum Noise Through Genetic Programming
    Muqeet, Asmar
    Ali, Shaukat
    Arcaini, Paolo
    SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2024, 2024, 14767 : 56 - 62
  • [7] Adaptive value function approximation for continuous-state stochastic dynamic programming
    Fan, Huiyuan
    Tarun, Prashant K.
    Chen, Victoria C. P.
    COMPUTERS & OPERATIONS RESEARCH, 2013, 40 (04) : 1076 - 1084
  • [8] A dynamic programming approximation for downlink channel allocation in cognitive femtocell networks
    Xiang, Xudong
    Wan, Jianxiong
    Lin, Chuang
    Chen, Xin
    COMPUTER NETWORKS, 2013, 57 (15) : 2976 - 2991
  • [9] PENALTY METHODS WITH STOCHASTIC APPROXIMATION FOR STOCHASTIC NONLINEAR PROGRAMMING
    Wang, Xiao
    Ma, Shiqian
    Yuan, Ya-Xiang
    MATHEMATICS OF COMPUTATION, 2017, 86 (306) : 1793 - 1820
  • [10] Notes on Approximation of Stochastic Programming Problem
    Smid, Martin
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE MATHEMATICAL METHODS IN ECONOMICS 2003, 2003, : 244 - 251