Approximating networks, dynamic programming and stochastic approximation

被引：0

作者：

Baglietto, M ^{[1
]}

Cervellera, C ^{[1
]}

Parisini, T ^{[1
]}

Sanguineti, M ^{[1
]}

Zoppoli, R ^{[1
]}

机构：

[1] Univ Genoa, Dept Commun Comp & Syst Sci, DIST, I-16145 Genoa, Italy

来源：

PROCEEDINGS OF THE 2000 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2000年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Approximate solution of a general N-stage stochastic optimal control problem is considered. It is known that discretizing uniformly the state components in applying dynamic programming may lead this procedure to incur the "curse of dimensionality". Approximating networks, i.e., linear combinations of parametrized basis functions provided with density properties in some normed linear spaces, are then defined and used in two approximate methods (examples of such networks are neural networks with one hidden layer and linear output activation functions, radial basis functions, etc.). The brat one consists of approximating the optimal cost-to-go functions in dynamic programming (such a technique is known in literature era "neuro-dynamic programming"); the second method reduces the original functional optimization problem to a nonlinear programming one that is solved by means of stochastic approximation, Approximating networks of suitable types benefit by the property that the number of parameters to be optimized and the number of samples to be used for approximating some classes of regular functions increase only linearly (or moderately) with the dimensions of the arguments of the functions and the number of samples used to train the networks. We deem that such properties may enable us to solve N-stage stochastic optimal problems often avoiding the curse of dimensionality. The two methods are tested and compared in an example involving a 10-dimension state vector.

引用

页码：3304 / 3308

页数：5

共 50 条

[41] A DYNAMIC STOCHASTIC-APPROXIMATION METHOD
DUPAC, V
ANNALS OF MATHEMATICAL STATISTICS, 1965, 36 (06): : 1695 - 1702
[42] Stochastic dynamic programming with factored representations
Boutilier, C
Dearden, R
Goldszmidt, M
ARTIFICIAL INTELLIGENCE, 2000, 121 (1-2) : 49 - 107
[43] LIMITS TO STOCHASTIC DYNAMIC-PROGRAMMING
MACE, RH
SUTHERLAND, WJ
BEHAVIORAL AND BRAIN SCIENCES, 1991, 14 (01) : 101 - 101
[44] Galerkin methods in dynamic stochastic programming
Koivu, Matti
Pennanen, Teemu
OPTIMIZATION, 2010, 59 (03) : 339 - 354
[45] Dynamic Programming in Convex Stochastic Optimization
Pennanen, Teemu
Perkkioe, Ari-Pekka
JOURNAL OF CONVEX ANALYSIS, 2023, 30 (04) : 1241 - 1283
[46] METHODS FOR SIMULATION IN STOCHASTIC DYNAMIC PROGRAMMING
QUADRAT, JP
VIOT, M
REVUE FRANCAISE D AUTOMATIQUE INFORMATIQUE RECHERCHE OPERATIONNELLE, 1973, 7 (NR1): : 3 - 22
[47] Complexity of stochastic dual dynamic programming
Lan, Guanghui
MATHEMATICAL PROGRAMMING, 2022, 191 (02) : 717 - 754
[48] Approximate dynamic programming for stochastic reachability
Kariotoglou, Nikolaos
Summers, Sean
Summers, Tyler
Kamgarpour, Maryam
Lygeros, John
2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 584 - 589
[49] Diagnostic checking in stochastic dynamic programming
Huang, Wen-Cheng
Wu, Chian Min
Journal of Water Resources Planning and Management, 1993, 119 (04) : 490 - 494
[50] Diffusion approximation for signaling stochastic networks
Leite, Saul C.
Fragoso, Marcelo D.
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2013, 123 (08) : 2957 - 2982

← 1 2 3 4 5 →