Adaptive stochastic approximation by the simultaneous perturbation method

Cited by: 276
Author
Spall, JC [1]
Affiliation
[1] Johns Hopkins Univ, Appl Phys Lab, Laurel, MD 20723 USA
Keywords
adaptive estimation; optimization; parameter estimation; root-finding; simultaneous perturbation stochastic approximation (SPSA); stochastic approximation;
DOI
10.1109/TAC.2000.880982
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Stochastic approximation (SA) has long been applied to problems of minimizing loss functions or root finding with noisy input information. As with all stochastic search algorithms, there are adjustable algorithm coefficients that must be specified and that can have a profound effect on algorithm performance. It is known that choosing these coefficients according to an SA analog of the deterministic Newton-Raphson algorithm provides an optimal or near-optimal form of the algorithm. However, directly determining the required Hessian matrix (or Jacobian matrix for root finding) to achieve this algorithm form has often been difficult or impossible in practice. This paper presents a general adaptive SA algorithm that is based on a simple method for estimating the Hessian matrix while concurrently estimating the primary parameters of interest. The approach applies in both the gradient-free optimization (Kiefer-Wolfowitz) and root-finding/stochastic gradient-based (Robbins-Monro) settings, and is based on the "simultaneous perturbation (SP)" idea introduced previously. The algorithm requires only a small number of loss function or gradient measurements per iteration, independent of the problem dimension, to adaptively estimate the Hessian and parameters of primary interest. Aside from introducing the adaptive SP approach, this paper presents practical implementation guidance, asymptotic theory, and a nontrivial numerical evaluation. Also included is a discussion and numerical analysis comparing the adaptive SP approach with the iterate-averaging approach to accelerated SA.
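The dimension-independent measurement count mentioned in the abstract can be illustrated with the basic first-order SP gradient estimate, in which every coordinate is perturbed at once and only two loss measurements are taken per iteration. The following is a minimal sketch of that underlying idea only. It does not include the Hessian estimation that the paper adds. The function names, the test problem, and the gain values (`a`, `A`, `c`, `alpha`, `gamma`) are illustrative assumptions, not taken from this record.

```python
import random

def sp_gradient(loss, theta, c, rng):
    """Simultaneous perturbation gradient estimate from two loss measurements."""
    # One random +/-1 draw per coordinate; all coordinates are perturbed at once.
    delta = [rng.choice((-1.0, 1.0)) for _ in theta]
    theta_plus = [t + c * d for t, d in zip(theta, delta)]
    theta_minus = [t - c * d for t, d in zip(theta, delta)]
    # Two (possibly noisy) loss measurements, regardless of len(theta).
    diff = loss(theta_plus) - loss(theta_minus)
    return [diff / (2.0 * c * d) for d in delta]

def spsa_minimize(loss, theta0, iterations=2000, a=0.1, A=100.0, c=0.1,
                  alpha=0.602, gamma=0.101, seed=0):
    """First-order SPSA with decaying gains (assumed values, not tuned)."""
    rng = random.Random(seed)
    theta = list(theta0)
    for k in range(iterations):
        a_k = a / (k + 1 + A) ** alpha   # step-size gain
        c_k = c / (k + 1) ** gamma       # perturbation size
        grad = sp_gradient(loss, theta, c_k, rng)
        theta = [t - a_k * g for t, g in zip(theta, grad)]
    return theta

# Hypothetical test problem: noisy quadratic with minimizer at (1, ..., 1).
_noise = random.Random(1)

def noisy_quadratic(theta):
    return sum((t - 1.0) ** 2 for t in theta) + _noise.gauss(0.0, 0.01)

estimate = spsa_minimize(noisy_quadratic, [0.0] * 5)
```

A coordinate-wise finite-difference scheme would need two measurements per coordinate, so its cost grows with the dimension. The sketch above uses two measurements in total per iteration, which is the property the abstract highlights.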
Pages: 1839-1853
Number of pages: 15