ROBUST ACCELERATED GRADIENT METHODS FOR SMOOTH STRONGLY CONVEX FUNCTIONS

被引:29
|
作者
Aybat, Necdet Serhat [1 ]
Fallah, Alireza [2 ]
Gurbuzbalaban, Mert [3 ]
Ozdaglar, Asuman [2 ]
机构
[1] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[3] Rutgers State Univ, Dept Management Sci & Informat Syst, Piscataway, NJ 08854 USA
关键词
convex optimization; stochastic approximation; robust control theory; accelerated methods; Nesterov's method; matrix inequalities; STOCHASTIC-APPROXIMATION ALGORITHMS; OPTIMIZATION ALGORITHMS; COMPOSITE OPTIMIZATION; H-2;
D O I
10.1137/19M1244925
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We study the trade-offs between convergence rate and robustness to gradient errors in designing a first-order algorithm. We focus on gradient descent and accelerated gradient (AG) methods for minimizing strongly convex functions when the gradient has random errors in the form of additive white noise. With gradient errors, the function values of the iterates need not converge to the optimal value; hence, we define the robustness of an algorithm to noise as the asymptotic expected suboptimality of the iterate sequence to input noise power. For this robustness measure, we provide exact expressions for the quadratic case using tools from robust control theory and tight upper bounds for the smooth strongly convex case using Lyapunov functions certified through matrix inequalities. We use these characterizations within an optimization problem which selects parameters of each algorithm to achieve a particular trade-off between rate and robustness. Our results show that AG can achieve acceleration while being more robust to random gradient errors. This behavior is quite different than previously reported in the deterministic gradient noise setting. We also establish some connections between the robustness of an algorithm and how quickly it can converge back to the optimal solution if it is perturbed from the optimal point with deterministic noise. Our framework also leads to practical algorithms that can perform better than other state-of-the-art methods in the presence of random gradient noise.
引用
收藏
页码:717 / 751
页数:35
相关论文
共 50 条
  • [1] Accelerated Distributed Nesterov Gradient Descent for Smooth and Strongly Convex Functions
    Qu, Guannan
    Li, Na
    2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 209 - 216
  • [2] A Robust Accelerated Optimization Algorithm for Strongly Convex Functions
    Cyrus, Saman
    Hu, Bin
    Van Scoy, Bryan
    Lessard, Laurent
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 1376 - 1381
  • [3] An Extended Gradient Method for Smooth and Strongly Convex Functions
    Zhang, Xuexue
    Liu, Sanyang
    Zhao, Nannan
    MATHEMATICS, 2023, 11 (23)
  • [4] Accelerated Distributed Nesterov Gradient Descent for Convex and Smooth Functions
    Qu, Guannan
    Li, Na
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [5] Accelerated Bregman proximal gradient methods for relatively smooth convex optimization
    Hanzely, Filip
    Richtarik, Peter
    Xiao, Lin
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2021, 79 (02) : 405 - 440
  • [6] Accelerated Bregman proximal gradient methods for relatively smooth convex optimization
    Filip Hanzely
    Peter Richtárik
    Lin Xiao
    Computational Optimization and Applications, 2021, 79 : 405 - 440
  • [7] An optimal gradient method for smooth strongly convex minimization
    Taylor, Adrien
    Drori, Yoel
    MATHEMATICAL PROGRAMMING, 2023, 199 (1-2) : 557 - 594
  • [8] An optimal gradient method for smooth strongly convex minimization
    Adrien Taylor
    Yoel Drori
    Mathematical Programming, 2023, 199 : 557 - 594
  • [9] Nonlinear conjugate gradient for smooth convex functions
    Karimi, Sahar
    Vavasis, Stephen A.
    MATHEMATICAL PROGRAMMING COMPUTATION, 2024, 16 (02) : 229 - 254
  • [10] On the Convergence Rate of Quasi-Newton Methods on Strongly Convex Functions with Lipschitz Gradient
    Krutikov, Vladimir
    Tovbis, Elena
    Stanimirovic, Predrag
    Kazakovtsev, Lev
    MATHEMATICS, 2023, 11 (23)