Lower bounds for finding stationary points II: first-order methods

被引:30
|
作者
Carmon, Yair [1 ]
Duchi, John C. [1 ,2 ]
Hinder, Oliver [3 ]
Sidford, Aaron [3 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Non-convex optimization; Information-based complexity; Dimension-free rates; Gradient methods; Accelerated gradient descent;
D O I
10.1007/s10107-019-01431-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We establish lower bounds on the complexity of finding similar to-stationary points of smooth, non-convex high-dimensional functions using first-order methods. We prove that deterministic first-order methods, even applied to arbitrarily smooth functions, cannot achieve convergence rates in similar to better than similar to -8/5, which is within similar to-1/15 log 1 similar to of the best known rate for such methods. Moreover, for functions with Lipschitz first and second derivatives, we prove that no deterministic first-order method can achieve convergence rates better than similar to -12/7, while similar to -2 is a lower bound for functions with only Lipschitz gradient. For convex functions with Lipschitz gradient, accelerated gradient descent achieves a better rate, showing that finding stationary points is easier given convexity.
引用
收藏
页码:315 / 355
页数:41
相关论文
共 50 条
  • [41] First-order, stationary mean-field games with congestion
    Evangelista, David
    Ferreira, Rita
    Gomes, Diogo A.
    Nurbekyan, Levon
    Voskanyan, Vardan
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2018, 173 : 37 - 74
  • [42] FIRST-ORDER PENALTY METHODS FOR BILEVEL OPTIMIZATION
    Lu, Zhaosong
    Mei, Sanyou
    SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (02) : 1937 - 1969
  • [43] Distributed Learning Systems with First-Order Methods
    Liu, Ji
    Zhang, Ce
    FOUNDATIONS AND TRENDS IN DATABASES, 2020, 9 (01): : 1 - 100
  • [44] First-order resolution methods for modal logics
    1600, Springer Verlag (7797 LNCS):
  • [45] First-order methods for sparse covariance selection
    D'Aspremont, Alexandre
    Banerjee, Onureena
    El Ghaoui, Laurent
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (01) : 56 - 66
  • [46] Optimum Predictor in Stationary First-order Moving Average Process
    Saber, Mohammad Mehdi
    Khorshidian, Kavoos
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION A-SCIENCE, 2021, 45 (05): : 1757 - 1764
  • [47] Optimum Predictor in Stationary First-order Moving Average Process
    Mohammad Mehdi Saber
    Kavoos Khorshidian
    Iranian Journal of Science and Technology, Transactions A: Science, 2021, 45 : 1757 - 1764
  • [48] Control Interpretations for First-Order Optimization Methods
    Hu, Bin
    Lessard, Laurent
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 3114 - 3119
  • [49] Scalable First-Order Methods for Robust MDPs
    Grand-Clement, Julien
    Kroer, Christian
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12086 - 12094
  • [50] Lower bounds to the first-order gradient corrections in the gradient expansion of the kinetic- and exchange-energy functionals
    Tao, JM
    Li, JM
    PHYSICAL REVIEW A, 1996, 54 (05): : 3859 - 3862