Adaptive Negative Curvature Descent with Applications in Non-convex Optimization

Cited by: 0
Authors
Liu, Mingrui [1]
Li, Zhe [1]
Wang, Xiaoyu [2]
Yi, Jinfeng [3]
Yang, Tianbao [1]
Affiliations
[1] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
[2] Intellifusion, Parlin, NJ USA
[3] JD AI Res, Stanford, CA USA
Funding
U.S. National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The negative curvature descent (NCD) method has been used to design deterministic and stochastic algorithms for non-convex optimization that aim to find second-order stationary points or local minima. In existing studies, NCD needs to approximate the smallest eigenvalue of the Hessian matrix to a sufficient precision (e.g., $\epsilon_2 \ll 1$) in order to achieve a sufficiently accurate second-order stationary solution (i.e., $\lambda_{\min}(\nabla^2 f(x)) \geq -\epsilon_2$). One issue with this approach is that the target precision $\epsilon_2$ is usually set very small in order to find a high-quality solution, which increases the cost of computing a negative curvature direction. To address this issue, we propose an adaptive NCD that allows the error in approximating the smallest eigenvalue of the Hessian to adapt to the magnitude of the current gradient, and that encourages competition between a noisy NCD step and a gradient descent step. We consider applications of the proposed adaptive NCD to both deterministic and stochastic non-convex optimization, and demonstrate that it can help reduce the overall complexity of computing negative curvatures during the course of optimization without sacrificing the iteration complexity.
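The idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm as published: the tolerance rule `max(eps2, ||grad||**alpha)`, the shifted power iteration, the decrease estimates used to pick a step, and the names `approx_min_eig`, `adaptive_ncd`, `L1` (gradient Lipschitz constant) and `L2` (Hessian Lipschitz constant) are all assumptions chosen for illustration.

```python
import numpy as np

def approx_min_eig(hvp, dim, L1, tol, rng, max_iters=1000):
    """Approximate the smallest eigenpair of H using Hessian-vector products.

    Runs power iteration on (L1*I - H) and stops once the eigen-residual
    ||Hv - lam*v|| is at most tol. This stopping rule is a heuristic
    (an assumption of this sketch), not a certified accuracy guarantee.
    """
    v = rng.standard_normal(dim)
    v /= np.linalg.norm(v)
    for _ in range(max_iters):
        Hv = hvp(v)
        lam = float(v @ Hv)                  # Rayleigh quotient v^T H v
        if np.linalg.norm(Hv - lam * v) <= tol:
            break
        w = L1 * v - Hv                      # one step on the shifted matrix
        v = w / np.linalg.norm(w)
    return float(v @ hvp(v)), v

def adaptive_ncd(grad, hess, x0, L1, L2, eps1=1e-3, eps2=1e-2,
                 alpha=1.0, max_iters=500, seed=0):
    """Sketch: gradient step vs. noisy negative-curvature step, best one wins."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iters):
        g = grad(x)
        g_norm = np.linalg.norm(g)
        H = hess(x)
        # Adaptive precision: a large gradient permits a loose eigenvalue estimate.
        tol = max(eps2, g_norm ** alpha)
        lam, v = approx_min_eig(lambda u: H @ u, x.size, L1, tol, rng)
        if g_norm <= eps1 and lam >= -eps2:
            break                            # approximate second-order stationarity
        # Decrease estimates from the standard smoothness bounds.
        ncd_gain = abs(lam) ** 3 / (3.0 * L2 ** 2) if lam < 0 else 0.0
        gd_gain = g_norm ** 2 / (2.0 * L1)
        if ncd_gain > gd_gain:
            direction = -v if v @ g > 0 else v   # make it a descent direction
            x = x + (abs(lam) / L2) * direction
        else:
            x = x - g / L1
    return x

# Toy problem (hypothetical): saddle at the origin, minima at (+1, 0) and (-1, 0).
def f(x):
    return 0.25 * x[0] ** 4 - 0.5 * x[0] ** 2 + 0.5 * x[1] ** 2

def grad_f(x):
    return np.array([x[0] ** 3 - x[0], x[1]])

def hess_f(x):
    return np.array([[3.0 * x[0] ** 2 - 1.0, 0.0], [0.0, 1.0]])

# Plain gradient descent started on the x[0] = 0 axis would slide into the saddle.
x_star = adaptive_ncd(grad_f, hess_f, x0=[0.0, 0.5], L1=4.0, L2=8.0)
```

In this toy run the gradient step wins while the gradient is large, and the negative curvature step takes over only once the gradient has shrunk, which is when the sketch also asks for a tighter eigenvalue estimate. The constants `L1=4.0` and `L2=8.0` are rough bounds that hold only near the region this example visits.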
Pages: 10