Adaptive Negative Curvature Descent with Applications in Non-convex Optimization

被引:0
|
作者
Liu, Mingrui [1 ]
Li, Zhe [1 ]
Wang, Xiaoyu [2 ]
Yi, Jinfeng [3 ]
Yang, Tianbao [1 ]
机构
[1] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
[2] Intellifusion, Parlin, NJ USA
[3] JD AI Res, Stanford, CA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Negative curvature descent (NCD) method has been utilized to design deterministic or stochastic algorithms for non-convex optimization aiming at finding second-order stationary points or local minima In existing studies, NCD needs to approximate the smallest eigen-value of the Hessian matrix with a sufficient precision (e.g., epsilon(2) << 1) in order to achieve a sufficiently accurate second-order stationary solution (i.e., lambda(min)(del(2)f(x)) >= -epsilon(2)). One issue with this approach is that the target precision epsilon(2) is usually set to be very small in order to find a high quality solution, which increases the complexity for computing a negative curvature. To address this issue, we propose an adaptive NCD to allow an adaptive error dependent on the current gradient's magnitude in approximating the smallest eigen-value of the Hessian, and to encourage competition between a noisy NCD step and gradient descent step. We consider the applications of the proposed adaptive NCD for both deterministic and stochastic non-convex optimization, and demonstrate that it can help reduce the the overall complexity in computing the negative curvatures during the course of optimization without sacrificing the iteration complexity.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Gradient Methods for Non-convex Optimization
    Prateek Jain
    Journal of the Indian Institute of Science, 2019, 99 : 247 - 256
  • [22] Replica Exchange for Non-Convex Optimization
    Dong, Jing
    Tong, Xin T.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [23] Replica exchange for non-convex optimization
    Dong, Jing
    Tong, Xin T.
    1600, Microtome Publishing (22):
  • [24] Robust Optimization for Non-Convex Objectives
    Chen, Robert
    Lucier, Brendan
    Singer, Yaron
    Syrgkanis, Vasilis
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [25] EXISTENCE THEOREMS IN NON-CONVEX OPTIMIZATION
    AUBERT, G
    TAHRAOUI, R
    APPLICABLE ANALYSIS, 1984, 18 (1-2) : 75 - 100
  • [26] CLASS OF NON-CONVEX OPTIMIZATION PROBLEMS
    HIRCHE, J
    TAN, HK
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1977, 57 (04): : 247 - 253
  • [27] Accelerated algorithms for convex and non-convex optimization on manifolds
    Lin, Lizhen
    Saparbayeva, Bayan
    Zhang, Michael Minyi
    Dunson, David B.
    MACHINE LEARNING, 2025, 114 (03)
  • [28] Convex and Non-convex Optimization Under Generalized Smoothness
    Li, Haochuan
    Qian, Jian
    Tian, Yi
    Rakhlin, Alexander
    Jadbabaie, Ali
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] ON A CLASS OF NON-CONVEX FUNCTIONALS AND THEIR APPLICATIONS
    TAHRAOUI, R
    SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 1990, 21 (01) : 37 - 52
  • [30] Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent
    Hu, Wenqing
    Li, Chris Junchi
    Lian, Xiangru
    Liu, Ji
    Yuan, Huizhuo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32