An accelerated first-order method with complexity analysis for solving cubic regularization subproblems

被引:7
|
作者
Jiang, Rujun [1 ]
Yue, Man-Chung [2 ]
Zhou, Zhishuo [1 ]
机构
[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[2] Hong Kong Polytech Univ, Dept Appl Math, Hung Hom, Hong Kong, Peoples R China
关键词
Cubic regularization subproblem; First-order methods; Constrained convex optimization; Complexity analysis;
D O I
10.1007/s10589-021-00274-7
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We propose a first-order method to solve the cubic regularization subproblem (CRS) based on a novel reformulation. The reformulation is a constrained convex optimization problem whose feasible region admits an easily computable projection. Our reformulation requires computing the minimum eigenvalue of the Hessian. To avoid the expensive computation of the exact minimum eigenvalue, we develop a surrogate problem to the reformulation where the exact minimum eigenvalue is replaced with an approximate one. We then apply first-order methods such as the Nesterov's accelerated projected gradient method (APG) and projected Barzilai-Borwein method to solve the surrogate problem. As our main theoretical contribution, we show that when an epsilon-approximate minimum eigenvalue is computed by the Lanczos method and the surrogate problem is approximately solved by APG, our approach returns an epsilon-approximate solution to CRS in (O) over tilde(epsilon(-1/2)) matrix-vector multiplications (where (O) over tilde(.) hides the logarithmic factors). Numerical experiments show that our methods are comparable to and outperform the Krylov subspace method in the easy and hard cases, respectively. We further implement our methods as subproblem solvers of adaptive cubic regularization methods, and numerical results show that our algorithms are comparable to the state-of-the-art algorithms.
引用
收藏
页码:471 / 506
页数:36
相关论文
共 50 条
  • [41] On the Parameterized Complexity of Learning First-Order Logic
    van Bergerem, Steffen
    Grohe, Martin
    Ritzert, Martin
    PROCEEDINGS OF THE 41ST ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS (PODS '22), 2022, : 337 - 346
  • [42] Complexity of Existential Positive First-Order Logic
    Bodirsky, Manuel
    Hermann, Miki
    Richoux, Florian
    MATHEMATICAL THEORY AND COMPUTATIONAL PRACTICE, 2009, 5635 : 31 - 36
  • [43] A first-order continuous method for the Antipin regularization of monotone variational inequalities in a Banach space
    Ryazantseva I.P.
    Computational Mathematics and Mathematical Physics, 2006, 46 (7) : 1121 - 1131
  • [44] A general first-order global sensitivity analysis method
    Xu, Chonggang
    Gertner, George Zdzislaw
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2008, 93 (07) : 1060 - 1071
  • [45] First-order differential power analysis on the duplication method
    Fumaroli, Guillaume
    Mayer, Emmanuel
    Dubois, Renaud
    PROGRESS IN CRYPTOLOGY - INDOCRYPT 2007, 2007, 4859 : 210 - 223
  • [46] A deep First-Order System Least Squares method for solving elliptic PDEs
    Bersetche, Francisco M.
    Borthagaray, Juan Pablo
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2023, 129 : 136 - 150
  • [47] AUTOMATIC SOLVING OF DIFFERENTIAL EQUATIONS OF FIRST-ORDER BY METHOD OF DEVELOPMENT IN TAYLOR SERIES
    KACHRILLO, L
    REVUE FRANCAISE D AUTOMATIQUE INFORMATIQUE RECHERCHE OPERATIONNELLE, 1972, 6 (SEP): : 161 - 179
  • [48] A first-order method for solving bilevel convex optimization problems in Banach space
    Guan, Wei-Bo
    Song, Wen
    OPTIMIZATION, 2023, : 2221 - 2246
  • [49] Second Derivative Multistep Method For Solving First-Order Ordinary Differential Equations
    Turki, Mohammed Yousif
    Ismail, Fudziah
    Senu, Norazak
    Ibrahim, Zarina Bibi
    INNOVATIONS THROUGH MATHEMATICAL AND STATISTICAL RESEARCH: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON MATHEMATICAL SCIENCES AND STATISTICS (ICMSS2016), 2016, 1739
  • [50] A variable step implicit block multistep method for solving first-order ODEs
    Mehrkanoon, S.
    Majid, Z. A.
    Suleiman, M.
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2010, 233 (09) : 2387 - 2394