Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization

被引:23
|
作者
Li, Xingguo [1 ]
Lu, Junwei [2 ]
Arora, Raman [3 ]
Haupt, Jarvis [4 ]
Liu, Han [5 ]
Wang, Zhaoran [6 ]
Zhao, Tuo [7 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Harvard Univ, Dept Biostat, Boston, MA 02138 USA
[3] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[4] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA
[5] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[6] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60208 USA
[7] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
关键词
Strict saddle problem; global landscape; nonconvex matrix factorization; matrix sensing; invariant group; LOW-RANK; SPARSE PCA; COMPLETION; ALGORITHM; BOUNDS;
D O I
10.1109/TIT.2019.2898663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a general theory for studying the landscape of nonconvex optimization with underlying symmetric structures for a class of machine learning problems (e.g., lowr-ank matrix factorization, phase retrieval, and deep linear neural networks). In particular, we characterize the locations of stationary points and the null space of Hessian matrices of the objective function via the lens of invariant groups. As a major motivating example, we apply the proposed general theory to characterize the global landscape of the nonconvex optimization in low-rank matrix factorization problem. We illustrate how the rotational symmetry group gives rise to infinitely many nonisolated strict saddle points and equivalent global minima of the objective function. By explicitly identifying all stationary points, we divide the entire parameter space into three regions: (R-1) the region containing the neighborhoods of all strict saddle points where the objective has negative curvature; (R-2) the region containing neighborhoods of all global minima, where the objective enjoys strong convexity along certain directions; and (R-3) the complement of the above regions, where the gradient has sufficiently large magnitude. We further extend our result to the matrix sensing problem. Such global landscape implies that strong global convergence guarantees for popular iterative algorithms with arbitrary initial solutions.
引用
收藏
页码:3489 / 3514
页数:26
相关论文
共 50 条
  • [1] Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization
    Li, Xingguo
    Haupt, Jarvis
    Lu, Junwei
    Wang, Zhaoran
    Arora, Raman
    Liu, Han
    Zhao, Tuo
    2018 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2018,
  • [2] Local Saddle Points and Convexification for Nonconvex Optimization Problems
    Z. K. Xu
    Journal of Optimization Theory and Applications, 1997, 94 : 739 - 746
  • [3] Local saddle points and convexification for nonconvex optimization problems
    Xu, ZK
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1997, 94 (03) : 739 - 746
  • [4] On saddle points of augmented Lagrangians for constrained nonconvex optimization
    Sun, XL
    Li, D
    McKinnon, KIM
    SIAM JOURNAL ON OPTIMIZATION, 2005, 15 (04) : 1128 - 1146
  • [5] Saddle points of general augmented Lagrangians for constrained nonconvex optimization
    Wu, H. X.
    Luo, H. Z.
    JOURNAL OF GLOBAL OPTIMIZATION, 2012, 53 (04) : 683 - 697
  • [6] On Nonconvex Optimization for Machine Learning: Gradients, Stochasticity, and Saddle Points
    Jin, Chi
    Netrapalli, Praneeth
    Ge, Rong
    Kakade, Sham M.
    Jordan, Michael, I
    JOURNAL OF THE ACM, 2021, 68 (02)
  • [7] Saddle points of general augmented Lagrangians for constrained nonconvex optimization
    H. X. Wu
    H. Z. Luo
    Journal of Global Optimization, 2012, 53 : 683 - 697
  • [8] Remarks on saddle points in nonconvex sets
    Park, S
    Kim, IS
    APPLIED MATHEMATICS LETTERS, 2000, 13 (01) : 111 - 113
  • [9] A NEWTON-BASED METHOD FOR NONCONVEX OPTIMIZATION WITH FAST EVASION OF SADDLE POINTS
    Paternain, Santiago
    Mokhtari, Aryan
    Ribeiro, Alejandro
    SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (01) : 343 - 368
  • [10] Saddle points on nonconvex subsets of topological spaces
    Smolyakov, ER
    DOKLADY AKADEMII NAUK, 1997, 355 (04) : 453 - 454