Second Order Methods for Optimizing Convex Matrix Functions and Sparse Covariance Clustering

Cited by: 1
Authors
Chin, Gillian M. [1]
Nocedal, Jorge [1]
Olsen, Peder A. [2]
Rennie, Steven J. [2]
Affiliations
[1] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60208 USA
[2] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
Convexity; clustering; FISTA; Hessian structure; Jeffreys divergence; Kullback-Leibler divergence; LASSO; Newton's method; thresholding algorithm
DOI
10.1109/TASL.2013.2263142
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
A variety of first-order methods have recently been proposed for solving matrix optimization problems arising in machine learning. The premise for utilizing such algorithms is that second-order information is too expensive to employ, and so simple first-order iterations are likely to be optimal. In this paper, we argue that second-order information is in fact efficiently accessible in many matrix optimization problems, and can be effectively incorporated into optimization algorithms. We begin by reviewing how certain Hessian operations can be conveniently represented in a wide class of matrix optimization problems, and provide the first proofs for these results. Next we consider a concrete problem, namely the minimization of the ℓ1-regularized Jeffreys divergence, and derive formulae for computing Hessians and Hessian-vector products. This allows us to propose various second-order methods for solving the Jeffreys divergence problem. We present extensive numerical results illustrating the behavior of the algorithms and apply the methods to a speech recognition problem. We compress full covariance Gaussian mixture models utilized for acoustic models in automatic speech recognition. By discovering clusters of (sparse inverse) covariance matrices, we can compress the number of covariance parameters by a factor exceeding 200, while still outperforming the word error rate (WER) performance of a diagonal covariance model that has 20 times fewer covariance parameters than the original acoustic model.
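As a rough illustration of the quantities the abstract refers to, the sketch below computes the Jeffreys divergence (taken here as the sum of the two Kullback-Leibler divergences) between two zero-mean Gaussians, and a Hessian-vector product for the function f(P) = -log det P, whose Hessian applied to a symmetric direction V is P^{-1} V P^{-1}. The function names, the zero-mean assumption, and the choice of -log det as the example function are assumptions made for this sketch; the record does not give the paper's exact formulation, and the ℓ1 regularization term is not included.

```python
import numpy as np

def jeffreys_divergence(sigma1, sigma2):
    """KL(N(0,sigma1) || N(0,sigma2)) + KL(N(0,sigma2) || N(0,sigma1)).

    The log-determinant terms of the two KL divergences cancel, leaving
    0.5 * [tr(sigma2^{-1} sigma1) + tr(sigma1^{-1} sigma2)] - d.
    """
    d = sigma1.shape[0]
    # solve(A, B) returns A^{-1} B without forming the inverse explicitly
    t12 = np.trace(np.linalg.solve(sigma2, sigma1))
    t21 = np.trace(np.linalg.solve(sigma1, sigma2))
    return 0.5 * (t12 + t21) - d

def neg_logdet_grad(P):
    """Gradient of f(P) = -log det P for symmetric positive definite P."""
    return -np.linalg.inv(P)

def neg_logdet_hvp(P, V):
    """Hessian of f(P) = -log det P applied to a symmetric direction V."""
    P_inv = np.linalg.inv(P)
    return P_inv @ V @ P_inv

# Illustrative inputs (hypothetical values, not taken from the paper)
P = np.array([[2.0, 0.5], [0.5, 1.0]])
V = np.array([[1.0, 0.2], [0.2, 0.5]])

# Finite-difference check: the HVP should match the change in the gradient
eps = 1e-6
fd = (neg_logdet_grad(P + eps * V) - neg_logdet_grad(P)) / eps
hvp_error = np.max(np.abs(fd - neg_logdet_hvp(P, V)))
```

The Hessian-vector product needs only matrix products with P^{-1}, never the full d² × d² Hessian, which is the kind of structure that makes second-order information affordable in matrix problems.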
Pages: 2244-2254
Page count: 11
Related papers
50 in total
  • [31] Higher-order tensor methods for minimizing difference of convex functions
    Automatic Control and Systems Engineering Department, National University of Science and Technology Politehnica Bucharest, Spl. Independentei 313, Bucharest
    060042, Romania
    Unknown
    050711, Romania
    arXiv,
  • [32] Second Hankel Determinant of Logarithmic Coefficients of Convex and Starlike Functions of Order Alpha
    Kowalczyk, Bogumila
    Lecko, Adam
    BULLETIN OF THE MALAYSIAN MATHEMATICAL SCIENCES SOCIETY, 2022, 45 (02) : 727 - 740
  • [33] SECOND-ORDER SLICE-DERIVATIVE OF CONVEX FUNCTIONS IN NORMED SPACES
    Mohammad, Yara
    Soueycatt, Mohamed
    PACIFIC JOURNAL OF OPTIMIZATION, 2019, 15 (01): : 131 - 143
  • [34] Results on Second-Order Hankel Determinants for Convex Functions with Symmetric Points
    Ullah, Khalil
    Al-Shbeil, Isra
    Faisal, Muhammad Imran
    Arif, Muhammad
    Alsaud, Huda
    SYMMETRY-BASEL, 2023, 15 (04):
  • [35] Second Hankel Determinant of Logarithmic Coefficients of Convex and Starlike Functions of Order Alpha
    Bogumiła Kowalczyk
    Adam Lecko
    Bulletin of the Malaysian Mathematical Sciences Society, 2022, 45 : 727 - 740
  • [36] Second-order covariance matrix of maximum likelihood estimates in generalized linear models
    Cordeiro, GM
    STATISTICS & PROBABILITY LETTERS, 2004, 66 (02) : 153 - 160
  • [37] LS-CMA-ES: A second-order algorithm for covariance matrix adaptation
    Auger, A
    Schoenauer, M
    Vanhaecke, N
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN VIII, 2004, 3242 : 182 - 191
  • [38] Faster Differentially Private Convex Optimization via Second-Order Methods
    Ganesh, Arun
    Haghifam, Mahdi
    Steinke, Thomas
    Thakurta, Abhradeep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [39] Some Cases of Efficient Factorization of Second-Order Matrix Functions
    Kiyasov, S. N.
    RUSSIAN MATHEMATICS, 2012, 56 (06) : 30 - 36
  • [40] A Class of Holder Matrix Functions of the Second Order Admitting Effective Factorization
    Kiyasov, S. N.
    RUSSIAN MATHEMATICS, 2022, 66 (10) : 56 - 61