A RANDOMIZED ALGORITHM FOR PRINCIPAL COMPONENT ANALYSIS

被引:249
|
作者
Rokhlin, Vladimir [1 ,2 ,3 ]
Szlam, Arthur [4 ]
Tygert, Mark [4 ]
机构
[1] Yale Univ, Dept Comp Sci, New Haven, CT 06511 USA
[2] Yale Univ, Dept Math, New Haven, CT 06511 USA
[3] Yale Univ, Dept Phys, New Haven, CT 06511 USA
[4] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
关键词
principal component analysis; singular value decomposition; low rank; Lanczos; power; MONTE-CARLO ALGORITHMS; LOW-RANK APPROXIMATION; MATRICES;
D O I
10.1137/080736417
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Principal component analysis (PCA) requires the computation of a low-rank approximation to a matrix containing the data being analyzed. In many applications of PCA, the best possible accuracy of any rank-deficient approximation is at most a few digits (measured in the spectral norm, relative to the spectral norm of the matrix being approximated). In such circumstances, efficient algorithms have not come with guarantees of good accuracy, unless one or both dimensions of the matrix being approximated are small. We describe an efficient algorithm for the low-rank approximation of matrices that produces accuracy that is very close to the best possible accuracy, for matrices of arbitrary sizes. We illustrate our theoretical results via several numerical examples.
引用
收藏
页码:1100 / 1124
页数:25
相关论文
共 50 条
  • [1] Algorithm 971: An Implementation of a Randomized Algorithm for Principal Component Analysis
    Li, Huamin
    Linderman, George C.
    Szlam, Arthur
    Stanton, Kelly P.
    Kluger, Yuval
    Tygert, Mark
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2017, 43 (03):
  • [2] Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
    Lin, Nan
    Jiang, Junhai
    Guo, Shicheng
    Xiong, Momiao
    PLOS ONE, 2015, 10 (07):
  • [3] Bat algorithm with principal component analysis
    Zhihua Cui
    Feixiang Li
    Wensheng Zhang
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 603 - 622
  • [4] Quantum principal component analysis algorithm
    Ruan, Yue
    Chen, Han-Wu
    Liu, Zhi-Hao
    Zhang, Jun
    Zhu, Wan-Ning
    Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (03): : 666 - 676
  • [5] An improvement algorithm of principal component analysis
    Yu Chuanqiang
    Guo Xiaosong
    Zhang An
    Pan Xingjie
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 529 - 534
  • [6] Bat algorithm with principal component analysis
    Cui, Zhihua
    Li, Feixiang
    Zhang, Wensheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (03) : 603 - 622
  • [7] Overview of principal component analysis algorithm
    Li, Lingjun
    Liu, Shigang
    Peng, Yali
    Sun, Zengguo
    OPTIK, 2016, 127 (09): : 3935 - 3944
  • [8] A New Principal Curve Algorithm for Nonlinear Principal Component Analysis
    Antory, David
    Kruger, Uwe
    Littler, Tim
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 1235 - 1246
  • [9] Randomized Method for Robust Principal Component Analysis
    Liu, Sanyang
    Zhang, Chong
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [10] Error analysis of the principal component analysis demodulation algorithm
    Vargas, J.
    Carazo, J. M.
    Sorzano, C. O. S.
    APPLIED PHYSICS B-LASERS AND OPTICS, 2014, 115 (03): : 355 - 364