Efficient computation of limit spectra of sample covariance matrices

Cited by: 23
Author
Dobriban, Edgar [1 ]
Affiliation
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
Funding
National Science Foundation (USA)
Keywords
Limiting spectral distribution; sample covariance matrix; Stieltjes transform; numerical computation; high-dimensional statistics; DIMENSIONAL RANDOM MATRICES; DETERMINISTIC EQUIVALENT; SINGULAR-VALUES; CONVERGENCE; FRAMEWORK; ALGORITHM; CHANNELS;
DOI
10.1142/S2010326315500197
Chinese Library Classification
O4 [Physics]
Subject classification code
0702
Abstract
Models from random matrix theory (RMT) are increasingly used to gain insights into the behavior of statistical methods under high-dimensional asymptotics. However, the applicability of the framework is limited by numerical problems. Consider the usual model of multivariate statistics where the data is a sample from a multivariate distribution with a given covariance matrix. Under high-dimensional asymptotics, there is a deterministic map from the distribution of eigenvalues of the population covariance matrix (the population spectral distribution or PSD) to the limit of the empirical spectral distribution (ESD). The current methods for computing this map are inefficient, and this limits the applicability of the theory. We propose a new method to compute the ESD numerically from an arbitrary input PSD. Our method, called SPECTRODE, finds the support and the density of the ESD to high precision; we prove this for finite discrete distributions. In computational experiments SPECTRODE outperforms existing methods by orders of magnitude in speed and accuracy. We apply it to compute expectations and contour integrals of the ESD, which are often central in applications. We also illustrate that SPECTRODE is directly useful in statistical problems, such as estimation and hypothesis testing for covariance matrices. Our proposal, implemented in open source software, may broaden the use of RMT in high-dimensional data analysis.
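To make the PSD-to-ESD map concrete, the following is a minimal Python sketch of a baseline fixed-point approach. It is not SPECTRODE, and it is not taken from the paper's software. It assumes a finite discrete PSD with atoms `t` and weights `w`, an aspect ratio `gamma` = p/n, and the standard Silverstein equation for the companion Stieltjes transform v(z), namely z = -1/v + gamma * sum_i w_i t_i / (1 + t_i v). The function names, the damping factor of 0.5, and the smoothing offset `eps` are illustrative assumptions.

```python
import numpy as np

def companion_stieltjes(z, t, w, gamma, tol=1e-12, max_iter=100_000):
    """Solve the Silverstein equation for v(z), Im z > 0, by damped fixed-point iteration.

    Illustrative baseline only (an assumption of this sketch, not the SPECTRODE method).
    """
    v = -1.0 / z  # starting point in the upper half-plane
    for _ in range(max_iter):
        # One application of the fixed-point map v -> 1 / (-z + gamma * E_H[t / (1 + t v)])
        v_map = 1.0 / (-z + gamma * np.sum(w * t / (1.0 + t * v)))
        # Averaging with the previous iterate keeps Im v > 0 and damps oscillation
        v_next = 0.5 * (v + v_map)
        if abs(v_next - v) < tol:
            return v_next
        v = v_next
    return v  # may not have converged, e.g. very close to a support edge

def limit_density(x, t, w, gamma, eps=1e-3):
    """Approximate the limiting ESD density at x by Im m(x + i*eps) / pi."""
    z = complex(x, eps)
    v = companion_stieltjes(z, t, w, gamma)
    # Convert the companion transform v to the Stieltjes transform m of the ESD
    m = (v + (1.0 - gamma) / z) / gamma
    return m.imag / np.pi

# Hypothetical usage: PSD with half the eigenvalues at 1 and half at 4, gamma = 0.5
t = np.array([1.0, 4.0])
w = np.array([0.5, 0.5])
grid = np.linspace(0.1, 10.0, 50)
density = np.array([limit_density(x, t, w, 0.5) for x in grid])
print(density.max())
```

The offset `eps` trades accuracy for speed: the density is smoothed at scale `eps`, while the iteration slows down as `eps` shrinks and near the edges of the support. This is the kind of inefficiency the abstract attributes to existing methods.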
Pages: 36
Related papers
50 in total
  • [21] Efficient computation of the super-sample covariance for stage IV galaxy surveys
    Lacasa, Fabien
    Aubert, Marie
    Baratta, Philippe
    Carron, Julien
    Gorce, Adelie
    Beauchamps, Sylvain Gouyou
    Legrand, Louis
    Dizgah, Azadeh Moradinezhad
    Tutusaus, Isaac
    ASTRONOMY & ASTROPHYSICS, 2023, 671
  • [22] Characteristic Polynomials of Sample Covariance Matrices
    Koesters, H.
    JOURNAL OF THEORETICAL PROBABILITY, 2011, 24 (02) : 545 - 576
  • [23] Functional CLT for sample covariance matrices
    Bai, Zhidong
    Wang, Xiaoying
    Zhou, Wang
    BERNOULLI, 2010, 16 (04) : 1086 - 1113
  • [24] Linear Pooling of Sample Covariance Matrices
    Raninen, Elias
    Tyler, David E.
    Ollila, Esa
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 659 - 672
  • [25] On the statistics of eigenvectors of sample covariance matrices
    Friedlander, B
    THIRTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 1297 - 1301
  • [26] On the principal components of sample covariance matrices
    Alex Bloemendal
    Antti Knowles
    Horng-Tzer Yau
    Jun Yin
    Probability Theory and Related Fields, 2016, 164 : 459 - 552
  • [27] Spectral properties of sample covariance matrices
    Serdobolskii, VI
    THEORY OF PROBABILITY AND ITS APPLICATIONS, 1996, 40 (04) : 777 - 786
  • [28] Efficient Computation of the Joint Sample Frequency Spectra for Multiple Populations
    Kamm, John A.
    Terhorst, Jonathan
    Song, Yun S.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (01) : 182 - 194
  • [29] Central limit theorem for linear spectral statistics of large dimensional separable sample covariance matrices
    Bai, Zhidong
    Li, Huiqin
    Pan, Guangming
    BERNOULLI, 2019, 25 (03) : 1838 - 1869
  • [30] Central Limit Theorem for Linear Eigenvalue Statistics for a Tensor Product Version of Sample Covariance Matrices
    A. Lytova
    Journal of Theoretical Probability, 2018, 31 : 1024 - 1057