Efficient computation of limit spectra of sample covariance matrices

被引:23
|
作者
Dobriban, Edgar [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Limiting spectral distribution; sample covariance matrix; Stieltjes transform; numerical computation; high-dimensional statistics; DIMENSIONAL RANDOM MATRICES; DETERMINISTIC EQUIVALENT; SINGULAR-VALUES; CONVERGENCE; FRAMEWORK; ALGORITHM; CHANNELS;
D O I
10.1142/S2010326315500197
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Models from random matrix theory (RMT) are increasingly used to gain insights into the behavior of statistical methods under high-dimensional asymptotics. However, the applicability of the framework is limited by numerical problems. Consider the usual model of multivariate statistics where the data is a sample from a multivariate distribution with a given covariance matrix. Under high-dimensional asymptotics, there is a deterministic map from the distribution of eigenvalues of the population covariance matrix (the population spectral distribution or PSD), to the of empirical spectral distribution (ESD). The current methods for computing this map are inefficient, and this limits the applicability of the theory. We propose a new method to compute numerically the ESD from an arbitrary input PSD. Our method, called SPECTRODE, finds the support and the density of the ESD to high precision; we prove this for finite discrete distributions. In computational experiments SPECTRODE outperforms existing methods by orders of magnitude in speed and accuracy. We apply it to compute expectations and contour integrals of the ESD, which are often central in applications. We also illustrate that SPECTRODE is directly useful in statistical problems, such as estimation and hypothesis testing for covariance matrices. Our proposal, implemented in open source software, may broaden the use of RMT in high-dimensional data analysis.
引用
收藏
页数:36
相关论文
共 50 条