Nonasymptotic support recovery for high-dimensional sparse covariance matrices

被引:2
|
作者
Kashlak, Adam B. [1 ]
Kong, Linglong [1 ]
机构
[1] Univ Alberta, Math & Stat Sci, Edmonton, AB T6G 2G1, Canada
来源
STAT | 2021年 / 10卷 / 01期
关键词
concentration inequality; genomics; random matrix; Schatten norm; REGULARIZATION; ESTIMATORS;
D O I
10.1002/sta4.316
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For high-dimensional data, the standard empirical estimator for the covariance matrix is very poor, and thus many methods have been proposed to more accurately estimate the covariance structure of high-dimensional data. In this article, we consider estimation under the assumption of sparsity but regularize with respect to the individual false-positive rate for incorrectly including a matrix entry in the support of the final estimator. The two benefits of this approach are (1) an interpretable regularization parameter removing the need for computationally expensive tuning and (2) extremely fast computation time arising from use of a binary search algorithm implemented to find the best estimator within a carefully constructed operator norm ball. We compare our approach to universal thresholding estimators and lasso-style penalized estimators on both simulated data and data from gene expression for cancerous tumours.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Testing proportionality of two high-dimensional covariance matrices
    Cheng, Guanghui
    Liu, Baisen
    Tian, Guoliang
    Zheng, Shurong
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 150
  • [42] Semi-proximal augmented lagrangian method for sparse estimation of high-dimensional inverse covariance matrices
    Wu C.
    Xiao Y.
    Li P.
    Journal of Applied and Numerical Optimization, 2020, 2 (02): : 155 - 169
  • [44] High-Dimensional Covariance Decomposition into Sparse Markov and Independence Models
    Janzamin, Majid
    Anandkumar, Animashree
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1549 - 1591
  • [45] Estimation of two high-dimensional covariance matrices and the spectrum of their ratio
    Wen, Jun
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 168 : 1 - 29
  • [46] Test on the linear combinations of covariance matrices in high-dimensional data
    Zhidong Bai
    Jiang Hu
    Chen Wang
    Chao Zhang
    Statistical Papers, 2021, 62 : 701 - 719
  • [47] Power computation for hypothesis testing with high-dimensional covariance matrices
    Lin, Ruitao
    Liu, Zhongying
    Zheng, Shurong
    Yin, Guosheng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 104 : 10 - 23
  • [48] Homogeneity tests of covariance matrices with high-dimensional longitudinal data
    Zhong, Ping-Shou
    Li, Runze
    Santo, Shawn
    BIOMETRIKA, 2019, 106 (03) : 619 - 634
  • [49] Parallel computation of high-dimensional robust correlation and covariance matrices
    Chilson, James
    Ng, Raymond
    Wagner, Alan
    Zamar, Ruben
    ALGORITHMICA, 2006, 45 (03) : 403 - 431
  • [50] Homogeneity test of several covariance matrices with high-dimensional data
    Qayed, Abdullah
    Han, Dong
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2021, 31 (04) : 523 - 540