Faster Algorithms for High-Dimensional Robust Covariance Estimation

被引:0
|
作者
Cheng, Yu [1 ]
Diakonikolas, Ilias [2 ]
Ge, Rong [1 ]
Woodruff, David P. [3 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem with near-optimal error guarantees for several natural structured distributions. Our main contribution is to develop faster algorithms for this problem whose running time nearly matches that of computing the empirical covariance. Given N = (Omega) over tilde (d(2)/epsilon(2)) samples from a d-dimensional Gaussian distribution, an epsilon-fraction of which may be arbitrarily corrupted, our algorithm runs in time (O) over tilde (d(3.26))= poly(epsilon) and approximates the unknown covariance matrix to optimal error up to a logarithmic factor. Previous robust algorithms with comparable error guarantees all have runtimes (Omega) over tilde (d(2 omega)) when epsilon = Omega(1), where omega is the exponent of matrix multiplication. We also provide evidence that improving the running time of our algorithm may require new algorithmic techniques.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] On Coupling Robust Estimation with Regularization for High-Dimensional Data
    Kalina, Jan
    Hlinka, Jaroslav
    DATA SCIENCE: INNOVATIVE DEVELOPMENTS IN DATA ANALYSIS AND CLUSTERING, 2017, : 15 - 27
  • [42] A Robust High-Dimensional Estimation of Multinomial Mixture Models
    Sabbaghi, Azam
    Eskandari, Farzad
    Navabpoor, Hamid Reza
    JOURNAL OF STATISTICAL THEORY AND APPLICATIONS, 2021, 20 (01): : 21 - 32
  • [43] A novel robust estimation for high-dimensional precision matrices
    Wang, Shaoxin
    Xie, Chaoping
    Kang, Xiaoning
    STATISTICS IN MEDICINE, 2023, 42 (05) : 656 - 675
  • [44] Lower bound estimation for a family of high-dimensional sparse covariance matrices
    Li, Huimin
    Liu, Youming
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2024, 22 (02)
  • [45] Adaptive banding covariance estimation for high-dimensional multivariate longitudinal data
    Qian, Fang
    Zhang, Weiping
    Chen, Yu
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (03): : 906 - 938
  • [46] Ridge estimation of inverse covariance matrices from high-dimensional data
    van Wieringen, Wessel N.
    Peeters, Carel F. W.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 284 - 303
  • [47] Optimal estimation of high-dimensional sparse covariance matrices with missing data
    Miao, Li
    Wang, Jinru
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024,
  • [48] Estimation and optimal structure selection of high-dimensional Toeplitz covariance matrix
    Yang, Yihe
    Zhou, Jie
    Pan, Jianxin
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 184
  • [49] Model averaging estimation for high-dimensional covariance matrices with a network structure
    Zhu, Rong
    Zhang, Xinyu
    Ma, Yanyuan
    Zou, Guohua
    ECONOMETRICS JOURNAL, 2021, 24 (01): : 177 - 197
  • [50] Improved Shrinkage-to-Tapering Estimation for High-Dimensional Covariance Matrices
    Li, Jianbo
    Zhou, Jie
    Zhang, Bin
    2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 1064 - 1071