High-dimensional two-sample mean vectors test and support recovery with factor adjustment

被引:0
|
作者
He, Yong [1 ]
Zhang, Mingjuan [2 ]
Zhang, Xinsheng [3 ]
Zhou, Wang [4 ]
机构
[1] Shandong Univ, Inst Financial Studies, Jinan 250100, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Stat & Math, Shanghai 201209, Peoples R China
[3] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
[4] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
基金
美国国家科学基金会;
关键词
False discovery rate; High-dimensional; Multiple testing; Multiplier bootstrap; Two-sample test; FALSE DISCOVERY RATE; LARGE COVARIANCE ESTIMATION; PRINCIPAL-COMPONENTS; FEWER OBSERVATIONS; FACTOR MODELS; NUMBER; MATRIX; PROPORTION; BOOTSTRAP; RETURNS;
D O I
10.1016/j.csda.2020.107004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Testing the equality of two mean vectors is a classical problem in multivariate analysis. In this article, we consider the test in the high-dimensional setting. Existing tests often assume that the covariance matrix (or its inverse) of the underlying variables is sparse, which is rarely true in social science due to the existence of latent common factors. In the article, we introduce a maximum-type test statistic based on the factor-adjusted data. The factor-adjustment step increases the signal-to-noise ratio and thus results in more powerful test. We obtain the limiting null distribution of the maximum-type test statistic, which is the extreme value distribution of type I. To overcome the well-known slow convergence rate of the test statistic's distribution to the limiting extreme value distribution, we also propose a multiplier bootstrap method to improve the finite-sample performance. In addition, a multiple testing procedure with false discovery rate (FDR) control is proposed for identifying specific locations that differ significantly between the two groups. Thorough numerical studies are conducted to show the superiority of the test over other state-of-the-art tests. The performance of the test is also assessed through a real stock market dataset. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Robust two-sample test of high-dimensional mean vectors under dependence
    Wang, Wei
    Lin, Nan
    Tang, Xiang
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 169 : 312 - 329
  • [2] A high-dimensional two-sample test for the mean using random subspaces
    Thulin, Mans
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 26 - 38
  • [3] Two-sample mean vector projection test in high-dimensional data
    Huang, Caizhu
    Cui, Xia
    Pagui, Euloge Clovis Kenne
    COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1061 - 1091
  • [4] Two-sample mean vector projection test in high-dimensional data
    Caizhu Huang
    Xia Cui
    Euloge Clovis Kenne Pagui
    Computational Statistics, 2024, 39 : 1061 - 1091
  • [5] A note on high-dimensional two-sample test
    Feng, Long
    Sun, Fasheng
    STATISTICS & PROBABILITY LETTERS, 2015, 105 : 29 - 36
  • [6] Order test for high-dimensional two-sample means
    Lee, Sang H.
    Lim, Johan
    Li, Erning
    Vannucci, Marina
    Petkova, Eva
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (09) : 2719 - 2725
  • [7] An adaptive two-sample test for high-dimensional means
    Xu, Gongjun
    Lin, Lifeng
    Wei, Peng
    Pan, Wei
    BIOMETRIKA, 2016, 103 (03) : 609 - 624
  • [8] A Robust High-Dimensional Test for Two-Sample Comparisons
    Bulut, Hasan
    Iftikhar, Soofia
    Faiz, Nosheen
    Albalawi, Olayan
    AXIOMS, 2024, 13 (09)
  • [9] A NEW TEST FOR HIGH-DIMENSIONAL TWO-SAMPLE MEAN PROBLEMS WITH CONSIDERATION OF CORRELATION STRUCTURE
    Yang, Songshan
    Zheng, Shurong
    Li, Runze
    ANNALS OF STATISTICS, 2024, 52 (05): : 2217 - 2240
  • [10] Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings
    Cai, Tony
    Liu, Weidong
    Xia, Yin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 265 - 277