Power calculation for overall hypothesis testing with high-dimensional commensurate outcomes

被引:11
|
作者
Chi, Yueh-Yun [1 ]
Gribbin, Matthew J. [2 ]
Johnson, Jacqueline L. [3 ]
Muller, Keith E. [4 ]
机构
[1] Univ Florida, Dept Biostat, Gainesville, FL 32611 USA
[2] MedImmune, Dept Biostat, Gaithersburg, MD USA
[3] Univ N Carolina, Dept Psychiat, Chapel Hill, NC USA
[4] Univ Florida, Dept Hlth Outcomes & Policy, Gainesville, FL USA
关键词
MANOVA; metabolomics; genomics; proteomics; SAMPLE-SIZE DETERMINATION; LINEAR-MODEL POWER; FEWER OBSERVATIONS; COVARIANCE; VARIANCE; BIAS;
D O I
10.1002/sim.5986
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The complexity of system biology means that any metabolic, genetic, or proteomic pathway typically includes so many components (e.g., molecules) that statistical methods specialized for overall testing of high-dimensional and commensurate outcomes are required. While many overall tests have been proposed, very few have power and sample size methods. We develop accurate power and sample size methods and software to facilitate study planning for high-dimensional pathway analysis. With an account of any complex correlation structure between high-dimensional outcomes, the new methods allow power calculation even when the sample size is less than the number of variables. We derive the exact (finite-sample) and approximate non-null distributions of the univariate' approach to repeated measures test statistic, as well as power-equivalent scenarios useful to generalize our numerical evaluations. Extensive simulations of group comparisons support the accuracy of the approximations even when the ratio of number of variables to sample size is large. We derive a minimum set of constants and parameters sufficient and practical for power calculation. Using the new methods and specifying the minimum set to determine power for a study of metabolic consequences of vitamin B6 deficiency helps illustrate the practical value of the new results. Free software implementing the power and sample size methods applies to a wide range of designs, including one group pre-intervention and post-intervention comparisons, multiple parallel group comparisons with one-way or factorial designs, and the adjustment and evaluation of covariate effects. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:812 / 827
页数:16
相关论文
共 50 条
  • [21] HYPOTHESIS TESTING FOR BLOCK-STRUCTURED CORRELATION FOR HIGH-DIMENSIONAL VARIABLES
    Zheng, Shurong
    He, Xuming
    Guo, Jianhua
    STATISTICA SINICA, 2022, 32 (02) : 719 - 735
  • [22] Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models
    Ma, Rong
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (534) : 984 - 998
  • [23] Testing linear hypothesis of high-dimensional means with unequal covariance matrices
    Cao, Mingxiang
    Liang, Shiting
    He, Daojiang
    Xu, Kai
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2022, 51 (02) : 526 - 541
  • [24] Hypothesis testing for band size detection of high-dimensional banded precision matrices
    An, Baiguo
    Guo, Jianhua
    Liu, Yufeng
    BIOMETRIKA, 2014, 101 (02) : 477 - 483
  • [25] Bayesian hypothesis testing for equality of high-dimensional means using cluster subspaces
    Chen, Fang
    Hai, Qiuchen
    Wang, Min
    COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1301 - 1320
  • [26] A strong converse bound for multiple hypothesis testing, with applications to high-dimensional estimation
    Venkataramanan, Ramji
    Johnson, Oliver
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (01): : 1126 - 1149
  • [27] HYPOTHESIS TESTING FOR DENSITIES AND HIGH-DIMENSIONAL MULTINOMIALS: SHARP LOCAL MINIMAX RATES
    Balakrishnan, Sivaraman
    Wasserman, Larry
    ANNALS OF STATISTICS, 2019, 47 (04): : 1893 - 1927
  • [28] HYPOTHESIS TESTING IN HIGH-DIMENSIONAL INSTRUMENTAL VARIABLES REGRESSION WITH AN APPLICATION TO GENOMICS DATA
    Lu, Jiarui
    Li, Hongzhe
    STATISTICA SINICA, 2022, 32 : 613 - 633
  • [29] HYPOTHESIS TESTING FOR HIGH-DIMENSIONAL TIME SERIES VIA SELF-NORMALIZATION
    Wang, Runmin
    Shao, Xiaofeng
    ANNALS OF STATISTICS, 2020, 48 (05): : 2728 - 2758
  • [30] Hypothesis testing for high-dimensional multivariate regression with false discovery rate control
    Zhu, Yunlong
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (21) : 7476 - 7495