Differences in set-based tests for sparse alternatives when testing sets of outcomes compared to sets of explanatory factors in genetic association studies

被引:0
|
作者
Sun, Ryan [1 ]
Shi, Andy [2 ]
Lin, Xihong [2 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, 1515 Holcombe Blvd, Houston, TX 77030 USA
[2] Harvard TH Chan Sch Publ Hlth, Dept Biostat, 677 Huntington Ave, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
Detection boundary; Genetic association study; Multiple outcomes; Set-based inference; Sparse alternative; HIGHER CRITICISM; STATISTICS;
D O I
10.1093/biostatistics/kxac036
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Set-based association tests are widely popular in genetic association settings for their ability to aggregate weak signals and reduce multiple testing burdens. In particular, a class of set-based tests including the Higher Criticism, Berk-Jones, and other statistics have recently been popularized for reaching a so-called detection boundary when signals are rare and weak. Such tests have been applied in two subtly different settings: (a) associating a genetic variant set with a single phenotype and (b) associating a single genetic variant with a phenotype set. A significant issue in practice is the choice of test, especially when deciding between innovated and generalized type methods for detection boundary tests. Conflicting guidance is present in the literature. This work describes how correlation structures generate marked differences in relative operating characteristics for settings (a) and (b). The implications for study design are significant. We also develop novel power bounds that facilitate the aforementioned calculations and allow for analysis of individual testing settings. In more concrete terms, our investigation is motivated by translational expression quantitative trait loci (eQTL) studies in lung cancer. These studies involve both testing for groups of variants associated with a single gene expression (multiple explanatory factors) and testing whether a single variant is associated with a group of gene expressions (multiple outcomes). Results are supported by a collection of simulation studies and illustrated through lung cancer eQTL examples.
引用
收藏
页码:171 / 187
页数:17
相关论文
共 4 条
  • [1] Set-Based Tests for Genetic Association in Longitudinal Studies
    He, Zihuai
    Zhang, Min
    Lee, Seunggeun
    Smith, Jennifer A.
    Guo, Xiuqing
    Palmas, Walter
    Kardia, Sharon L. R.
    Roux, Ana V. Diez
    Mukherjee, Bhramar
    BIOMETRICS, 2015, 71 (03) : 606 - 615
  • [2] Set-Based Tests for Genetic Association Studies with Interval-Censored Competing Risks Outcomes
    Xu, Zhichao
    Choi, Jaihee
    Sun, Ryan
    STATISTICS IN BIOSCIENCES, 2024,
  • [3] Set-based genetic association and interaction tests for survival outcomes based on weighted V statistics
    Li, Chenxi
    Wu, Di
    Lu, Qing
    GENETIC EPIDEMIOLOGY, 2021, 45 (01) : 46 - 63
  • [4] Inference for set-based effects in genetic association studies with interval-censored outcomes
    Sun, Ryan
    Zhu, Liang
    Li, Yimei
    Yasui, Yutaka
    Robison, Leslie
    BIOMETRICS, 2023, 79 (02) : 1573 - 1585