Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data

被引:1
|
作者
Boutry, Simon [1 ,2 ]
Helaers, Raphael [1 ]
Lenaerts, Tom [2 ,3 ,4 ]
Vikkula, Miikka [1 ,5 ]
机构
[1] Univ Louvain, Human Mol Genet, de Duve Inst, Brussels, Belgium
[2] Vrije Univ Brussel, Univ Libre Bruxelles, Interuniv Inst Bioinformat Brussels, Brussels, Belgium
[3] Univ Libre Bruxelles, Machine Learning Grp, Brussels, Belgium
[4] Vrije Univ Brussel, Artificial Intelligence Lab, Brussels, Belgium
[5] WEL Res Inst, WELBIO Dept, Wavre, Belgium
关键词
STATISTICAL TESTS; DISEASE ASSOCIATION; COMMON DISEASES; DETECTING ASSOCIATIONS; GENETIC ASSOCIATION; GENERAL FRAMEWORK; MULTIPLE SNPS; R PACKAGE; POWER; PATHOGENICITY;
D O I
10.1371/journal.pcbi.1011488
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The development of high-throughput next-generation sequencing technologies and large-scale genetic association studies produced numerous advances in the biostatistics field. Various aggregation tests, i.e. statistical methods that analyze associations of a trait with multiple markers within a genomic region, have produced a variety of novel discoveries. Notwithstanding their usefulness, there is no single test that fits all needs, each suffering from specific drawbacks. Selecting the right aggregation test, while considering an unknown underlying genetic model of the disease, remains an important challenge. Here we propose a new ensemble method, called Excalibur, based on an optimal combination of 36 aggregation tests created after an in-depth study of the limitations of each test and their impact on the quality of result. Our findings demonstrate the ability of our method to control type I error and illustrate that it offers the best average power across all scenarios. The proposed method allows for novel advances in Whole Exome/Genome sequencing association studies, able to handle a wide range of association models, providing researchers with an optimal aggregation analysis for the genetic regions of interest. An increasing number of diseases previously thought to be caused by a mutation in a single gene are now being considered as involving several variants in a small number of genes (i.e. "oligogenic"). There is a limited number of dedicated bioinformatic tools to study such oligogenic causes of diseases. These include so called aggregation tests. Yet, an important challenge is to select the right aggregation test among the various ones that have been developed, as each suffers from different limitations. We have computationally compared 59 aggregation methods to explore their limitations. We found that combining 36 of them results in a more robust method, which we baptized "Excalibur". It can handle a wider range of hypotheses and case-control studies than any of the single methods, while reducing the number of false positive results. Excalibur also provides a comprehensive elucidation of the underlying genetic architecture pertaining to each genomic region under investigation. Thus, it provides a user-friendly, and statistically sound platform to study oligogenic inheritance with the increasing amount of available genetic data.
引用
收藏
页数:26
相关论文
共 28 条
  • [21] Integrating Rare-Variant Testing, Function Prediction, and Gene Network in Composite Resequencing-Based Genome-Wide Association Studies (CR-GWAS)
    Zhu, Chengsong
    Li, Xianran
    Yu, Jianming
    G3-GENES GENOMES GENETICS, 2011, 1 (03): : 233 - 243
  • [22] Gene-based Rare Variant Association Tests for Ancestry-matched Case-control Data
    Wang, Chaolong
    Sun, Baoluo
    Cheng, Shanshan
    Wang, Zengmiao
    Deng, Minghua
    Chen, Han
    GENETIC EPIDEMIOLOGY, 2019, 43 (07) : 914 - 915
  • [23] Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants
    Kinnamon, Daniel D.
    Hershberger, Ray E.
    Martin, Eden R.
    PLOS ONE, 2012, 7 (02):
  • [24] Rare variant association tests for ancestry-matched case-control data based on conditional logistic regression
    Cheng, Shanshan
    Lyu, Jingjing
    Shi, Xian
    Wang, Kai
    Wang, Zengmiao
    Deng, Minghua
    Sun, Baoluo
    Wang, Chaolong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [25] A unifying framework for rare variant association testing in family-based designs, including higher criticism approaches, SKATs, and burden tests
    Hecker, Julian
    Townes, F. William
    Kachroo, Priyadarshini
    Laurie, Cecelia
    Lasky-Su, Jessica
    Ziniti, John
    Cho, Michael H.
    Weiss, Scott T.
    Laird, Nan M.
    Lange, Christoph
    BIOINFORMATICS, 2020, 36 (22-23) : 5432 - 5438
  • [26] BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing
    Yan, Song
    Li, Yun
    BIOINFORMATICS, 2014, 30 (04) : 480 - 487
  • [27] Whole-exome sequencing and gene-based rare variant association tests suggest that PLA2G4E might be a risk gene for panic disorder
    Morimoto, Yoshiro
    Shimada-Sugimoto, Mihoko
    Otowa, Takeshi
    Yoshida, Shintaro
    Kinoshita, Akira
    Mishima, Hiroyuki
    Yamaguchi, Naohiro
    Mori, Takatoshi
    Imamura, Akira
    Ozawa, Hiroki
    Kurotaki, Naohiro
    Ziegler, Christiane
    Domschke, Katharina
    Deckert, Juergen
    Umekage, Tadashi
    Tochigi, Mamoru
    Kaiya, Hisanobu
    Okazaki, Yuji
    Tokunaga, Katsushi
    Sasaki, Tsukasa
    Yoshiura, Koh-ichiro
    Ono, Shinji
    TRANSLATIONAL PSYCHIATRY, 2018, 8
  • [28] Whole-exome sequencing and gene-based rare variant association tests suggest that PLA2G4E might be a risk gene for panic disorder
    Yoshiro Morimoto
    Mihoko Shimada-Sugimoto
    Takeshi Otowa
    Shintaro Yoshida
    Akira Kinoshita
    Hiroyuki Mishima
    Naohiro Yamaguchi
    Takatoshi Mori
    Akira Imamura
    Hiroki Ozawa
    Naohiro Kurotaki
    Christiane Ziegler
    Katharina Domschke
    Jürgen Deckert
    Tadashi Umekage
    Mamoru Tochigi
    Hisanobu Kaiya
    Yuji Okazaki
    Katsushi Tokunaga
    Tsukasa Sasaki
    Koh-ichiro Yoshiura
    Shinji Ono
    Translational Psychiatry, 8