An Efficient Nonlinear Regression Approach for Genome-wide Detection of Marginal and Interacting Genetic Variations

Cited by: 2
Authors
Lee, Seunghak [1]
Lozano, Aurelie [2]
Kambadur, Prabhanjan [3]
Xing, Eric P. [1]
Affiliations
[1] Carnegie Mellon Univ, Sch Comp Sci, 5000 Forbes Ave, Pittsburgh, PA 15217 USA
[2] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY USA
[3] Bloomberg LP, New York, NY USA
Keywords
genome-wide association mapping; SNP-SNP interaction; piecewise linear model screening; stability selection; group lasso; ALZHEIMERS-DISEASE; LATE-ONSET; ASSOCIATION; LASSO; DOPAMINE
DOI
10.1089/cmb.2015.0202
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Genome-wide association studies have revealed individual genetic variants associated with phenotypic traits such as disease risk and gene expression. However, detecting pairwise interaction effects of genetic variants on traits remains a challenge because of the large number of variant combinations (≈10^11 SNP pairs in the human genome) and the relatively small sample sizes (typically <10^4). Despite recent breakthroughs in detecting interaction effects, several problems remain open, including: (1) how to quickly process a large number of SNP pairs, (2) how to distinguish between true signals and SNPs/SNP pairs merely correlated with true signals, (3) how to detect nonlinear associations between SNP pairs and traits given small sample sizes, and (4) how to control false positives. In this article, we present a unified framework, called SPHINX, which addresses these challenges. We first propose a piecewise linear model for interaction detection, because it is simple enough that its parameters can be estimated from small samples but complex enough to capture nonlinear interaction effects. Then, based on the piecewise linear model, we introduce randomized group lasso under stability selection, together with a screening algorithm, to address the statistical and computational challenges mentioned above. In our experiments, we first demonstrate that SPHINX achieves better power than existing methods for interaction detection under false positive control. We further apply SPHINX to a late-onset Alzheimer's disease dataset, and report 16 SNPs and 17 SNP pairs associated with gene traits. We also present a highly scalable implementation of our screening algorithm, which can screen ≈118 billion candidate associations on a 60-node cluster in <5.5 hours.
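The scale figures quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration and is not part of SPHINX: the function names are hypothetical, it assumes candidates are unordered SNP pairs (p(p-1)/2 for p SNPs), and it assumes the screening work is spread evenly over the 60 nodes, which the abstract does not state.

```python
from math import comb, isqrt


def num_snp_pairs(num_snps: int) -> int:
    """Number of unordered SNP pairs among num_snps SNPs: p * (p - 1) / 2."""
    return comb(num_snps, 2)


def snps_for_pairs(num_pairs: int) -> int:
    """Smallest SNP count p whose unordered pair count reaches num_pairs."""
    # Start from the integer solution of p * (p - 1) / 2 = num_pairs,
    # then step up until the pair count is actually reached.
    p = (1 + isqrt(1 + 8 * num_pairs)) // 2
    while comb(p, 2) < num_pairs:
        p += 1
    return p


def min_throughput(candidates: float, hours: float, nodes: int) -> tuple[float, float]:
    """Candidates screened per second (whole cluster, per node).

    Because the abstract reports a run time *under* 5.5 hours, these are
    lower bounds. An even split across nodes is an assumption.
    """
    per_second = candidates / (hours * 3600)
    return per_second, per_second / nodes


if __name__ == "__main__":
    p = snps_for_pairs(10**11)
    print(f"SNPs needed for 1e11 pairs: {p:,}")
    cluster, node = min_throughput(118e9, 5.5, 60)
    print(f"cluster: {cluster:,.0f}/s, per node: {node:,.0f}/s")
```

Under these assumptions, roughly 4.5 × 10^5 SNPs already give about 10^11 pairs, and screening 118 billion candidates in 5.5 hours corresponds to about 6 million candidates per second across the cluster.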
Pages: 372-389
Number of pages: 18