Construction of the model for the Genetic Analysis Workshop 14 simulated data: genotype-phenotype relationships, gene interaction, linkage, association, disequilibrium, and ascertainment effects for a complex phenotype

被引:10
|
作者
Greenberg, DA [1 ]
Zhang, JY [1 ]
Shmulewitz, D [1 ]
Strug, LJ [1 ]
Zimmerman, R [1 ]
Singh, V [1 ]
Marathe, S [1 ]
机构
[1] Columbia Presbyterian Med Ctr, Mailman Sch Publ Hlth, Dept Biostat & Psychiat, Div Stat Genet, New York, NY 10032 USA
关键词
Linkage Disequilibrium; Borderline Personality Disorder; Speech Pattern; Genetic Analysis Workshop; Disease Allele;
D O I
10.1186/1471-2156-6-S1-S3
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The Genetic Analysis Workshop 14 simulated dataset was designed 1) To test the ability to find genes related to a complex disease (such as alcoholism). Such a disease may be given a variety of definitions by different investigators, have associated endophenotypes that are common in the general population, and is likely to be not one disease but a heterogeneous collection of clinically similar, but genetically distinct, entities. 2) To observe the effect on genetic analysis and gene discovery of a complex set of gene x gene interactions. 3) To allow comparison of microsatellite vs. large-scale single-nucleotide polymorphism (SNP) data. 4) To allow testing of association to identify the disease gene and the effect of moderate marker x marker linkage disequilibrium. 5) To observe the effect of different ascertainment/disease definition schemes on the analysis. Data was distributed in two forms. Data distributed to participants contained about 1,000 SNPs and 400 microsatellite markers. Internet-obtainable data consisted of a finer 10,000 SNP map, which also contained data on controls. While disease characteristics and parameters were constant, four "studies" used varying ascertainment schemes based on differing beliefs about disease characteristics. One of the studies contained multiplex two- and three-generation pedigrees with at least four affected members. The simulated disease was a psychiatric condition with many associated behaviors (endophenotypes), almost all of which were genetic in origin. The underlying disease model contained four major genes and two modifier genes. The four major genes interacted with each other to produce three different phenotypes, which were themselves heterogeneous. The population parameters were calibrated so that the major genes could be discovered by linkage analysis in most datasets. The association evidence was more difficult to calibrate but was designed to find statistically significant association in 50% of datasets. We also simulated some marker x marker linkage disequilibrium around some of the genes and also in areas without disease genes. We tried two different methods to simulate the linkage disequilibrium.
引用
收藏
页数:8
相关论文
共 11 条
  • [1] Construction of the model for the Genetic Analysis Workshop 14 simulated data: genotype-phenotype relationships, gene interaction, linkage, association, disequilibrium, and ascertainment effects for a complex phenotype
    David A Greenberg
    Junying Zhang
    Dvora Shmulewitz
    Lisa J Strug
    Regina Zimmerman
    Veena Singh
    Sudhir Marathe
    BMC Genetics, 6
  • [2] GENETIC-STRUCTURE AND THE SEARCH FOR GENOTYPE-PHENOTYPE RELATIONSHIPS - AN EXAMPLE FROM DISEQUILIBRIUM IN THE APO-B GENE REGION
    ZERBA, KE
    KESSLING, AM
    DAVIGNON, J
    SING, CF
    GENETICS, 1991, 129 (02) : 525 - 533
  • [3] Using gene expression data to identify causal pathways between genotype and phenotype in a complex disease: application to Genetic Analysis Workshop 19
    Holly F. Ainsworth
    Heather J. Cordell
    BMC Proceedings, 10 (Suppl 7)
  • [4] Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data
    Durrant, C
    Morris, AP
    BMC GENETICS, 2005, 6 (Suppl 1)
  • [5] Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data
    Caroline Durrant
    Andrew P Morris
    BMC Genetics, 6
  • [6] IFIH1 gene polymorphisms in type 1 diabetes: Genetic association analysis and genotype-phenotype correlation in the Belgian population
    Aminkeng, Folefac
    Van Autreve, Jan E.
    Weets, Ilse
    Quartier, Erik
    Van Schravendijk, Chris
    Gorus, Frans K.
    Van der Auwera, Bart J.
    HUMAN IMMUNOLOGY, 2009, 70 (09) : 706 - 710
  • [7] IFIH1 gene polymorphisms in type 1 diabetes: genetic association analysis and genotype-phenotype correlation in Chinese Han population
    Yang, Hui
    Wang, Zhixiao
    Xu, Kuanfeng
    Gu, Rong
    Chen, Heng
    Yu, Dan
    Xing, Chunyan
    Liu, Yu
    Yu, Liping
    Hutton, John
    Eisenbarth, George
    Yang, Tao
    AUTOIMMUNITY, 2012, 45 (03) : 226 - 232
  • [8] Linkage analysis of a derived glucose phenotype in the Genetic Analysis Workshop 13 simulated data using a variety of Haseman-Elston based regression methods
    Cordell, HJ
    Howson, JMM
    Clayton, DG
    BMC GENETICS, 2003, 4 (Suppl 1)
  • [9] Linkage analysis of a derived glucose phenotype in the Genetic Analysis Workshop 13 simulated data using a variety of Haseman-Elston based regression methods
    Heather J Cordell
    Joanna MM Howson
    David G Clayton
    BMC Genetics, 4
  • [10] Covariate linkage analysis of GAW14 simulated data incorporating subclinical phenotype, sex, population, parent-of-origin, and interaction
    Marian L Hamshere
    Stuart MacGregor
    Valentina Moskvina
    Ivan N Nikolov
    Peter A Holmans
    BMC Genetics, 6