Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data

被引:7
|
作者
Durrant, C [1 ]
Morris, AP [1 ]
机构
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford, England
关键词
D O I
10.1186/1471-2156-6-S1-S100
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We recently described a method for linkage disequilibrium (LD) mapping, using cladistic analysis of phased single-nucleotide polymorphism (SNP) haplotypes in a logistic regression framework. However, haplotypes are often not available and cannot be deduced with certainty from the unphased genotypes. One possible two-stage approach is to infer the phase of multilocus genotype data and analyze the resulting haplotypes as if known. Here, haplotypes are inferred using the expectation-maximization ( EM) algorithm and the best-guess phase assignment for each individual analyzed. However, inferring haplotypes from phase-unknown data is prone to error and this should be taken into account in the subsequent analysis. An alternative approach is to analyze the phase-unknown multilocus genotypes themselves. Here we present a generalization of the method for phase-known haplotype data to the case of unphased SNP genotypes. Our approach is designed for high-density SNP data, so we opted to analyze the simulated dataset. The marker spacing in the initial screen was too large for our method to be effective, so we used the answers provided to request further data in regions around the disease loci and in null regions. Power to detect the disease loci, accuracy in localizing the true site of the locus, and false-positive error rates are reported for the inferred-haplotype and unphased genotype methods. For this data, analyzing inferred haplotypes outperforms analysis of genotypes. As expected, our results suggest that when there is little or no LD between a disease locus and the flanking region, there will be no chance of detecting it unless the disease variant itself is genotyped.
引用
收藏
页数:5
相关论文
共 16 条
  • [1] Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data
    Caroline Durrant
    Andrew P Morris
    BMC Genetics, 6
  • [2] Linkage disequilibrium mapping via cladistic analysis of SNP haplotypes.
    Morris, A
    Durrant, C
    Zondervan, K
    Hunt, S
    Deloukas, P
    Cardon, L
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (05) : 613 - 613
  • [3] Linkage disequilibrium mapping via cladistic analysis: Loss of information due to unknown phase
    Durrant, C
    Morris, AP
    ANNALS OF HUMAN GENETICS, 2005, 69 : 767 - 767
  • [5] Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes
    Durrant, C
    Zondervan, KT
    Cardon, LR
    Hunt, S
    Deloukas, P
    Morris, AP
    AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (01) : 35 - 43
  • [6] The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets
    McCaskie, PA
    Carter, KW
    McCaskie, SR
    Palmer, LJ
    BMC GENETICS, 2005, 6 (Suppl 1)
  • [7] The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets
    Pamela A McCaskie
    Kim W Carter
    Simon R McCaskie
    Lyle J Palmer
    BMC Genetics, 6
  • [8] A framework for analyzing both linkage and association: an analysis of Genetic Analysis Workshop 16 simulated data
    E Warwick Daw
    Jevon Plunkett
    Mary Feitosa
    Xiaoyi Gao
    Andrew Van Brunt
    Duanduan Ma
    Jacek Czajkowski
    Michael A Province
    Ingrid Borecki
    BMC Proceedings, 3 (Suppl 7)
  • [9] Linkage analysis of genetic analysis workshop 12 simulated data based on affected individuals only
    Cordell, HJ
    Dudbridge, F
    GENETIC EPIDEMIOLOGY, 2001, 21 : S510 - S515
  • [10] Linkage mapping methods applied to the COGA data set: Presentation group 4 of genetic analysis workshop 14
    Daw, EW
    Doan, BQ
    Elston, RC
    GENETIC EPIDEMIOLOGY, 2005, 29 : S29 - S34