Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads

被引:10
|
作者
Duitama, Jorge [1 ]
Kennedy, Justin [1 ]
Dinakar, Sanjiv [2 ]
Hernandez, Yoezen [3 ]
Wu, Yufeng [1 ]
Mandoiu, Ion I. [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Unit 2155, Storrs, CT 06269 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[3] CUNY Hunter Coll, Dept Comp Sci, New York, NY 10021 USA
来源
BMC BIOINFORMATICS | 2011年 / 12卷
基金
美国国家科学基金会;
关键词
HIDDEN MARKOV MODEL; STRUCTURAL VARIATION; GENOME; ASSOCIATION; IMPUTATION;
D O I
10.1186/1471-2105-12-S1-S53
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recent technology advances have enabled sequencing of individual genomes, promising to revolutionize biomedical research. However, deep sequencing remains more expensive than microarrays for performing whole-genome SNP genotyping. Results: In this paper we introduce a new multi-locus statistical model and computationally efficient genotype calling algorithms that integrate shotgun sequencing data with linkage disequilibrium (LD) information extracted from reference population panels such as Hapmap or the 1000 genomes project. Experiments on publicly available 454, Illumina, and ABI SOLiD sequencing datasets suggest that integration of LD information results in genotype calling accuracy comparable to that of microarray platforms from sequencing data of low-coverage. A software package implementing our algorithm, released under the GNU General Public License, is available at http://dna.engr.uconn.edu/software/GeneSeq/. Conclusions: Integration of LD information leads to significant improvements in genotype calling accuracy compared to prior LD-oblivious methods, rendering low-coverage sequencing as a viable alternative to microarrays for conducting large-scale genome-wide association studies.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Genomic prediction using low-coverage portable Nanopore sequencing
    Lamb, Harrison J.
    Hayes, Ben J.
    Randhawa, Imtiaz A. S.
    Nguyen, Loan T.
    Ross, Elizabeth M.
    PLOS ONE, 2021, 16 (12):
  • [42] Powerful eQTL mapping through low-coverage RNA sequencing
    Schwarz, Tommer
    Boltz, Toni
    Hou, Kangcheng
    Bot, Merel
    Duan, Chenda
    Loohuis, Loes Olde
    Boks, Marco P.
    Kahn, Rene S.
    Ophoff, Roel A.
    Pasaniuc, Bogdan
    HUMAN GENETICS AND GENOMICS ADVANCES, 2022, 3 (03):
  • [43] Rare Variant Association Testing Under Low-Coverage Sequencing
    Navon, Oron
    Sul, Jae Hoon
    Han, Buhm
    Conde, Lucia
    Bracci, Paige M.
    Riby, Jacques
    Skibola, Christine F.
    Eskin, Eleazar
    Halperin, Eran
    GENETICS, 2013, 194 (03): : 769 - +
  • [44] Construction of high-quality recombination maps with low-coverage genomic sequencing for joint linkage analysis in maize
    Li, Chunhui
    Li, Yongxiang
    Bradbury, Peter J.
    Wu, Xun
    Shi, Yunsu
    Song, Yanchun
    Zhang, Dengfeng
    Rodgers-Melnick, Eli
    Buckler, Edward S.
    Zhang, Zhiwu
    Li, Yu
    Wang, Tianyu
    BMC BIOLOGY, 2015, 13
  • [46] Detecting Pathogenic Structural Variants with Low-Coverage PacBio Sequencing
    Hickey, L.
    Wenger, A. M.
    Baybayan, P.
    Peluso, P.
    Korlach, J.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 729 - 729
  • [47] SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples
    Le, Si Quang
    Durbin, Richard
    GENOME RESEARCH, 2011, 21 (06) : 952 - 960
  • [48] Variance in estimated pairwise genetic distance under high versus low coverage sequencing: The contribution of linkage disequilibrium
    Shpak, Max
    Ni, Yang
    Lu, Jie
    Mueller, Peter
    THEORETICAL POPULATION BIOLOGY, 2017, 117 : 51 - 63
  • [49] Development and evaluation of a haplotype reference panel for low-coverage whole genome sequencing genotype imputation in turbot (Scophthalmus maximus)
    Cao, Junwen
    Huang, Zhihui
    Ma, Aijun
    Jiang, Yuhang
    Zhang, Hao
    Zhang, Rongchao
    Wang, Xinan
    Liu, Zhifeng
    Xu, Rongjing
    AQUACULTURE REPORTS, 2025, 41
  • [50] An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data
    Jin Zhang
    Jiayin Wang
    Yufeng Wu
    BMC Bioinformatics, 13