Efficient phasing and imputation of low-coverage sequencing data using large reference panels

被引:174
|
作者
Rubinacci, Simone [1 ,2 ]
Ribeiro, Diogo M. [1 ,2 ]
Hofmeister, Robin J. [1 ,2 ]
Delaneau, Olivier [1 ,2 ]
机构
[1] Univ Lausanne, Dept Computat Biol, Lausanne, Switzerland
[2] Univ Lausanne, Swiss Inst Bioinformat, Lausanne, Switzerland
关键词
LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; GENOME; ASSOCIATION; DISCOVERY; FRAMEWORK; RESOURCE; SNP;
D O I
10.1038/s41588-020-00756-0
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
GLIMPSE is a new method for haplotype phasing and genotype imputation of low-coverage sequencing datasets from large reference panels. GLIMPSE shows remarkable performance across different coverages and human populations. Low-coverage whole-genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined because current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. GLIMPSE achieves imputation of a genome for less than US$1 in computational cost, considerably outperforming other methods and improving imputation accuracy over the full allele frequency range. As a proof of concept, we show that 1x coverage enables effective gene expression association studies and outperforms dense SNP arrays in rare variant burden tests. Overall, this study illustrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies.
引用
收藏
页码:120 / 126
页数:22
相关论文
共 50 条
  • [31] Kinship Estimation Based on Extremely Low-Coverage Sequencing Data
    Dou, Jinzhuang
    Chothani, Sonia
    Sim, Xueling
    Hughes, Jason D.
    Reilly, Dermot F.
    Tai, E. Shyong
    Liu, Jianjun
    Wang, Chaolong
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 619 - 620
  • [32] Optimisation of low-coverage sequencing approach
    Daviaud, Christian
    Gerber, Zuzana
    Meslage, Stephane
    Bacq-Daian, Dephine
    Meyer, Vincent
    Boland, Anne
    Deleuze, Jean-Francois
    Olaso, Robert
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 613 - 613
  • [33] Extremely low-coverage sequencing and imputation increases power for genome-wide association studies
    Bogdan Pasaniuc
    Nadin Rohland
    Paul J McLaren
    Kiran Garimella
    Noah Zaitlen
    Heng Li
    Namrata Gupta
    Benjamin M Neale
    Mark J Daly
    Pamela Sklar
    Patrick F Sullivan
    Sarah Bergen
    Jennifer L Moran
    Christina M Hultman
    Paul Lichtenstein
    Patrik Magnusson
    Shaun M Purcell
    David W Haas
    Liming Liang
    Shamil Sunyaev
    Nick Patterson
    Paul I W de Bakker
    David Reich
    Alkes L Price
    Nature Genetics, 2012, 44 : 631 - 635
  • [34] Genomic prediction using low-coverage portable Nanopore sequencing
    Lamb, Harrison J.
    Hayes, Ben J.
    Randhawa, Imtiaz A. S.
    Nguyen, Loan T.
    Ross, Elizabeth M.
    PLOS ONE, 2021, 16 (12):
  • [35] Extremely low-coverage sequencing and imputation increases power for genome-wide association studies
    Pasaniuc, Bogdan
    Rohland, Nadin
    McLaren, Paul J.
    Garimella, Kiran
    Zaitlen, Noah
    Li, Heng
    Gupta, Namrata
    Neale, Benjamin M.
    Daly, Mark J.
    Sklar, Pamela
    Sullivan, Patrick F.
    Bergen, Sarah
    Moran, Jennifer L.
    Hultman, Christina M.
    Lichtenstein, Paul
    Magnusson, Patrik
    Purcell, Shaun M.
    Haas, David W.
    Liang, Liming
    Sunyaev, Shamil
    Patterson, Nick
    de Bakker, Paul I. W.
    Reich, David
    Price, Alkes L.
    NATURE GENETICS, 2012, 44 (06) : 631 - U41
  • [36] MitoIMP: A Computational Framework for Imputation of Missing Data in Low-Coverage Human Mitochondrial Genome
    Ishiya, Koji
    Mizuno, Fuzuki
    Wang, Li
    Ueda, Shintaroh
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2019, 13
  • [37] Genotype Imputation from Large Reference Panels
    Das, Sayantan
    Abecasis, Goncalo R.
    Browning, Brian L.
    ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 19, 2018, 19 : 73 - 96
  • [38] Evaluation for the effect of low-coverage sequencing on genomic selection in large yellow croaker
    Zhang, Wenjing
    Li, Wanbo
    Liu, Guijia
    Gu, Linlin
    Ye, Kun
    Zhang, Yongjie
    Li, Wei
    Jiang, Dan
    Wang, Zhiyong
    Fang, Ming
    AQUACULTURE, 2021, 534
  • [39] Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data
    Thomas A. Delomas
    Stuart C. Willis
    BMC Bioinformatics, 24
  • [40] Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data
    Delomas, Thomas A.
    Willis, Stuart C.
    BMC BIOINFORMATICS, 2023, 24 (01)