Efficient phasing and imputation of low-coverage sequencing data using large reference panels

被引:174
|
作者
Rubinacci, Simone [1 ,2 ]
Ribeiro, Diogo M. [1 ,2 ]
Hofmeister, Robin J. [1 ,2 ]
Delaneau, Olivier [1 ,2 ]
机构
[1] Univ Lausanne, Dept Computat Biol, Lausanne, Switzerland
[2] Univ Lausanne, Swiss Inst Bioinformat, Lausanne, Switzerland
关键词
LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; GENOME; ASSOCIATION; DISCOVERY; FRAMEWORK; RESOURCE; SNP;
D O I
10.1038/s41588-020-00756-0
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
GLIMPSE is a new method for haplotype phasing and genotype imputation of low-coverage sequencing datasets from large reference panels. GLIMPSE shows remarkable performance across different coverages and human populations. Low-coverage whole-genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined because current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. GLIMPSE achieves imputation of a genome for less than US$1 in computational cost, considerably outperforming other methods and improving imputation accuracy over the full allele frequency range. As a proof of concept, we show that 1x coverage enables effective gene expression association studies and outperforms dense SNP arrays in rare variant burden tests. Overall, this study illustrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies.
引用
收藏
页码:120 / 126
页数:22
相关论文
共 50 条
  • [1] Efficient phasing and imputation of low-coverage sequencing data using large reference panels
    Rubinacci, S.
    Ribeiro, D.
    Hofmeister, R.
    Delaneau, O.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 658 - 659
  • [2] Efficient phasing and imputation of low-coverage sequencing data using large reference panels
    Simone Rubinacci
    Diogo M. Ribeiro
    Robin J. Hofmeister
    Olivier Delaneau
    Nature Genetics, 2021, 53 : 120 - 126
  • [3] Publisher Correction: Efficient phasing and imputation of low-coverage sequencing data using large reference panels
    Simone Rubinacci
    Diogo M. Ribeiro
    Robin J. Hofmeister
    Olivier Delaneau
    Nature Genetics, 2021, 53 : 412 - 412
  • [4] Efficient phasing and imputation of low-coverage sequencing data using large reference panels (vol 53, pg 120, 2021)
    Rubinacci, Simone
    Ribeiro, Diogo M.
    Hofmeister, Robin J.
    Delaneau, Olivier
    NATURE GENETICS, 2021, 53 (03) : 412 - 412
  • [5] Fast Imputation of Low Coverage Sequencing Data from Very Large Reference Panels
    Rubinacci, S.
    Ribeiro, D.
    Hofmeister, R.
    Delaneau, O.
    HUMAN HEREDITY, 2020, 84 (4-5) : 221 - 222
  • [6] Low-Coverage Sequencing Imputation from millions of reference samples
    Rubinacci, Simone
    Delaneau, Olivier
    HUMAN HEREDITY, 2022, VOL. (SUPPL 1) : 4 - 5
  • [7] Fast imputation using medium or low-coverage sequence data
    Paul M. VanRaden
    Chuanyu Sun
    Jeffrey R. O’Connell
    BMC Genetics, 16
  • [8] Efficient Imputation of Missing Markers in Low-Coverage Genotyping-by-Sequencing Data from Multiparental Crosses
    Huang, B. Emma
    Raghavan, Chitra
    Mauleon, Ramil
    Broman, Karl W.
    Leung, Hei
    GENETICS, 2014, 197 (01) : 401 - 404
  • [9] Fast and Accurate 1000 Genomes Imputation Using Summary Statistics or Low-coverage Sequencing Data
    Pasaniuc, Bogdan
    Zaitlen, Noah
    Bhatia, Gaurav
    Gusev, Alexander
    Patterson, Nick
    Price, Alkes L.
    GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 765 - 765
  • [10] Fast imputation using medium or low-coverage sequence data
    VanRaden, Paul M.
    Sun, Chuanyu
    O'Connell, Jeffrey R.
    BMC GENETICS, 2015, 16