Pangenome graphs improve the analysis of structural variants in rare genetic diseases

Cited by: 7
Authors
Groza, Cristian [1]
Schwendinger-Schreck, Carl [2]
Cheung, Warren A. [2]
Farrow, Emily G. [2]
Thiffault, Isabelle [2]
Lake, Juniper [3]
Rizzo, William B. [4]
Evrony, Gilad [5]
Curran, Tom [6]
Bourque, Guillaume [7, 8, 9, 10]
Pastinen, Tomi [2]
Affiliations
[1] McGill Univ, Quantitat Life Sci, Montreal, PQ, Canada
[2] Childrens Mercy Res Inst, Genom Med Ctr, Kansas City, MO 64108 USA
[3] Pacific Biosci, Menlo Pk, CA USA
[4] Nebraska Med Ctr, Dept Pediat, Child Hlth Res Inst, Omaha, NE USA
[5] NYU, Grossman Sch Med, Ctr Human Genet & Genom, Dept Pediat Neurosci & Physiol, New York, NY USA
[6] Childrens Mercy Res Inst, Kansas City, MO USA
[7] McGill Univ, Canadian Ctr Computat Genom, Montreal, PQ, Canada
[8] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
[9] Kyoto Univ, Inst Adv Study Human Biol WPI ASHBi, Kyoto, Japan
[10] McGill Univ, Victor Phillip Dahdaleh Inst Genom Med, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
DOI
10.1038/s41467-024-44980-2
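Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes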
07 ; 0710 ; 09 ;
Abstract
Rare DNA alterations that cause heritable diseases are only partially resolvable by clinical next-generation sequencing due to the difficulty of detecting structural variation (SV) in all genomic contexts. Long-read, high fidelity genome sequencing (HiFi-GS) detects SVs with increased sensitivity and enables assembling personal and graph genomes. We leverage standard reference genomes, public assemblies (n = 94) and a large collection of HiFi-GS data from a rare disease program (Genomic Answers for Kids, GA4K, n = 574 assemblies) to build a graph genome representing a unified SV callset in GA4K, identify common variation and prioritize SVs that are more likely to cause genetic disease (MAF < 0.01). Using graphs, we obtain a higher level of reproducibility than the standard reference approach. We observe over 200,000 SV alleles unique to GA4K, including nearly 1000 rare variants that impact coding sequence. With improved specificity for rare SVs, we isolate 30 candidate SVs in phenotypically prioritized genes, including known disease SVs. We isolate a novel diagnostic SV in KMT2E, demonstrating use of personal assemblies coupled with pangenome graphs for rare disease genomics. The community may interrogate our pangenome with additional assemblies to discover new SVs within the allele frequency spectrum relevant to genetic diseases.
Pages: 12
Related papers
50 in total
  • [1] Pangenome graphs improve the analysis of structural variants in rare genetic diseases
    Cristian Groza
    Carl Schwendinger-Schreck
    Warren A. Cheung
    Emily G. Farrow
    Isabelle Thiffault
    Juniper Lake
    William B. Rizzo
    Gilad Evrony
    Tom Curran
    Guillaume Bourque
    Tomi Pastinen
    Nature Communications, 15
  • [2] Genotyping structural variants in pangenome graphs using the vg toolkit
    Hickey, Glenn
    Heller, David
    Monlong, Jean
    Sibbesen, Jonas A.
    Siren, Jouni
    Eizenga, Jordan
    Dawson, Eric T.
    Garrison, Erik
    Novak, Adam M.
    Paten, Benedict
    GENOME BIOLOGY, 2020, 21 (01)
  • [4] Improved Detection of Rare Genetic Variants for Diseases
    Zhang, Lei
    Pei, Yu-Fang
    Li, Jian
    Papasian, Christopher J.
    Deng, Hong-Wen
    PLOS ONE, 2010, 5 (11)
  • [5] Associating rare genetic variants with human diseases
    Zhang, Qunyuan
    FRONTIERS IN GENETICS, 2015, 6
  • [6] Pangenomics in crop improvement-from coding structural variations to finding regulatory variants with pangenome graphs
    Zanini, Silvia F.
    Bayer, Philipp E.
    Wells, Rachel
    Snowdon, Rod J.
    Batley, Jacqueline
    Varshney, Rajeev K.
    Nguyen, Henry T.
    Edwards, David
    Golicz, Agnieszka A.
    PLANT GENOME, 2022, 15 (01)
  • [7] Detecting Association with Rare Genetic Variants in Common Diseases
    Li, Yali
    Feng, Tao
    Elston, Robert C.
    Zhu, Xiaofeng
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 754 - 754
  • [9] Emerging genetic complexity and rare genetic variants in neurodegenerative brain diseases
    Perrone, Federica
    Cacace, Rita
    van der Zee, Julie
    Van Broeckhoven, Christine
    GENOME MEDICINE, 2021, 13 (01)
  • [10] Identification of rare causal genetic variants in invasive pneumococcal diseases by exome analysis in children
    Gelin, Morgane
    Limou, Sophie
    Durand, Axelle
    Rousseau, Olivia
    Gourraud, Pierre-Antoine
    Gras-Leguen, Christele
    Lorton, Fleur
    Toubiana, Julie
    Launay, Elise
    Vince, Nicolas
    HLA, 2022, 99 (05) : 540 - 541