Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer

被引:45
|
作者
Fujimoto, Akihiro [1 ,2 ]
Wong, Jing Hao [1 ,2 ]
Yoshii, Yukiko [2 ]
Akiyama, Shintaro [3 ,4 ]
Tanaka, Azusa [1 ,2 ]
Yagi, Hitomi [2 ]
Shigemizu, Daichi [3 ,4 ]
Nakagawa, Hidewaki [3 ,4 ]
Mizokami, Masashi [5 ]
Shimada, Mihoko [1 ]
机构
[1] Univ Tokyo, Grad Sch Med, Dept Human Genet, Tokyo, Japan
[2] Kyoto Univ, Dept Drug Discovery Med, Grad Sch Med, Kyoto, Japan
[3] Natl Ctr Geriatr & Gerontol, Med Genome Ctr, Obu, Japan
[4] RIKEN Ctr Integrat Med Sci, Lab Canc Genom, Yokohama, Kanagawa, Japan
[5] Natl Ctr Global Hlth & Med, Genome Med Sci Project, Tokyo, Japan
关键词
Long reads; Origin of structural variations (SVs); Germline SVs; Somatic SVs; ALIGNMENT; IDENTIFICATION; LANDSCAPE; IMPACT;
D O I
10.1186/s13073-021-00883-1
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background Identification of germline variation and somatic mutations is a major issue in human genetics. However, due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variation and somatic mutations is far from complete. Methods In the present study, we performed whole-genome sequencing using long-read sequencing technology (Oxford Nanopore) for 11 Japanese liver cancers and matched normal samples which were previously sequenced for the International Cancer Genome Consortium (ICGC). We constructed an analysis pipeline for the long-read data and identified germline and somatic structural variations (SVs). Results In polymorphic germline SVs, our analysis identified 8004 insertions, 6389 deletions, 27 inversions, and 32 intra-chromosomal translocations. By comparing to the chimpanzee genome, we correctly inferred events that caused insertions and deletions and found that most insertions were caused by transposons and Alu is the most predominant source, while other types of insertions, such as tandem duplications and processed pseudogenes, are rare. We inferred mechanisms of deletion generations and found that most non-allelic homolog recombination (NAHR) events were caused by recombination errors in SINEs. Analysis of somatic mutations in liver cancers showed that long reads could detect larger numbers of SVs than a previous short-read study and that mechanisms of cancer SV generation were different from that of germline deletions. Conclusions Our analysis provides a comprehensive catalog of polymorphic and somatic SVs, as well as their possible causes. Our software are available at and .
引用
收藏
页数:15
相关论文
共 44 条
  • [21] Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
    Amin Ghareyazi
    Amirreza Kazemi
    Kimia Hamidieh
    Hamed Dashti
    Maedeh Sadat Tahaei
    Hamid R. Rabiee
    Hamid Alinejad-Rokny
    Iman Dehzangi
    BMC Bioinformatics, 23
  • [22] Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
    Ghareyazi, Amin
    Kazemi, Amirreza
    Hamidieh, Kimia
    Dashti, Hamed
    Tahaei, Maedeh Sadat
    Rabiee, Hamid R.
    Alinejad-Rokny, Hamid
    Dehzangi, Iman
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [23] Whole-genome re-sequencing reveals genetic structure and selection signals of different populations in Megalobrama amblycephala
    Wang, Shi-Long
    Luo, Li-Fei
    Yu, Yue
    Fu, Yu-Ye
    Gao, Ze-Xia
    AQUACULTURE, 2025, 595
  • [24] Correction: Corrigendum: Whole-genome sequencing of bladder cancers reveals somatic CDKN1A mutations and clinicopathological associations with mutation burden
    J. -B. Cazier
    S. R. Rao
    C. M. McLean
    A. K. Walker
    B. J. Wright
    E. E. M. Jaeger
    C. Kartsonaki
    L. Marsden
    C. Yau
    C. Camps
    P. Kaisaki
    J. Taylor
    J. W. Catto
    I. P. M. Tomlinson
    A. E. Kiltie
    F. C. Hamdy
    Nature Communications, 5
  • [25] Correction: Corrigendum: Whole-genome sequencing of bladder cancers reveals somatic CDKN1A mutations and clinicopathological associations with mutation burden
    J. -B. Cazier
    S. R. Rao
    C. M. McLean
    A. K. Walker
    B. J. Wright
    E. E. M. Jaeger
    C. Kartsonaki
    L. Marsden
    C. Yau
    C. Camps
    P. Kaisaki
    J. Taylor
    J. W. Catto
    I. P. M. Tomlinson
    A. E. Kiltie
    F. C. Hamdy
    Nature Communications, 5
  • [26] Identifying Phased Mutations and Complex Rearrangements in Human Prostate Cancer Cell Lines through Linked-Read Whole-Genome Sequencing
    Pham, Minh -Tam
    Gupta, Anuj
    Gupta, Harshath
    Vaghasia, Ajay
    Skaist, Alyza
    Garrison, McKinzie A.
    Coulter, Jonathan B.
    Haffner, Michael C.
    Zheng, S. Lilly
    Xu, Jianfeng
    Shields, Christina DeStefano
    Isaacs, William B.
    Wheelan, Sarah J.
    Nelson, William G.
    Yegnasubramanian, Srinivasan
    MOLECULAR CANCER RESEARCH, 2022, 20 (07) : 1013 - 1020
  • [27] Identification of novel somatic mutations and structural variations in tongue cancer, matched leukoplakia and normal tissue by genome sequencing
    Kuriakose, M. Abraham
    Panda, B.
    Pattnaik, S.
    Choudhury, R. Pait
    Suresh, A.
    Kekatpure, V.
    Trivedi, N.
    ORAL ONCOLOGY, 2011, 47 : S13 - S13
  • [28] Genome-wide detection of structural variation in some sheep breeds using whole-genome long-read sequencing data
    Qiao, Guoyan
    Xu, Pan
    Guo, Tingting
    He, Xue
    Yue, Yaojing
    Yang, Bohui
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2024, 141 (04) : 403 - 414
  • [29] Tumor profiling from whole-genome and whole transcriptome sequencing to uncover gene fusions and structural variations in clinically relevant cancer genes
    Gylfe, Alexandra E.
    Shinbrot, Eve
    Kakaradov, Boyko
    Delport, Wayne
    Lau, Cctrine K.
    Krishnan, Subha
    Perlina, Ally
    Natarajan, Thanemozhi G.
    Sainger, Rachana
    Pelak, Kimberly
    Choppa, Paul
    Cooc, Janine
    Biggs, William
    Smith, Heather L.
    Perkins, Brad
    Jones, Daniel G.
    Nehring, Ramlah B.
    Cohen, Ezra E. W.
    Bloom, Kenneth Joel
    JOURNAL OF CLINICAL ONCOLOGY, 2017, 35
  • [30] Deep whole-genome sequencing of multiple proband tissues and parental blood reveals the complex genetic etiology of congenital diaphragmatic hernias
    Bogenschutz, Eric L.
    Fox, Zac D.
    Farrell, Andrew
    Wynn, Julia
    Moore, Barry
    Yu, Lan
    Aspelund, Gudrun
    Marth, Gabor
    Yandell, Mark
    Shen, Yufeng
    Chung, Wendy K.
    Kardon, Gabrielle
    HUMAN GENETICS AND GENOMICS ADVANCES, 2020, 1 (01):