Long-read sequencing of the zebrafish genome reorganizes genomic architecture

被引:13
|
作者
Chernyavskaya, Yelena [1 ,2 ]
Zhang, Xiaofei [2 ,3 ]
Liu, Jinze [4 ]
Blackburn, Jessica [1 ,2 ]
机构
[1] Univ Kentucky, Dept Cellular & Mol Biochem, Lexington, KY 40536 USA
[2] Univ Kentucky, Markey Canc Ctr, Lexington, KY 40536 USA
[3] Univ Kentucky, Dept Comp Sci, Lexington, KY 40536 USA
[4] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA 23284 USA
基金
美国国家卫生研究院;
关键词
Nanopore; MinION; Danio rerio; Reference assembly; Transposon; TRANSPOSABLE ELEMENTS; DOMAINS; SYSTEM;
D O I
10.1186/s12864-022-08349-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Nanopore sequencing technology has revolutionized the field of genome biology with its ability to generate extra-long reads that can resolve regions of the genome that were previously inaccessible to short-read sequencing platforms. Over 50% of the zebrafish genome consists of difficult to map, highly repetitive, low complexity elements that pose inherent problems for short-read sequencers and assemblers. Results We used long-read nanopore sequencing to generate a de novo assembly of the zebrafish genome and compared our assembly to the current reference genome, GRCz11. The new assembly identified 1697 novel insertions and deletions over one kilobase in length and placed 106 previously unlocalized scaffolds. We also discovered additional sites of retrotransposon integration previously unreported in GRCz11 and observed the expression of these transposable elements in adult zebrafish under physiologic conditions, implying they have active mobility in the zebrafish genome and contribute to the ever-changing genomic landscape. Conclusions We used nanopore sequencing to improve upon and resolve the issues plaguing the current zebrafish reference assembly, GRCz11. Zebrafish is a prominent model of human disease, and our corrected assembly will be useful for studies relying on interspecies comparisons and precise linkage of genetic events to disease phenotypes.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Benchmarking of long-read sequencing, assemblers and polishers for yeast genome
    Zhang, Xue
    Liu, Chen-Guang
    Yang, Shi-Hui
    Wang, Xia
    Bai, Feng-Wu
    Wang, Zhuo
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [22] Interrogating the "unsequenceable" genomic trinucleotide repeat disorders by long-read sequencing
    Liu, Qian
    Zhang, Peng
    Wang, Depeng
    Gu, Weihong
    Wang, Kai
    GENOME MEDICINE, 2017, 9
  • [23] Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing
    Qian Liu
    Peng Zhang
    Depeng Wang
    Weihong Gu
    Kai Wang
    Genome Medicine, 9
  • [24] Investigating the mitochondrial genomic landscape of Arabidopsis thaliana by long-read sequencing
    Masutani, Bansho
    Arimura, Shin-ichi
    Morishita, Shinichi
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (01)
  • [25] Long-read sequencing uncovers the adaptive topography of a carnivorous plant genome
    Lan, Tianying
    Renner, Tanya
    Ibarra-Laclette, Enrique
    Farr, Kimberly M.
    Chang, Tien-Hao
    Alan Cervantes-Perez, Sergio
    Zheng, Chunfang
    Sankoff, David
    Tang, Haibao
    Purbojati, Rikky W.
    Putra, Alexander
    Drautz-Moses, Daniela I.
    Schuster, Stephan C.
    Herrera-Estrella, Luis
    Albert, Victor A.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (22) : E4435 - E4441
  • [26] Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
    Yang, Shuaibing
    Zhao, Qianqian
    Tang, Lihua
    Chen, Zejia
    Wu, Zhaoting
    Li, Kaixin
    Lin, Ruoru
    Chen, Yang
    Ou, Danlin
    Zhou, Li
    Xu, Jianzhen
    Qin, Qingsong
    FRONTIERS IN GENETICS, 2022, 12
  • [27] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bai, Bing
    Wang, Yi
    Zhu, Ran
    Zhang, Yaolei
    Wang, Hong
    Fan, Guangyi
    Liu, Xin
    Shi, Hong
    Niu, Yuyu
    Ji, Weizhi
    JOURNAL OF GENETICS AND GENOMICS, 2022, 49 (10) : 975 - 978
  • [28] Long-read whole-genome sequencing for the genetic diagnosis of dystrophinopathies
    Xie, Zhiying
    Sun, Chengyue
    Zhang, Siwen
    Liu, Yilin
    Yu, Meng
    Zheng, Yiming
    Meng, Lingchao
    Acharya, Anushree
    Cornejo-Sanchez, Diana M.
    Wang, Gao
    Zhang, Wei
    Schrauwen, Isabelle
    Leal, Suzanne M.
    Wang, Zhaoxia
    Yuan, Yun
    ANNALS OF CLINICAL AND TRANSLATIONAL NEUROLOGY, 2020, 7 (10): : 2041 - 2046
  • [29] Long-read genome sequencing informs the molecular etiology of imprinting disorders
    Dixon, Katherine
    Shen, Yaoqing
    Chin, Hui-Lin
    Gazzaz, Nour
    Huynh, Stephanie
    Chan, Simon
    Zhang, Cathy
    Culibrk, Luka
    O'Neill, Kieran
    Mungall, Karen
    Mungall, Andrew
    Moore, Richard
    Gibson, William
    Chanoine, Jean-Pierre
    Boerkoel, Cornelius
    Jones, Steven
    GENETICS IN MEDICINE, 2022, 24 (03) : S214 - S215
  • [30] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bing Bai
    Yi Wang
    Ran Zhu
    Yaolei Zhang
    Hong Wang
    Guangyi Fan
    Xin Liu
    Hong Shi
    Yuyu Niu
    Weizhi Ji
    JournalofGeneticsandGenomics, 2022, 49 (10) : 975 - 978