Long-read sequencing and de novo genome assembly of Ammopiptanthus nanus, a desert shrub

被引:35
|
作者
Gao, Fei [1 ]
Wang, Xue [1 ]
Li, Xuming [2 ]
Xu, Mingyue [2 ]
Li, Huayun [3 ]
Abla, Merhaba [1 ]
Sun, Huigai [1 ]
Wei, Shanjun [1 ]
Feng, Jinchao [1 ]
Zhou, Yijun [1 ]
机构
[1] Minzu Univ China, Coll Life & Evironm Sci, 27 Zhongguancun South St, Beijing 100081, Peoples R China
[2] Biomarker Technol Corp, Floor 8,Shunjie Bldg,12 Fuqian Rd, Beijing 101300, Peoples R China
[3] Annoroad Genom, Beijing Econ Technol Dev Area, Bldg B1,Yard 88,Kechuang Six Rd, Beijing 100176, Peoples R China
来源
GIGASCIENCE | 2018年 / 7卷 / 07期
基金
中国国家自然科学基金;
关键词
Ammopiptanthus nanus; PacBio sequencing; genome assembly; genome annotation; PROVIDES INSIGHTS; PREDICTION; IDENTIFICATION; CLASSIFICATION; ANNOTATION; ALIGNMENT; TOOL;
D O I
10.1093/gigascience/giy074
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Ammopiptanthus nanus is a rare broad-leaved shrub that is found in the desert and arid regions of Central Asia. This plant species exhibits extremely high tolerance to drought and freezing and has been used in abiotic tolerance research in plants. As a relic of the tertiary period, A. nanus is of great significance to plant biogeographic research in the ancient Mediterranean region. Here, we report a draft genome assembly using the Pacific Biosciences (PacBio) platform and gene annotation for A. nanus. Findings: A total of 64.72 Gb of raw PacBio sequel reads were generated from four 20-kb libraries. After filtering, 64.53 Gb of clean reads were obtained, giving 72.59x coverage depth. Assembly using Canu gave an assembly length of 823.74 Mb, with a contig N50 of 2.76 Mb. The final size of the assembled A. nanus genome was close to the 889 Mb estimated by k-mer analysis. The gene annotation completeness was evaluated using Benchmarking Universal Single-Copy Orthologs; 1,327 of the 1,440 conserved genes (92.15%) could be found in the A. nanus assembly. Genome annotation revealed that 74.08% of the A. nanus genome is composed of repetitive elements and 53.44% is composed of long terminal repeat elements. We predicted 37,188 protein-coding genes, of which 96.53% were functionally annotated. Conclusions: The genomic sequences of A. nanus could be a valuable source for comparative genomic analysis in the legume family and will be useful for understanding the phylogenetic relationships of the Thermopsideae and the evolutionary response of plant species to the Qinghai Tibetan Plateau uplift.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] De Novo Structural Variations of Escherichia coli Detected by Nanopore Long-Read Sequencing
    Zhou, Xia
    Pan, Jiao
    Wang, Yaohai
    Lynch, Michael
    Long, Hongan
    Zhang, Yu
    GENOME BIOLOGY AND EVOLUTION, 2023, 15 (06):
  • [42] Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing
    Cook, David E.
    Valle-Inclan, Jose Espejo
    Pajoro, Alice
    Rovenich, Hanna
    Thomma, Bart P. H. J.
    Faino, Luigi
    PLANT PHYSIOLOGY, 2019, 179 (01) : 38 - 54
  • [43] Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations
    Cosma, Bianca-Maria
    Zade, Ramin Shirali Hossein
    Jordan, Erin Noel
    van Lent, Paul
    Peng, Chengyao
    Pillay, Stephanie
    Abeel, Thomas
    GIGASCIENCE, 2023, 12
  • [44] Long-read sequencing and genome assembly of natural history collection samples and challenging specimens
    Bein, Bernhard
    Chrysostomakis, Ioannis
    Arantes, Larissa S.
    Brown, Tom
    Gerheim, Charlotte
    Schell, Tilman
    Schneider, Clement
    Leushkin, Evgeny
    Chen, Zeyuan
    Sigwart, Julia
    Gonzalez, Vanessa
    Wong, Nur Leena W. S.
    Santos, Fabricio R.
    Blom, Mozes P. K.
    Mayer, Frieder
    Mazzoni, Camila J.
    Boehne, Astrid
    Winkler, Sylke
    Greve, Carola
    Hiller, Michael
    GENOME BIOLOGY, 2025, 26 (01):
  • [45] UNRAVELING THE ROLE OF DE NOVO STRUCTURAL VARIANTS IN SCHIZOPHRENIA THROUGH COMPREHENSIVE WHOLE GENOME SEQUENCING WITH LONG-READ AND SHORT-READ TECHNOLOGIES
    Zhang, Yamin
    Li, Tong
    Yang, Shaozhong
    Xie, Zhi
    Li, Tao
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2024, 87 : 13 - 14
  • [46] Long-read human genome sequencing and its applications
    Logsdon, Glennis A.
    Vollger, Mitchell R.
    Eichler, Evan E.
    NATURE REVIEWS GENETICS, 2020, 21 (10) : 597 - 614
  • [47] Advancements in long-read genome sequencing technologies and algorithms
    Espinosa, Elena
    Bautista, Rocio
    Larrosa, Rafael
    Plata, Oscar
    GENOMICS, 2024, 116 (03)
  • [48] Long-read human genome sequencing and its applications
    Glennis A. Logsdon
    Mitchell R. Vollger
    Evan E. Eichler
    Nature Reviews Genetics, 2020, 21 : 597 - 614
  • [49] HAT: de novo variant calling for highly accurate short-read and long-read sequencing data
    Ng, Jeffrey K.
    Turner, Tychele N.
    BIOINFORMATICS, 2024, 40 (01)
  • [50] De Novo Assembly of the Dirofilaria immitis Genome by Long-Read Nanopore-Based Sequencing Technology on an Adult Worm from a Canine Cardiopulmonary Dirofilariosis Case
    Gomes-de-Sa, Sonia
    Barradas, Patricia
    Queiros-Reis, Luis
    Matas, Isabel M.
    Amorim, Irina
    Cardoso, Luis
    Munoz-Merida, Antonio
    Mesquita, Joao R.
    ANIMALS, 2022, 12 (11):