Long-read sequencing and de novo genome assembly of Ammopiptanthus nanus, a desert shrub

被引:35
|
作者
Gao, Fei [1 ]
Wang, Xue [1 ]
Li, Xuming [2 ]
Xu, Mingyue [2 ]
Li, Huayun [3 ]
Abla, Merhaba [1 ]
Sun, Huigai [1 ]
Wei, Shanjun [1 ]
Feng, Jinchao [1 ]
Zhou, Yijun [1 ]
机构
[1] Minzu Univ China, Coll Life & Evironm Sci, 27 Zhongguancun South St, Beijing 100081, Peoples R China
[2] Biomarker Technol Corp, Floor 8,Shunjie Bldg,12 Fuqian Rd, Beijing 101300, Peoples R China
[3] Annoroad Genom, Beijing Econ Technol Dev Area, Bldg B1,Yard 88,Kechuang Six Rd, Beijing 100176, Peoples R China
来源
GIGASCIENCE | 2018年 / 7卷 / 07期
基金
中国国家自然科学基金;
关键词
Ammopiptanthus nanus; PacBio sequencing; genome assembly; genome annotation; PROVIDES INSIGHTS; PREDICTION; IDENTIFICATION; CLASSIFICATION; ANNOTATION; ALIGNMENT; TOOL;
D O I
10.1093/gigascience/giy074
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Ammopiptanthus nanus is a rare broad-leaved shrub that is found in the desert and arid regions of Central Asia. This plant species exhibits extremely high tolerance to drought and freezing and has been used in abiotic tolerance research in plants. As a relic of the tertiary period, A. nanus is of great significance to plant biogeographic research in the ancient Mediterranean region. Here, we report a draft genome assembly using the Pacific Biosciences (PacBio) platform and gene annotation for A. nanus. Findings: A total of 64.72 Gb of raw PacBio sequel reads were generated from four 20-kb libraries. After filtering, 64.53 Gb of clean reads were obtained, giving 72.59x coverage depth. Assembly using Canu gave an assembly length of 823.74 Mb, with a contig N50 of 2.76 Mb. The final size of the assembled A. nanus genome was close to the 889 Mb estimated by k-mer analysis. The gene annotation completeness was evaluated using Benchmarking Universal Single-Copy Orthologs; 1,327 of the 1,440 conserved genes (92.15%) could be found in the A. nanus assembly. Genome annotation revealed that 74.08% of the A. nanus genome is composed of repetitive elements and 53.44% is composed of long terminal repeat elements. We predicted 37,188 protein-coding genes, of which 96.53% were functionally annotated. Conclusions: The genomic sequences of A. nanus could be a valuable source for comparative genomic analysis in the legume family and will be useful for understanding the phylogenetic relationships of the Thermopsideae and the evolutionary response of plant species to the Qinghai Tibetan Plateau uplift.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Long-read sequencing and de novo assembly of a Chinese genome
    Shi, Lingling
    Guo, Yunfei
    Dong, Chengliang
    Huddleston, John
    Yang, Hui
    Han, Xiaolu
    Fu, Aisi
    Li, Quan
    Li, Na
    Gong, Siyi
    Lintner, Katherine E.
    Ding, Qiong
    Wang, Zou
    Hu, Jiang
    Wang, Depeng
    Wang, Feng
    Wang, Lin
    Lyon, Gholson J.
    Guan, Yongtao
    Shen, Yufeng
    Evgrafov, Oleg V.
    Knowles, James A.
    Thibaud-Nissen, Francoise
    Schneider, Valerie
    Yu, Chack-Yung
    Zhou, Libing
    Eichler, Evan E.
    So, Kwok-Fai
    Wang, Kai
    NATURE COMMUNICATIONS, 2016, 7
  • [2] Long-read sequencing and de novo assembly of a Chinese genome
    Lingling Shi
    Yunfei Guo
    Chengliang Dong
    John Huddleston
    Hui Yang
    Xiaolu Han
    Aisi Fu
    Quan Li
    Na Li
    Siyi Gong
    Katherine E. Lintner
    Qiong Ding
    Zou Wang
    Jiang Hu
    Depeng Wang
    Feng Wang
    Lin Wang
    Gholson J. Lyon
    Yongtao Guan
    Yufeng Shen
    Oleg V. Evgrafov
    James A. Knowles
    Francoise Thibaud-Nissen
    Valerie Schneider
    Chack-Yung Yu
    Libing Zhou
    Evan E. Eichler
    Kwok-Fai So
    Kai Wang
    Nature Communications, 7
  • [3] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bai, Bing
    Wang, Yi
    Zhu, Ran
    Zhang, Yaolei
    Wang, Hong
    Fan, Guangyi
    Liu, Xin
    Shi, Hong
    Niu, Yuyu
    Ji, Weizhi
    JOURNAL OF GENETICS AND GENOMICS, 2022, 49 (10) : 975 - 978
  • [4] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bing Bai
    Yi Wang
    Ran Zhu
    Yaolei Zhang
    Hong Wang
    Guangyi Fan
    Xin Liu
    Hong Shi
    Yuyu Niu
    Weizhi Ji
    JournalofGeneticsandGenomics, 2022, 49 (10) : 975 - 978
  • [5] Long-read Sequencing and de novo Genome Assembly of Three Aspergillus fumigatus Genomes
    Samuel J. Hemmings
    Johanna L. Rhodes
    Matthew C. Fisher
    Mycopathologia, 2023, 188 : 409 - 412
  • [6] Long-read sequencing and de novo genome assembly of marine medaka (Oryzias melastigma)
    Pingping Liang
    Hafiz Sohaib Ahmed Saqib
    Xiaomin Ni
    Yingjia Shen
    BMC Genomics, 21
  • [7] Long-read Sequencing and de novo Genome Assembly of Three Aspergillus fumigatus Genomes
    Hemmings, Samuel J.
    Rhodes, Johanna L.
    Fisher, Matthew C.
    MYCOPATHOLOGIA, 2023, 188 (04) : 409 - 412
  • [8] Long-read sequencing and de novo genome assembly of marine medaka (Oryzias melastigma)
    Liang, Pingping
    Saqib, Hafiz Sohaib Ahmed
    Ni, Xiaomin
    Shen, Yingjia
    BMC GENOMICS, 2020, 21 (01)
  • [9] Long-read sequencing and de novo assembly of the Luffa cylindrica (L.) Roem. genome
    Zhang, Tao
    Ren, Xuyan
    Zhang, Zhao
    Ming, Yao
    Yang, Zhe
    Hu, Jianbin
    Li, Shengli
    Wang, Yong
    Sun, Shouru
    Sun, Kaile
    Piao, Fengzhi
    Sun, Zhiqiang
    MOLECULAR ECOLOGY RESOURCES, 2020, 20 (02) : 511 - 519
  • [10] Complete de novo assembly of Wolbachia endosymbiont of Drosophila willistoni using long-read genome sequencing
    Jacobs, Jodie
    Nakamoto, Anne
    Mastoras, Mira
    Loucks, Hailey
    Mirchandani, Cade
    Karim, Lily
    Penunuri, Gabriel
    Wanket, Ciara
    Russell, Shelbi L.
    SCIENTIFIC REPORTS, 2024, 14 (01):