Re-annotation of the Liriodendron chinense genome identifies novel genes and improves genome annotation quality

被引:3
|
作者
Wu, Hainan [1 ,2 ]
Hao, Ziyuan [1 ,2 ]
Tu, Zhonghua [1 ,2 ]
Zong, Yaxian [1 ,2 ]
Yang, Lichun [1 ,2 ]
Tong, Chunfa [1 ,2 ]
Li, Huogen [1 ,2 ]
机构
[1] Nanjing Forestry Univ, State Key Lab Tree Genet & Breeding, Nanjing, Peoples R China
[2] Nanjing Forestry Univ, Coinnovat Ctr Sustainable Forestry Southern China, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Liriodendron chinense; Genome annotation; PacBio Isoseq; Illumina RNA-seq; Novel genes; RNA-SEQ; TRANSCRIPTION FACTOR; DISEASE RESISTANCE; PROGRAM; PREDICTION;
D O I
10.1007/s11295-023-01605-x
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
Liriodendron chinense is a very popular tree used in landscaping because of its beautiful flowers and unique leaf shape. Although the L. chinense genome was released in 2018 based on PacBio long reads and genetic linkage map and provides a valuable resource for gene function discovery and transcriptome analyses of L. chinense. Yet, that version of the L. chinense genome relied on ab initio annotation and a small amount of transcriptome data, and lacks 5 ' UTRs and 3 ' UTRs and alternative splicing (AS) annotation; hence, it is imperative that an improved annotation be sought. Herein, we re-annotated the structure and function of genes across the entire L. chinense nuclear genome based on PacBio long reads and Illumina short reads retrieved from near-round organization types. This updated annotation of the L. chinense genome, Lchi2.1, includes a total of 42,831 gene models with a high proportion (92.8%) of complete BUSCOs. Among the Lchi2.1 annotation, 21,324 genes were modified, 15,770 novel genes were added, 14,851 genes were augmented with information on the 5 ' UTRs and 3 ' UTRs, and 6751 genes were found as alternatively spliced isoforms at the whole-genome scale. Additionally, we re-analyzed the transcriptome data across various stages of leaf shape development for L. chinense, founding many genes enriched to post-embryonic plant morphogenesis, plant organ formation, and anatomical structure formation involved in morphogenesis, which suggests these genes could be related to leaf morphological processes. Overall, the updated annotation of the L. chinense genome presented in this study will contribute to gene functional studies in L. chinense and its comparative genomic analysis with related plant species.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome sequence
    Ozan Gundogdu
    Stephen D Bentley
    Matt T Holden
    Julian Parkhill
    Nick Dorrell
    Brendan W Wren
    BMC Genomics, 8
  • [22] Improved chromosomal-level genome assembly and re-annotation of leopard coral grouper
    Han, Wentao
    Wu, Shaoxuan
    Ding, Hui
    Wang, Mingyi
    Wang, Mengya
    Bao, Zhenmin
    Wang, Bo
    Hu, Jingjie
    SCIENTIFIC DATA, 2023, 10 (01)
  • [23] Genome Wide Re-Annotation of Caldicellulosiruptor saccharolyticus with New Insights into Genes Involved in Biomass Degradation and Hydrogen Production
    Chowdhary, Nupoor
    Selvaraj, Ashok
    KrishnaKumaar, Lakshmi
    Kumar, Gopal Ramesh
    PLOS ONE, 2015, 10 (07):
  • [24] Improved chromosomal-level genome assembly and re-annotation of leopard coral grouper
    Wentao Han
    Shaoxuan Wu
    Hui Ding
    Mingyi Wang
    Mengya Wang
    Zhenmin Bao
    Bo Wang
    Jingjie Hu
    Scientific Data, 10
  • [25] Bacillus pumilus SAFR-032 Genome Revisited: Sequence Update and Re-Annotation
    Stepanov, Victor G.
    Tirumalai, Madhan R.
    Montazari, Saied
    Checinska, Aleksandra
    Venkateswaran, Kasthuri
    Fox, George E.
    PLOS ONE, 2016, 11 (06):
  • [26] Re-annotation of genome microbial CoDing-Sequences:: finding new genes and inaccurately annotated genes -: art. no. 5
    Bocs, S
    Danchin, A
    Médigue, C
    BMC BIOINFORMATICS, 2002, 3 (1)
  • [27] Genome (re-)annotation and open-source annotation pipelines
    Siezen, Roland J.
    van Hijum, Sacha A. F. T.
    MICROBIAL BIOTECHNOLOGY, 2010, 3 (04): : 362 - 369
  • [28] LongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse
    Wahl, MB
    Heinzmann, U
    Imai, K
    BIOINFORMATICS, 2005, 21 (08) : 1393 - 1400
  • [29] Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome and functional characterisation of selected genes involved in strain pathogenesis
    Gundogdu, O.
    Bentley, S. D.
    Holden, M. T.
    Parkhill, J.
    Wren, B. W.
    Dorrell, N.
    ZOONOSES AND PUBLIC HEALTH, 2007, 54 : 96 - 96
  • [30] Brassica rapa Genome 2.0: A Reference Upgrade through Sequence Re-assembly and Gene Re-annotation
    Cai, Chengcheng
    Wang, Xiaobo
    Liu, Bo
    Wu, Jian
    Liang, Jianli
    Cui, Yinan
    Cheng, Feng
    Wang, Xiaowu
    MOLECULAR PLANT, 2017, 10 (04) : 649 - 651