Hybrid Sequencing of Full-Length cDNA Transcripts of the Medicinal Plant Scutellaria baicalensis

Cited by: 11
Authors
Gao, Ting [1]
Xu, Zhichao [2,3]
Song, Xiaojun [1]
Huang, Kai [4]
Li, Ying [2,3]
Wei, Jianhe [2,3]
Zhu, Xunzhi [5,6]
Ren, Hongwei [1]
Sun, Chao [2,3]
Affiliations
[1] Qingdao Agr Univ, Coll Life Sci, Key Lab Plant Biotechnol Univ Shandong Prov, Qingdao 266109, Shandong, Peoples R China
[2] Peking Union Med Coll, Key Lab Chinese Med Resources Conservat, State Adm Tradit Chinese Med Peoples Republ China, Inst Med Plant Dev, Beijing 100193, Peoples R China
[3] Chinese Acad Med Sci, Beijing 100193, Peoples R China
[4] Beijing IgeneCode Biotech Co Ltd, Changping Dist Xisanqi Ctr Olymp Century, Beijing 100096, Peoples R China
[5] Inst Bot, Nanjing 210014, Jiangsu, Peoples R China
[6] Chinese Acad Sci, Nanjing 210014, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Scutellaria baicalensis; single-molecule real-time sequence; flavonoid; key genes; alternative splicing; LONG NONCODING RNAS; CINNAMATE 4-HYDROXYLASE; OVEREXPRESSION; BIOSYNTHESIS; GENES; ACCUMULATION; MECHANISMS; COMPLEXITY; LANDSCAPE
DOI
10.3390/ijms20184426
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Scutellaria baicalensis is a well-known medicinal plant that produces biologically active flavonoids, such as baicalin, baicalein, and wogonin. Pharmacological studies have shown that these compounds have anti-inflammatory, anti-bacterial, and anti-cancer activities. Therefore, it is of great significance to investigate the genetic information of S. baicalensis, particularly the genes related to the biosynthetic pathways of these compounds. Here, we constructed the full-length transcriptome of S. baicalensis using a hybrid sequencing strategy and acquired 338,136 full-length sequences, accounting for 93.3% of the total reads. After the removal of redundancy and correction with Illumina short reads, 75,785 nonredundant transcripts were generated, among which approximately 98% were annotated with significant hits in the protein databases, and 11,135 sequences were classified as lncRNAs. Differentially expressed gene (DEG) analysis showed that most of the genes related to flavonoid biosynthesis were highly expressed in the roots, consistent with previous reports that the flavonoids were mainly synthesized and accumulated in the roots of S. baicalensis. By constructing unique transcription models, a total of 44,071 alternative splicing (AS) events were identified, with intron retention (IR) accounting for the highest proportion (44.5%). A total of 94 AS events were present in five key genes related to flavonoid biosynthesis, suggesting that AS may play important roles in the regulation of flavonoid biosynthesis in S. baicalensis. This study provided a large number of highly accurate full-length transcripts, which represents a valuable genetic resource for further research of the molecular biology of S. baicalensis, such as the development, breeding, and biosynthesis of active ingredients.
Pages: 16