DeepCNPP: Deep Learning Architecture to Distinguish the Promoter of Human Long Non-Coding RNA Genes and Protein-Coding Genes

被引:3
|
作者
Alam, Tanvir [1 ]
Islam, Mohammad Tariqul [2 ]
Househ, Mowafa [1 ]
Belhaouari, Samir Brahim [1 ]
Kawsar, Ferdaus Ahmed [3 ]
机构
[1] Hamad Bin Khalifa Univ HBKU, Coll Sci & Engn, Informat & Comp Technol Div, Doha, Qatar
[2] Southern Connecticut State Univ, Dept Comp Sci, New Haven, CT USA
[3] East Tennessee State Univ, Dept Comp, Johnson City, TN USA
关键词
deep learning; convolution neural network; long non-coding RNA; promoter;
D O I
10.3233/SHTI190061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Promoter region of protein-coding genes are gradually being well understood, yet no comparable studies exist for the promoter of long non-coding RNA (lncRNA) genes which has emerged as a global potential regulator in multiple cellular process and different diseases for human. To understand the difference in the transcriptional regulation pattern of these genes, previously, we proposed a machine learning based model to classify the promoter of protein-coding genes and lncRNA genes. In this study, we are presenting DeepCNPP (deep coding non-coding promoter predictor), an improved model based on deep learning (DL) framework to classify the promoter of lncRNA genes and protein-coding genes. We used convolution neural network (CNN) based deep network to classify the promoter of these two broad categories of human genes. Our computational model, built upon the sequence information only, was able to classify these two groups of promoters from human at a rate of 83.34% accuracy and outperformed the existing model. Further analysis and interpretation of the output from DeepCNPP architecture will enable us to understand the difference in transcription regulatory pattern for these two groups of genes.
引用
收藏
页码:232 / 235
页数:4
相关论文
共 50 条
  • [1] Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes
    Alam, Tanvir
    Medvedeva, Yulia A.
    Jia, Hui
    Brown, James B.
    Lipovich, Leonard
    Bajic, Vladimir B.
    PLOS ONE, 2014, 9 (10):
  • [2] Coregulatory long non-coding RNA and protein-coding genes in serum starved cells
    Wang, Fan
    Liang, Rui
    Soibam, Benjamin
    Yang, Jin
    Liu, Yu
    BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS, 2019, 1862 (01): : 84 - 95
  • [3] FibroDB: Expression Analysis of Protein-Coding and Long Non-Coding RNA Genes in Fibrosis
    Ilieva, Mirolyuba
    Miller, Henry E.
    Agarwal, Arav
    Paulus, Gabriela K.
    Madsen, Jens Hedelund
    Bishop, Alexander J. R.
    Kauppinen, Sakari
    Uchida, Shizuka
    NON-CODING RNA, 2022, 8 (01)
  • [4] CrohnDB: A Web Database for Expression Profiling of Protein-Coding and Long Non-Coding RNA Genes in Crohn Disease
    Distefano, Rebecca
    Ilieva, Mirolyuba
    Madsen, Jens Hedelund
    Uchida, Shizuka
    COMPUTATION, 2023, 11 (06)
  • [5] Long non-coding RNA transcriptome of uncharacterized samples can be accurately imputed using protein-coding genes
    Nath, Aritro
    Geeleher, Paul
    Huang, R. Stephanie
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (02) : 637 - 648
  • [6] Are orphan genes protein-coding, prediction artifacts, or non-coding RNAs?
    Prabh, Neel
    Roedelsperger, Christian
    BMC BIOINFORMATICS, 2016, 17
  • [7] Non-coding transcript variants of protein-coding genes - what are they good for?
    Dhamija, Sonam
    Menon, Manoj B.
    RNA BIOLOGY, 2018, 15 (08) : 1025 - 1031
  • [8] Are orphan genes protein-coding, prediction artifacts, or non-coding RNAs?
    Neel Prabh
    Christian Rödelsperger
    BMC Bioinformatics, 17
  • [9] Long non-coding RNAs are Transcriptional Regulators of Contractile Protein-coding Genes in Skeletal Muscle
    Resnick, Jessica D.
    Gilbert, Carolyn A.
    Lowrey, Angela J.
    Callier, Matthew C.
    Pandorf, Clay E.
    FASEB JOURNAL, 2018, 32 (01):
  • [10] Atlas of Schistosoma mansoni long non-coding RNAs and their expression correlation to protein-coding genes
    Vasconcelos, Elton J. R.
    Mesel, Vinicius C.
    daSilva, Lucas F.
    Pires, David S.
    Lavezzo, Guilherme M.
    Pereira, Adriana S. A.
    Amaral, Murilo S.
    Verjovski-Almeida, Sergio
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,