DeepCNPP: Deep Learning Architecture to Distinguish the Promoter of Human Long Non-Coding RNA Genes and Protein-Coding Genes

被引:3
|
作者
Alam, Tanvir [1 ]
Islam, Mohammad Tariqul [2 ]
Househ, Mowafa [1 ]
Belhaouari, Samir Brahim [1 ]
Kawsar, Ferdaus Ahmed [3 ]
机构
[1] Hamad Bin Khalifa Univ HBKU, Coll Sci & Engn, Informat & Comp Technol Div, Doha, Qatar
[2] Southern Connecticut State Univ, Dept Comp Sci, New Haven, CT USA
[3] East Tennessee State Univ, Dept Comp, Johnson City, TN USA
关键词
deep learning; convolution neural network; long non-coding RNA; promoter;
D O I
10.3233/SHTI190061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Promoter region of protein-coding genes are gradually being well understood, yet no comparable studies exist for the promoter of long non-coding RNA (lncRNA) genes which has emerged as a global potential regulator in multiple cellular process and different diseases for human. To understand the difference in the transcriptional regulation pattern of these genes, previously, we proposed a machine learning based model to classify the promoter of protein-coding genes and lncRNA genes. In this study, we are presenting DeepCNPP (deep coding non-coding promoter predictor), an improved model based on deep learning (DL) framework to classify the promoter of lncRNA genes and protein-coding genes. We used convolution neural network (CNN) based deep network to classify the promoter of these two broad categories of human genes. Our computational model, built upon the sequence information only, was able to classify these two groups of promoters from human at a rate of 83.34% accuracy and outperformed the existing model. Further analysis and interpretation of the output from DeepCNPP architecture will enable us to understand the difference in transcription regulatory pattern for these two groups of genes.
引用
收藏
页码:232 / 235
页数:4
相关论文
共 50 条
  • [21] Identifying critical protein-coding genes and long non-coding RNAs in non-functioning pituitary adenoma recurrence
    Guo, Jing
    Fang, Qiuyue
    Liu, Yulou
    Xie, Weiyan
    Zhang, Yazhuo
    Li, Chuzhong
    ONCOLOGY LETTERS, 2021, 21 (04)
  • [22] Expression changes in protein-coding genes and long non-coding RNAs in denatured dermis following thermal injury
    Yu, Wenchang
    Guo, Zaiwen
    Liang, Pengfei
    Jiang, Bimei
    Guo, Le
    Duan, Mengting
    Huang, Xu
    Zhang, Pihong
    Zhang, Minghua
    Ren, Licheng
    Zeng, Jizhang
    Huang, Xiaoyuan
    BURNS, 2020, 46 (05) : 1128 - 1135
  • [23] Hominoid-Specific De Novo Protein-Coding Genes Originating from Long Non-Coding RNAs
    Xie, Chen
    Zhang, Yong E.
    Chen, Jia-Yu
    Liu, Chu-Jun
    Zhou, Wei-Zhen
    Li, Ying
    Zhang, Mao
    Zhang, Rongli
    Wei, Liping
    Li, Chuan-Yun
    PLOS GENETICS, 2012, 8 (09):
  • [24] A guide to naming human non-coding RNA genes
    Seal, Ruth L.
    Chen, Ling-Ling
    Griffiths-Jones, Sam
    Lowe, Todd M.
    Mathews, Michael B.
    O'Reilly, Dawn
    Pierce, Andrew J.
    Stadler, Peter F.
    Ulitsky, Igor
    Wolin, Sandra L.
    Bruford, Elspeth A.
    EMBO JOURNAL, 2020, 39 (06):
  • [25] Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs
    Cheng, Chao
    Ung, Matthew
    Grant, Gavin D.
    Whitfield, Michael L.
    PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (07)
  • [26] New genes from non-coding sequence: the role of de novo protein-coding genes in eukaryotic evolutionary innovation
    McLysaght, Aoife
    Guerzoni, Daniele
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2015, 370 (1678)
  • [27] Comprehensive analysis of candidate signatures of long non-coding RNA LINC01116 and related protein-coding genes in patients with hepatocellular carcinoma
    Xiang-Kun Wang
    Xu-Dong Zhang
    Kai Luo
    Long Yu
    Shuai Huang
    Zhong-Yuan Liu
    Ren-Feng Li
    BMC Gastroenterology, 23
  • [28] Comprehensive analysis of candidate signatures of long non-coding RNA LINC01116 and related protein-coding genes in patients with hepatocellular carcinoma
    Wang, Xiang-Kun
    Zhang, Xu-Dong
    Luo, Kai
    Yu, Long
    Huang, Shuai
    Liu, Zhong-Yuan
    Li, Ren-Feng
    BMC GASTROENTEROLOGY, 2023, 23 (01)
  • [29] Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players
    Delihas, Nicholas
    PLOS ONE, 2020, 15 (03):
  • [30] Long non-coding RNAs display higher natural expression variation than protein-coding genes in healthy humans
    Kornienko, Aleksandra E.
    Dotter, Christoph P.
    Guenzl, Philipp M.
    Gisslinger, Heinz
    Gisslinger, Bettina
    Cleary, Ciara
    Kralovics, Robert
    Pauler, Florian M.
    Barlow, Denise P.
    GENOME BIOLOGY, 2016, 17