MulStack: An ensemble learning prediction model of multilabel mRNA subcellular localization

被引:2
|
作者
Liu Z. [1 ]
Bai T. [2 ,3 ,4 ]
Liu B. [3 ,4 ]
Yu L. [1 ]
机构
[1] School of Computer Science and Technology, Xidian University, Xian
[2] School of Mathematics & Computer Science, Yan'an University, Shaanxi
[3] School of Computer Science and Technology, Beijing Institute of Technology, Beijing
[4] Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing
基金
中国国家自然科学基金;
关键词
Deep learning; Ensemble learning predictor; mRNA features at two levels; Multilabel mRNA subcellular localization; Position encoding;
D O I
10.1016/j.compbiomed.2024.108289
中图分类号
学科分类号
摘要
Subcellular localization of mRNA is related to protein synthesis, cell polarity, cell movement and other biological regulation mechanisms. The distribution of mRNAs in subcellulars is similar to that of proteins, and most mRNAs are distributed in multiple subcellulars. Recently, some computational methods have been designed to predict the subcellular localization of mRNA. However, these methods only employed a sin-gle level of mRNA features and did not employ the position encoding of nucleotides in mRNA. In this paper, an ensemble learning prediction model is proposed, named MulStack, which is based on random forest and deep learning for multilabel mRNA subcellular localization. The proposed method employs two levels of mRNA features, including sequence-level and residue-level features, and position encoding is employed for the first time in the field of subcellular localization of mRNA. Random forest is employed to learn mRNA sequence-level feature, deep learning is employed to learn mRNA sequence-level feature and mRNA residue-level combined with position encoding. And the outputs of random forest and deep learning model will be weighted sum as the prediction probability. Compared with existing methods, the results show that MulStack is the best in the localization of the nucleus, cytosol and exosome. In addition, position weight matrices (PWMs) are extracted by convolutional neural networks (CNNs) that can be matched with known RNA binding protein motifs. Gene ontology (GO) enrichment analysis shows biological processes, molecular functions and cellular components of mRNA genes. The prediction web server of MulStack is freely accessible at http://bliulab.net/MulStack. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [21] MSLP: mRNA subcellular localization predictor based on machine learning techniques
    Saleh Musleh
    Mohammad Tariqul Islam
    Rizwan Qureshi
    Nehad M. Alajez
    Tanvir Alam
    BMC Bioinformatics, 24
  • [22] MiRLoc: predicting miRNA subcellular localization by incorporating miRNA-mRNA interactions and mRNA subcellular localization
    Xu, Mingmin
    Chen, Yuanyuan
    Xu, Zhihui
    Zhang, Liangyun
    Jiang, Hangjin
    Pian, Cong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [23] DeepLoc: prediction of protein subcellular localization using deep learning
    Armenteros, Jose Juan Almagro
    Sonderby, Casper Kaae
    Sonderby, Soren Kaae
    Nielsen, Henrik
    Winther, Ole
    BIOINFORMATICS, 2017, 33 (21) : 3387 - 3395
  • [24] Advancing mRNA subcellular localization prediction with graph neural network and RNA structure
    Li, Fuyi
    Bi, Yue
    Guo, Xudong
    Tan, Xiaolan
    Wang, Cong
    Pan, Shirui
    BIOINFORMATICS, 2024, 40 (08)
  • [25] Prediction of human protein subcellular localization using deep learning
    Wei, Leyi
    Ding, Yijie
    Su, Ran
    Tang, Jijun
    Zou, Quan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 212 - 217
  • [26] Protein subcellular and secreted localization prediction using deep learning
    Zidoum, Hamza
    Magdy, Mennatollah
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018,
  • [27] Ensemble Learning for Robust Prediction of microRNA-mRNA Interactions
    Yu, Seunghak
    Kim, Juho
    Min, Hyeyoung
    Yoon, Sungroh
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 45 - +
  • [28] DeepmRNALoc: A Novel Predictor of Eukaryotic mRNA Subcellular Localization Based on Deep Learning
    Wang, Shihang
    Shen, Zhehan
    Liu, Taigang
    Long, Wei
    Jiang, Linhua
    Peng, Sihua
    MOLECULES, 2023, 28 (05):
  • [29] Correction: MSLP: mRNA subcellular localization predictor based on machine learning techniques
    Saleh Musleh
    Mohammad Tariqul Islam
    Rizwan Qureshi
    Nehad M. Alajez
    Tanvir Alam
    BMC Bioinformatics, 24
  • [30] Prediction of protein subcellular localization
    Yu, Chin-Sheng
    Chen, Yu-Ching
    Lu, Chih-Hao
    Hwang, Jenn-Kang
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (03) : 643 - 651