Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning

被引:10
|
作者
Li, Yiming [1 ]
Zeng, Min [1 ]
Wu, Yifan [1 ]
Li, Yaohang [2 ]
Li, Min [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
基金
中国国家自然科学基金;
关键词
Proteins; Feature extraction; Protein sequence; Biological information theory; Deep learning; Amino acids; Predictive models; essential protein prediction; ensemble learning; evolutionary information; PSSM; ESSENTIAL GENES; SUBCELLULAR-LOCALIZATION; IDENTIFICATION; LETHALITY; DATABASE;
D O I
10.1109/TCBB.2021.3122294
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Essential proteins are considered the foundation of life as they are indispensable for the survival of living organisms. Computational methods for essential protein discovery provide a fast way to identify essential proteins. But most of them heavily rely on various biological information, especially protein-protein interaction networks, which limits their practical applications. With the rapid development of high-throughput sequencing technology, sequencing data has become the most accessible biological data. However, using only protein sequence information to predict essential proteins has limited accuracy. In this paper, we propose EP-EDL, an ensemble deep learning model using only protein sequence information to predict human essential proteins. EP-EDL integrates multiple classifiers to alleviate the class imbalance problem and to improve prediction accuracy and robustness. In each base classifier, we employ multi-scale text convolutional neural networks to extract useful features from protein sequence feature matrices with evolutionary information. Our computational results show that EP-EDL outperforms the state-of-the-art sequence-based methods. Furthermore, EP-EDL provides a more practical and flexible way for biologists to accurately predict essential proteins. The source code and datasets can be downloaded from https://github.com/CSUBioGroup/EP-EDL.
引用
收藏
页码:3263 / 3271
页数:9
相关论文
共 50 条
  • [31] Construction of an Ensemble Scheme for Stock Price Prediction Using Deep Learning Techniques
    Appati, Justice Kwame
    Denwar, Ismail Wafaa
    Owusu, Ebenezer
    Soli, Michael Agbo Tettey
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2021, 17 (02) : 72 - 95
  • [32] Prediction of spontaneous imbibition in porous media using deep and ensemble learning techniques
    Mahdaviara, Mehdi
    Sharifi, Mohammad
    Bakhshian, Sahar
    Shokri, Nima
    FUEL, 2022, 329
  • [33] Improving Individual Brain Age Prediction Using an Ensemble Deep Learning Framework
    Kuo, Chen-Yuan
    Tai, Tsung-Ming
    Lee, Pei-Lin
    Tseng, Chiu-Wang
    Chen, Chieh-Yu
    Chen, Liang-Kung
    Lee, Cheng-Kuang
    Chou, Kun-Hsien
    See, Simon
    Lin, Ching-Po
    FRONTIERS IN PSYCHIATRY, 2021, 12
  • [34] Ensemble deep learning and EfficientNet for accurate diagnosis of diabetic retinopathy
    Arora, Lakshay
    Singh, Sunil K.
    Kumar, Sudhakar
    Gupta, Hardik
    Alhalabi, Wadee
    Arya, Varsha
    Bansal, Shavi
    Chui, Kwok Tai
    Gupta, Brij B.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [35] An Ensemble Learning Approach for Accurate Energy Prediction in Residential Buildings
    Al-Rakhami, Mabrook
    Gumaei, Abdu
    Alsanad, Ahmed
    Alamri, Atif
    Hassan, Mohammad Mehedi
    IEEE ACCESS, 2019, 7 : 48328 - 48338
  • [36] Deep Ensemble Learning for Human Activity Recognition Using Smart hone
    Zhu, Ran
    Xiao, Zhuoling
    Cheng, Mo
    Zhou, Liang
    Yan, Bo
    Lin, Shuisheng
    Wen, HongKai
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [37] Accurate and fast calibration for FBG demodulation based on deep learning and ensemble learning
    Sheng, Wenjuan
    Yin, Xin
    Wen, Jianxiang
    Peng, G. D.
    OPTICS AND LASER TECHNOLOGY, 2024, 172
  • [38] DELPHI: accurate deep ensemble model for protein interaction sites prediction
    Li, Yiwei
    Golding, G. Brian
    Ilie, Lucian
    BIOINFORMATICS, 2021, 37 (07) : 896 - 904
  • [39] Churn Prediction using Ensemble Learning
    Wang, Xing
    Khang Nguyen
    Nguyen, Binh P.
    ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 56 - 60
  • [40] An Efficient Next Word Prediction for Accurate Information using Deep Learning Algorithms
    Rao, B. Tarakeswara
    Ramesh, E.
    Srinagesh, A.
    Rao, K. Srinivasa
    Kumar, N. Kiran
    Prasad, P. Siva
    Mallikarjuna, B. Naga
    Arun, K.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 665 - 669