Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning

Cited by: 10
Authors
Li, Yiming [1]
Zeng, Min [1]
Wu, Yifan [1]
Li, Yaohang [2]
Li, Min [1]
Affiliations
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
Funding
National Natural Science Foundation of China
Keywords
Proteins; Feature extraction; Protein sequence; Biological information theory; Deep learning; Amino acids; Predictive models; essential protein prediction; ensemble learning; evolutionary information; PSSM; ESSENTIAL GENES; SUBCELLULAR-LOCALIZATION; IDENTIFICATION; LETHALITY; DATABASE;
DOI
10.1109/TCBB.2021.3122294
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010 ; 081704 ;
Abstract
Essential proteins are considered the foundation of life because they are indispensable for the survival of living organisms. Computational methods offer a fast way to identify essential proteins, but most of them rely heavily on several kinds of biological information, especially protein-protein interaction networks, which limits their practical application. With the rapid development of high-throughput sequencing technology, sequencing data has become the most accessible biological data. However, predicting essential proteins from protein sequence information alone has so far yielded limited accuracy. In this paper, we propose EP-EDL, an ensemble deep learning model that uses only protein sequence information to predict human essential proteins. EP-EDL integrates multiple classifiers to alleviate the class imbalance problem and to improve prediction accuracy and robustness. In each base classifier, we employ multi-scale text convolutional neural networks to extract useful features from protein sequence feature matrices that carry evolutionary information. Our computational results show that EP-EDL outperforms state-of-the-art sequence-based methods. Furthermore, EP-EDL provides a more practical and flexible way for biologists to accurately predict essential proteins. The source code and datasets can be downloaded from https://github.com/CSUBioGroup/EP-EDL.
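The abstract says that EP-EDL integrates multiple classifiers to alleviate class imbalance but does not say how the training data are divided among them. The sketch below illustrates one common approach, under stated assumptions: the majority class (non-essential proteins) is split into chunks roughly the size of the minority class (essential proteins), one base classifier is trained per balanced subset, and the predicted probabilities are averaged. The names `balanced_subsets`, `CentroidClassifier`, `train_ensemble`, and `ensemble_proba` are hypothetical, each protein is reduced to a single toy score, and the centroid classifier is only a stand-in for the multi-scale text CNN base classifiers; none of this is taken from the EP-EDL source code.

```python
import random
from statistics import mean


def balanced_subsets(majority, minority, seed=0):
    """Split the majority class into chunks about the size of the minority
    class and pair each chunk with the full minority class."""
    rng = random.Random(seed)
    shuffled = majority[:]
    rng.shuffle(shuffled)
    k = max(1, len(shuffled) // len(minority))  # number of base classifiers
    chunks = [shuffled[i::k] for i in range(k)]
    return [(chunk, minority) for chunk in chunks]


class CentroidClassifier:
    """Toy base classifier on a single score per protein.
    Hypothetical stand-in; the paper's base classifiers are CNNs."""

    def fit(self, negatives, positives):
        self.neg = mean(negatives)
        self.pos = mean(positives)
        return self

    def predict_proba(self, x):
        # Closer to the positive centroid gives a probability nearer to 1.
        d_neg = abs(x - self.neg)
        d_pos = abs(x - self.pos)
        total = d_neg + d_pos
        return 0.5 if total == 0 else d_neg / total


def train_ensemble(majority, minority, seed=0):
    """Train one base classifier per balanced subset."""
    return [
        CentroidClassifier().fit(neg, pos)
        for neg, pos in balanced_subsets(majority, minority, seed)
    ]


def ensemble_proba(models, x):
    """Average the base classifiers' predicted probabilities."""
    return mean(m.predict_proba(x) for m in models)


# Synthetic, imbalanced toy data (4:1); not real protein features.
rng = random.Random(42)
non_essential = [rng.gauss(0.0, 1.0) for _ in range(400)]
essential = [rng.gauss(3.0, 1.0) for _ in range(100)]

models = train_ensemble(non_essential, essential)
print(len(models), ensemble_proba(models, 3.0), ensemble_proba(models, 0.0))
```

Because every base classifier sees a balanced subset, none of them is dominated by the majority class, while averaging lets the ensemble still use all of the majority-class samples.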
Pages: 3263-3271
Number of pages: 9
Related papers
50 in total
  • [1] Accurate prediction of essential proteins using ensemble machine learning
    Lu, Dezhi
    Wu, Hao
    Hou, Yutong
    Wu, Yuncheng
    Liu, Yuanyuan
    Wang, Jinwu
    CHINESE PHYSICS B, 2025, 34 (01)
  • [2] Accurate prediction of essential proteins using ensemble machine learning
    鲁德志
    吴淏
    侯俞彤
    吴云成
    刘媛媛
    王金武
    Chinese Physics B, 2025, 34 (01) : 112 - 119
  • [3] De novo Prediction of Moonlighting Proteins Using Multimodal Deep Ensemble Learning
    Li, Ying
    Zhao, Jianing
    Liu, Zhaoqian
    Wang, Cankun
    Wei, Lizheng
    Han, Siyu
    Du, Wei
    FRONTIERS IN GENETICS, 2021, 12
  • [4] A Sequence-Based Prediction Model of Vesicular Transport Proteins Using Ensemble Deep Learning
    Le, Nguyen Quoc Khanh
    Kha, Quang Hien
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
  • [5] Efficient and accurate diagnosis of otomycosis using an ensemble deep learning model
    Mao, Chenggang
    Li, Aimin
    Wang, Juehui
    Sun, Yi
    Peng, Dan
    MEDICAL MYCOLOGY, 2022, 60 (SUPP 1) : 247 - 247
  • [6] Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
    Yin, Zhen-Ning
    Lai, Fei-Liao
    Gao, Feng
    BRIEFINGS IN BIOINFORMATICS, 2023, 25 (01)
  • [7] Ensemble learning for accurate prediction of heart sounds using gammatonegram images
    Singh, Sinam Ashinikumar
    Singh, Sinam Ajitkumar
    Singh, Aheibham Dinamani
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2024, 32 (04) : 555 - 573
  • [8] SAPPHIRE: A stacking-based ensemble learning framework for accurate prediction of thermophilic proteins
    Charoenkwan, Phasit
    Schaduangrat, Nalini
    Moni, Mohammad Ali
    Lio, Pietro
    Manavalan, Balachandran
    Shoombuatong, Watshara
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [9] Enhancing Flood Prediction using Ensemble and Deep Learning Techniques
    Nti, Isaac Kofi
    Nyarko-Boateng, Owusu
    Boateng, Samuel
    Bawah, F. U.
    Agbedanu, P. R.
    Awarayi, N. S.
    Nimbe, P.
    Adekoya, A. F.
    Weyori, B. A.
    Akoto-Adjepong, Vivian
    2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 662 - 670
  • [10] Human Fall Prediction Using Ensemble Learning Technique
    Roy, A.
    Mukherjee, R.
    Moulik, S.
    Chakrabarti, A.
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 545 - 546