Co-training based prediction of multi-label protein–protein interactions

被引:0
|
作者
Tang T. [1 ]
Zhang X. [2 ]
Li W. [1 ]
Wang Q. [3 ]
Liu Y. [4 ,5 ]
Cao X. [6 ]
机构
[1] School of Modern Posts, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[2] Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore
[3] School of Management, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[4] College of Computer Science and Electronic Engineering, Hunan University, 2 Lushan Rd, Hunan, Changsha
[5] Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education, Anhui University, 111 Jiulong Road, Anhui, Hefei
[6] School of Artificial Intelligence, Jilin University, 2699 Qianjin St, Changchun, Jilin
基金
中国国家自然科学基金;
关键词
Computational PPI prediction; Deep learning; Machine learning; Protein–protein interaction;
D O I
10.1016/j.compbiomed.2024.108623
中图分类号
学科分类号
摘要
Prediction of protein–protein interaction (PPI) types enhances the comprehension of the underlying structural characteristics and functions of proteins, which gives rise to a multi-label classification problem. The nominal features describe the physicochemical characteristics of proteins directly, establishing a more robust correlation with the interaction types between proteins than ordered features. Motivated by this, we propose a multi-label PPI prediction model referred to as CoMPPI (Co-training based Multi-Label prediction of Protein–Protein Interaction). This approach aims to maximize the utility of both ordered and nominal features extracted from protein sequences. Specifically, CoMPPI incorporates graph convolutional network (GCN) and 1D convolution operation to process the complementary subsets of features individually, leveraging both local and contextualized information in a more efficient way. In addition, two multi-type PPI datasets were constructed to eliminate the duplication in previous datasets. We compare the performance of CoMPPI with three state-of-the-art methods on three datasets partitioned using distinct schemes (Breadth-first search, Depth-first search, and Random), CoMPPI consistently outperforms the other methods across all cases, demonstrating improvements ranging from 3.81% to 32.40% in Micro-F1. The subsequent ablation experiment confirms the efficacy of employing the co-training framework for multi-label PPI prediction, indicating promising avenues for future advancements in this domain. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [31] A hierarchical multi-label classification ant colony algorithm for protein function prediction
    Otero F.E.B.
    Freitas A.A.
    Johnson C.G.
    Memetic Computing, 2010, 2 (3) : 165 - 181
  • [32] ProStructNet: Integration of Protein Sequence and Structure for the Prediction of Multi-label Subcellular Localization
    Shi, Haopeng
    Zhang, Xiankun
    Deng, Qingxu
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 326 - 336
  • [33] Multi-label Correlated Semi-supervised Learning for Protein Function Prediction
    Jiang, Jonathan Q.
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 368 - 379
  • [34] Design and development of an efficient hierarchical approach for multi-label protein function prediction
    Prabha, Mohana G.
    Chitra, S.
    BIOMEDICAL RESEARCH-INDIA, 2017, 28
  • [35] Multi-task Optimization Based Co-training for Electricity Consumption Prediction
    Song, Hui
    Qin, A. K.
    Yan, Chenggang
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [36] Protein function prediction through multi-view multi-label latent tensor reconstruction
    Armah-Sekum, Robert Ebo
    Szedmak, Sandor
    Rousu, Juho
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [37] Multi-scale deep learning for the imbalanced multi-label protein subcellular localization prediction based on immunohistochemistry images
    Wang, Fengsheng
    Wei, Leyi
    BIOINFORMATICS, 2022, 38 (09) : 2602 - 2611
  • [38] Multi-Label Hierarchical Classification using a Competitive Neural Network for Protein Function Prediction
    Borges, Helyane Bronoski
    Nievola, Julio Cesar
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [39] DeepLoc 2.0: multi-label subcellular localization prediction using protein language models
    Thumuluri, Vineet
    Armenteros, Jose Juan Almagro
    Johansen, Alexander Rosenberg
    Nielsen, Henrik
    Winther, Ole
    NUCLEIC ACIDS RESEARCH, 2022, 50 (W1) : W228 - W234
  • [40] Hierarchical Multi-label Associative Classification for Protein Function Prediction Using Gene Ontology
    Sangsuriyun, Sawinee
    Rakthanmanon, Thanawin
    Waiyamai, Kitsana
    CHIANG MAI JOURNAL OF SCIENCE, 2019, 46 (01): : 165 - 179