Co-training based prediction of multi-label protein–protein interactions

被引:0
|
作者
Tang T. [1 ]
Zhang X. [2 ]
Li W. [1 ]
Wang Q. [3 ]
Liu Y. [4 ,5 ]
Cao X. [6 ]
机构
[1] School of Modern Posts, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[2] Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore
[3] School of Management, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[4] College of Computer Science and Electronic Engineering, Hunan University, 2 Lushan Rd, Hunan, Changsha
[5] Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education, Anhui University, 111 Jiulong Road, Anhui, Hefei
[6] School of Artificial Intelligence, Jilin University, 2699 Qianjin St, Changchun, Jilin
基金
中国国家自然科学基金;
关键词
Computational PPI prediction; Deep learning; Machine learning; Protein–protein interaction;
D O I
10.1016/j.compbiomed.2024.108623
中图分类号
学科分类号
摘要
Prediction of protein–protein interaction (PPI) types enhances the comprehension of the underlying structural characteristics and functions of proteins, which gives rise to a multi-label classification problem. The nominal features describe the physicochemical characteristics of proteins directly, establishing a more robust correlation with the interaction types between proteins than ordered features. Motivated by this, we propose a multi-label PPI prediction model referred to as CoMPPI (Co-training based Multi-Label prediction of Protein–Protein Interaction). This approach aims to maximize the utility of both ordered and nominal features extracted from protein sequences. Specifically, CoMPPI incorporates graph convolutional network (GCN) and 1D convolution operation to process the complementary subsets of features individually, leveraging both local and contextualized information in a more efficient way. In addition, two multi-type PPI datasets were constructed to eliminate the duplication in previous datasets. We compare the performance of CoMPPI with three state-of-the-art methods on three datasets partitioned using distinct schemes (Breadth-first search, Depth-first search, and Random), CoMPPI consistently outperforms the other methods across all cases, demonstrating improvements ranging from 3.81% to 32.40% in Micro-F1. The subsequent ablation experiment confirms the efficacy of employing the co-training framework for multi-label PPI prediction, indicating promising avenues for future advancements in this domain. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] Multi-Label Co-Training
    Xing, Yuying
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Zhang, Zili
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2882 - 2888
  • [2] Self-paced multi-label co-training
    Gong, Yanlu
    Wu, Quanwang
    Zhou, Mengchu
    Wen, Junhao
    INFORMATION SCIENCES, 2023, 622 : 269 - 281
  • [3] Multi-Label Learning with Co-Training Based on Semi-Supervised Regression
    Xu, Meixiang
    Sun, Fuming
    Jiang, Xiaojun
    2014 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2014, : 175 - 180
  • [4] Inductive Semi-supervised Multi-Label Learning with Co-Training
    Zhan, Wang
    Zhang, Min-Ling
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1305 - 1314
  • [5] Stacked co-training for semi-supervised multi-label learning
    Li, Jiaxuan
    Zhu, Xiaoyan
    Wang, Hongrui
    Zhang, Yu
    Wang, Jiayin
    INFORMATION SCIENCES, 2024, 677
  • [6] Classifying human rights violations using deep multi-label co-training
    Kihlman, Ragini
    Fasli, Maria
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4887 - 4895
  • [7] Multi-label Feature Selection Techniques for Hierarchical Multi-label Protein Function Prediction
    Cerri, Ricardo
    Mantovani, Rafael G.
    Basgalupp, Marcio P.
    de Carvalho, Andre C. P. L. F.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [8] Cluster Tree based Multi-Label Classification for Protein Function Prediction
    Wu, Qingyao
    Ye, Yunming
    Zhang, Xiaofeng
    Ho, Shen-Shyang
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [9] Co-training based on Semi-supervised Ensemble Classification Approach for Multi-label Data Stream
    Chu, Zhe
    Li, Peipei
    Hu, Xuegang
    2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 50 - 57
  • [10] Multi-Label Learning for Protein Subcellular Location Prediction
    Wang, Xiao
    Li, Guo-Zheng
    Liu, Jia-Ming
    Zhao, Rui-Wei
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 282 - 285