Co-training based prediction of multi-label protein–protein interactions

Cited by: 0
Authors
Tang T. [1]
Zhang X. [2]
Li W. [1]
Wang Q. [3]
Liu Y. [4, 5]
Cao X. [6]
Affiliations
[1] School of Modern Posts, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[2] Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore
[3] School of Management, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[4] College of Computer Science and Electronic Engineering, Hunan University, 2 Lushan Rd, Hunan, Changsha
[5] Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education, Anhui University, 111 Jiulong Road, Anhui, Hefei
[6] School of Artificial Intelligence, Jilin University, 2699 Qianjin St, Changchun, Jilin
Funding
National Natural Science Foundation of China
Keywords
Computational PPI prediction; Deep learning; Machine learning; Protein–protein interaction
DOI
10.1016/j.compbiomed.2024.108623
Abstract
Predicting the types of protein–protein interactions (PPIs) improves understanding of the underlying structural characteristics and functions of proteins, and is naturally a multi-label classification problem. Nominal features describe the physicochemical characteristics of proteins directly and correlate more robustly with the interaction types between proteins than ordered features do. Motivated by this, we propose a multi-label PPI prediction model, CoMPPI (Co-training based Multi-Label prediction of Protein–Protein Interaction), which aims to make full use of both the ordered and the nominal features extracted from protein sequences. Specifically, CoMPPI uses a graph convolutional network (GCN) and a 1D convolution operation to process the complementary feature subsets separately, exploiting both local and contextualized information more efficiently. In addition, two multi-type PPI datasets were constructed to eliminate the duplication present in previous datasets. We compare CoMPPI with three state-of-the-art methods on three datasets partitioned using distinct schemes (breadth-first search, depth-first search, and random). CoMPPI consistently outperforms the other methods in all cases, with Micro-F1 improvements ranging from 3.81% to 32.40%. The subsequent ablation experiment confirms the efficacy of the co-training framework for multi-label PPI prediction, indicating promising avenues for future advancements in this domain. © 2024 Elsevier Ltd
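The abstract reports its results in Micro-F1. As a minimal sketch of how that metric is usually computed for multi-label predictions (this is the standard definition, not code from the paper), the example below pools true positives, false positives, and false negatives over every protein pair and every label before computing F1. The function name `micro_f1`, the three-label layout, and the toy matrices are illustrative assumptions only.

```python
def micro_f1(y_true, y_pred):
    """Micro-averaged F1 for multi-label data.

    y_true and y_pred are lists of equal-length 0/1 rows: one row per
    protein pair, one column per interaction type (hypothetical layout).
    """
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if t == 1 and p == 1:
                tp += 1  # label present and predicted
            elif t == 0 and p == 1:
                fp += 1  # label predicted but absent
            elif t == 1 and p == 0:
                fn += 1  # label present but missed
    denom = 2 * tp + fp + fn
    # Convention assumed here: return 0.0 when there are no positives at all.
    return 2 * tp / denom if denom else 0.0


# Toy data: 3 protein pairs, 3 hypothetical interaction types.
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0]]
y_pred = [[1, 0, 0], [0, 1, 1], [1, 0, 0]]

score = micro_f1(y_true, y_pred)
print(f"Micro-F1 = {score:.4f}")  # tp=3, fp=1, fn=2 -> 6/9
```

Because the counts are pooled before averaging, frequent interaction types weigh more heavily in Micro-F1 than rare ones, which is worth keeping in mind when reading the reported improvements.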
Related papers
50 in total
  • [41] ADAPTIVE THRESHOLDING FOR MULTI-LABEL SVM CLASSIFICATION WITH APPLICATION TO PROTEIN SUBCELLULAR LOCALIZATION PREDICTION
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3547 - 3551
  • [42] Dual Fuzzy Hypergraph Regularized Multi-label Learning for Protein Subcellular Location Prediction
    Chen, Jing
    Tang, Yuan Yan
    Chen, C. L. Philip
    Lin, Yuewei
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 512 - 516
  • [43] Genome-Wide Protein Function Prediction through Multi-Instance Multi-Label Learning
    Wu, Jian-Sheng
    Huang, Sheng-Jun
    Zhou, Zhi-Hua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (05) : 891 - 902
  • [44] A Multi-label Classifier for Human Protein Subcellular Localization Based on LSTM Networks
    Gao, Zhiying
    Sun, Lijun
    Wei, Zhihua
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (ACAAI 2018), 2018, 155 : 248 - 252
  • [45] MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou's pseudo amino acid composition and a novel multi-label classifier
    Wang, Xiao
    Zhang, Weiwei
    Zhang, Qiuwen
    Li, Guo-Zheng
    BIOINFORMATICS, 2015, 31 (16) : 2639 - 2645
  • [46] HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source
    Wan, Shixiang
    Duan, Yucong
    Zou, Quan
    PROTEOMICS, 2017, 17 (17-18)
  • [47] Function-Function Correlated Multi-label Protein Function Prediction over Interaction Networks
    Wang, Hua
    Huang, Heng
    Ding, Chris
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (04) : 322 - 343
  • [48] DCPE Co-Training: Co-Training Based on Diversity of Class Probability Estimation
    Xu, Jin
    He, Haibo
    Man, Hong
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010
  • [49] Multi-instance multi-label distance metric learning for genome-wide protein function prediction
    Xu, Yonghui
    Min, Huaqing
    Song, Hengjie
    Wu, Qingyao
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 63 : 30 - 40
  • [50] Multi-instance multi-label learning for labels with directed acyclic graph structures in protein function prediction
    Wu J.
    Tang S.
    Mei D.
    Zhu Y.
    Diao Y.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (03) : 23 - 30