Co-training based prediction of multi-label protein–protein interactions

被引:0
|
作者
Tang T. [1 ]
Zhang X. [2 ]
Li W. [1 ]
Wang Q. [3 ]
Liu Y. [4 ,5 ]
Cao X. [6 ]
机构
[1] School of Modern Posts, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[2] Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore
[3] School of Management, Nanjing University of Posts and Telecommunications, 9 Wenyuan Rd, Jiangsu, Nanjing
[4] College of Computer Science and Electronic Engineering, Hunan University, 2 Lushan Rd, Hunan, Changsha
[5] Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education, Anhui University, 111 Jiulong Road, Anhui, Hefei
[6] School of Artificial Intelligence, Jilin University, 2699 Qianjin St, Changchun, Jilin
基金
中国国家自然科学基金;
关键词
Computational PPI prediction; Deep learning; Machine learning; Protein–protein interaction;
D O I
10.1016/j.compbiomed.2024.108623
中图分类号
学科分类号
摘要
Prediction of protein–protein interaction (PPI) types enhances the comprehension of the underlying structural characteristics and functions of proteins, which gives rise to a multi-label classification problem. The nominal features describe the physicochemical characteristics of proteins directly, establishing a more robust correlation with the interaction types between proteins than ordered features. Motivated by this, we propose a multi-label PPI prediction model referred to as CoMPPI (Co-training based Multi-Label prediction of Protein–Protein Interaction). This approach aims to maximize the utility of both ordered and nominal features extracted from protein sequences. Specifically, CoMPPI incorporates graph convolutional network (GCN) and 1D convolution operation to process the complementary subsets of features individually, leveraging both local and contextualized information in a more efficient way. In addition, two multi-type PPI datasets were constructed to eliminate the duplication in previous datasets. We compare the performance of CoMPPI with three state-of-the-art methods on three datasets partitioned using distinct schemes (Breadth-first search, Depth-first search, and Random), CoMPPI consistently outperforms the other methods across all cases, demonstrating improvements ranging from 3.81% to 32.40% in Micro-F1. The subsequent ablation experiment confirms the efficacy of employing the co-training framework for multi-label PPI prediction, indicating promising avenues for future advancements in this domain. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [21] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Ricardo Cerri
    Rodrigo C. Barros
    André C. P. L. F. de Carvalho
    Yaochu Jin
    BMC Bioinformatics, 17
  • [22] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    Jin, Yaochu
    BMC BIOINFORMATICS, 2016, 17
  • [23] A Multi-label Classifier for Prediction Membrane Protein Functional Types in Animal
    Zou, Hong-Liang
    JOURNAL OF MEMBRANE BIOLOGY, 2014, 247 (11): : 1141 - 1148
  • [24] Protein Function Prediction Using Multi-label Learning and ISOMAP Embedding
    Liang, Huadong
    Sun, Dengdi
    Ding, Zhuanlian
    Ge, Meiling
    BIO-INSPIRED COMPUTING - THEORIES AND APPLICATIONS, BIC-TA 2015, 2015, 562 : 249 - 259
  • [25] DeepLoc 2.1: multi-label membrane protein type prediction using protein language models
    Odum, Marius Thrane
    Teufel, Felix
    Thumuluri, Vineet
    Armenteros, Jose Juan Almagro
    Johansen, Alexander Rosenberg
    Winther, Ole
    Nielsen, Henrik
    NUCLEIC ACIDS RESEARCH, 2024, 52 (W1) : W215 - W220
  • [26] Identification of Protein Complex Based on Multi-label Learning Algorithm
    Yuan, Zhu
    Song, Liu
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6965 - 6969
  • [27] Exploiting PubMed for Protein Molecular Function Prediction via NMF based Multi-Label Classification
    Fodeh, Samah
    Tiwari, Aditya
    Yu, Hong
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 446 - 451
  • [28] Multi-view Co-training for microRNA Prediction
    Hassani, Mohsen Sheikh
    Green, James R.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [29] Multi-view Co-training for microRNA Prediction
    Mohsen Sheikh Hassani
    James R. Green
    Scientific Reports, 9
  • [30] Multi-Label Learning With Fuzzy Hypergraph Regularization for Protein Subcellular Location Prediction
    Chen, Jing
    Tang, Yuan Yan
    Chen, C. L. Philip
    Fang, Bin
    Lin, Yuewei
    Shang, Zhaowei
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (04) : 438 - 447