A novel graph clustering method with a greedy heuristic search algorithm for mining protein complexes from dynamic and static PPI networks

被引:17
|
作者
Wang, Rongquan [1 ,2 ]
Wang, Caixia [3 ]
Liu, Guixia [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Jilin, Peoples R China
[3] China Foreign Affairs Univ, Sch Int Econ, Beijing 100037, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein-protein interaction networks; Protein complexes; Graph clustering method; Greedy heuristic search algorithm; Core proteins; Clustering model; FUNCTIONAL MODULES; IDENTIFICATION; LOCALIZATION; INTERACTOME;
D O I
10.1016/j.ins.2020.02.063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discovering protein complexes from protein-protein interaction (PPI) networks is one of the primary tasks in bioinformatics. However, most of the state-of-the-art methods still face some challenges, such as the inability to discover overlapping protein complexes, failure to consider the inherent structure of real protein complexes, and non-utilization of biological information. Based on the above mentioned aspects, we present a novel graph clustering method with a greedy heuristic search algorithm for mining protein complexes using a new clustering model in dynamic and static weighted PPI networks (named MPC-C). First, MPC-C constructed dynamic and static weighted PPI networks by combining biological and topological information. Second, initial clusters were obtained using core and multifunctional proteins, following which we proposed a greedy heuristic search algorithm to expand each initial cluster and form candidate protein complexes in dynamic and static weighted PPI networks. Finally, unreliable and highly overlapping protein complexes were discarded. To demonstrate the performance of MPC-C, we tested this method on five PPI networks and compared it with nine other effective methods. The experimental results indicate that MPC-C outperformed the other state-of-the-art methods with respect to various computational and biologically relevant metrics. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:275 / 298
页数:24
相关论文
共 33 条
  • [31] MOEPGA: A novel method to detect protein complexes in yeast protein-protein interaction networks based on Multi Objective Evolutionary Programming Genetic Algorithm
    Cao, Buwen
    Luo, Jiawei
    Liang, Cheng
    Wang, Shulin
    Song, Dan
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 58 : 173 - 181
  • [32] A New Sequential Forward Feature Selection (SFFS) Algorithm for Mining Best Topological and Biological Features to Predict Protein Complexes from Protein–Protein Interaction Networks (PPINs)
    Haseeb Younis
    Muhammad Waqas Anwar
    Muhammad Usman Ghani Khan
    Aisha Sikandar
    Usama Ijaz Bajwa
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 371 - 388
  • [33] A New Sequential Forward Feature Selection (SFFS) Algorithm for Mining Best Topological and Biological Features to Predict Protein Complexes from Protein-Protein Interaction Networks (PPINs)
    Younis, Haseeb
    Anwar, Muhammad Waqas
    Khan, Muhammad Usman Ghani
    Sikandar, Aisha
    Bajwa, Usama Ijaz
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 371 - 388