RRW: repeated random walks on genome-scale protein networks for local cluster discovery

被引:136
|
作者
Macropol, Kathy [1 ]
Can, Tolga [2 ]
Singh, Ambuj K. [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[2] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
COMPLEXES; ALGORITHMS; MODULES;
D O I
10.1186/1471-2105-10-283
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e. g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. Results: We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL), and find a significant improvement in the RRW clusters' precision and accuracy values. Conclusion: RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multifunctional proteins by allowing overlapping clusters.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Genome-Scale Modeling of the Protein Secretory Machinery in Yeast
    Feizi, Amir
    Osterlund, Tobias
    Petranovic, Dina
    Bordel, Sergio
    Nielsen, Jens
    PLOS ONE, 2013, 8 (05):
  • [22] Expanding the uses of genome-scale models with protein structures
    Mih, Nathan
    Palsson, Bernhard O.
    MOLECULAR SYSTEMS BIOLOGY, 2019, 15 (11)
  • [23] InterProScan 5: genome-scale protein function classification
    Jones, Philip
    Binns, David
    Chang, Hsin-Yu
    Fraser, Matthew
    Li, Weizhong
    McAnulla, Craig
    McWilliam, Hamish
    Maslen, John
    Mitchell, Alex
    Nuka, Gift
    Pesseat, Sebastien
    Quinn, Antony F.
    Sangrador-Vegas, Amaia
    Scheremetjew, Maxim
    Yong, Siew-Yit
    Lopez, Rodrigo
    Hunter, Sarah
    BIOINFORMATICS, 2014, 30 (09) : 1236 - 1240
  • [24] Feature set optimization in biomarker discovery from genome-scale data
    Fortino, V.
    Scala, G.
    Greco, D.
    BIOINFORMATICS, 2020, 36 (11) : 3393 - 3400
  • [25] cisRED: a database system for genome-scale computational discovery of regulatory elements
    Robertson, G.
    Bilenky, M.
    Lin, K.
    He, A.
    Yuen, W.
    Dagpinar, M.
    Varhol, R.
    Teague, K.
    Griffith, O. L.
    Zhang, X.
    Pan, Y.
    Hassel, M.
    Sleumer, M. C.
    Pan, W.
    Pleasance, E. D.
    Chuang, M.
    Hao, H.
    Li, Y. Y.
    Robertson, N.
    Fjell, C.
    Li, B.
    Montgomery, S. B.
    Astakhova, T.
    Zhou, J.
    Sander, J.
    Siddiqui, A. S.
    Jones, S. J. M.
    NUCLEIC ACIDS RESEARCH, 2006, 34 : D68 - D73
  • [26] Genome-scale metabolic models as platforms for strain design and biological discovery
    Mienda, Bashir Sajo
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (09): : 1863 - 1873
  • [27] Integrating gene and protein expression data with genome-scale metabolic networks to infer functional pathways
    Pey, Jon
    Valgepea, Kaspar
    Rubio, Angel
    Beasley, John E.
    Planes, Francisco J.
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [28] Toward the automated generation of genome-scale metabolic networks in the SEED
    DeJongh, Matthew
    Formsma, Kevin
    Boillot, Paul
    Gould, John
    Rycenga, Matthew
    Best, Aaron
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [29] Environmental versatility promotes modularity in genome-scale metabolic networks
    Samal, Areejit
    Wagner, Andreas
    Martin, Olivier C.
    BMC SYSTEMS BIOLOGY, 2011, 5
  • [30] Reverse engineering and analysis of large genome-scale gene networks
    Aluru, Maneesha
    Zola, Jaroslaw
    Nettleton, Dan
    Aluru, Srinivas
    NUCLEIC ACIDS RESEARCH, 2013, 41 (01)