SMETANA: Accurate and Scalable Algorithm for Probabilistic Alignment of Large-Scale Biological Networks

被引:71
|
作者
Sahraeian, Sayed Mohammad Ebrahim [1 ]
Yoon, Byung-Jun [2 ]
机构
[1] Univ Calif, Dept Plant & Microbial Biol, Berkeley, CA USA
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
来源
PLOS ONE | 2013年 / 8卷 / 07期
基金
美国国家科学基金会;
关键词
PROTEIN-INTERACTION NETWORKS; GLOBAL ALIGNMENT; SYSTEMATIC IDENTIFICATION; CONSERVED PATHWAYS; SIMILARITY; DATABASE; YEAST;
D O I
10.1371/journal.pone.0067995
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we introduce an efficient algorithm for alignment of multiple large-scale biological networks. In this scheme, we first compute a probabilistic similarity measure between nodes that belong to different networks using a semi-Markov random walk model. The estimated probabilities are further enhanced by incorporating the local and the cross-species network similarity information through the use of two different types of probabilistic consistency transformations. The transformed alignment probabilities are used to predict the alignment of multiple networks based on a greedy approach. We demonstrate that the proposed algorithm, called SMETANA, outperforms many state-of-the-art network alignment techniques, in terms of computational efficiency, alignment accuracy, and scalability. Our experiments show that SMETANA can easily align tens of genome-scale networks with thousands of nodes on a personal computer without any difficulty. The source code of SMETANA is available upon request. The source code of SMETANA can be downloaded from http://www.ece.tamu.edu/similar to bjyoon/SMETANA/.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] SparkSW: scalable distributed computing system for large-scale biological sequence alignment
    Zhao, Guoguang
    Ling, Cheng
    Sun, Donghong
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 845 - 852
  • [2] Probabilistic queries in large-scale networks
    Pedone, F
    Duarte, NL
    Goulart, M
    DEPENDABLE COMPUTING: EDCC-4, PROCEEDINGS, 2002, 2485 : 209 - 226
  • [3] FAST NETWORK QUERYING ALGORITHM FOR SEARCHING LARGE-SCALE BIOLOGICAL NETWORKS
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 6008 - 6011
  • [4] Scalable probabilistic PCA for large-scale genetic variation data
    Agrawal, Aman
    Chiu, Alec M.
    Le, Minh
    Halperin, Eran
    Sankararaman, Sriram
    PLOS GENETICS, 2020, 16 (05):
  • [5] Sequence progressive alignment, a framework for practical large-scale probabilistic consistency alignment
    Paten, Benedict
    Herrero, Javier
    Beal, Kathryn
    Birney, Ewan
    BIOINFORMATICS, 2009, 25 (03) : 295 - 301
  • [6] Scalable and Secure Routing for Large-Scale Sensor Networks
    Tang, Feilong
    Guo, Minyi
    Li, Minglu
    Wang, Zhijun
    Cheng, Zixue
    EUC 2008: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING, VOL 2, WORKSHOPS, 2008, : 300 - +
  • [7] Large-Scale Parallel Alignment Algorithm for SMRT Reads
    Xia, Zeyu
    Cui, Yingbo
    Zhang, Ang
    Zhang, Peng
    Long, Sifan
    Tang, Tao
    Peng, Lin
    Huang, Chun
    Yang, Canqun
    Liao, Xiangke
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT II, 2022, 13156 : 213 - 229
  • [8] A PARALLEL ALGORITHM FOR LARGE-SCALE MULTIPLE SEQUENCE ALIGNMENT
    Lopes, Heitor S.
    Erig Lima, Carlos R.
    Moritz, Guilherme L.
    COMPUTING AND INFORMATICS, 2010, 29 (06) : 1233 - 1250
  • [9] LinkProbe: Probabilistic Inference on Large-Scale Social Networks
    Chen, Haiquan
    Ku, Wei-Shinn
    Wang, Haixun
    Tang, Liang
    Sun, Min-Te
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 290 - 301
  • [10] Stochastic block coordinate Frank-Wolfe algorithm for large-scale biological network alignment
    Wang, Yijie
    Qian, Xiaoning
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2016, Springer Verlag (01)