Efficient protein alignment algorithm for protein search

被引:1
|
作者
Lu, Zaixin [1 ]
Zhao, Zhiyu [2 ]
Fu, Bin [1 ]
机构
[1] Univ Texas Pan Amer, Dept Comp Sci, Edinburg, TX 78539 USA
[2] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
基金
美国国家科学基金会;
关键词
STRUCTURAL ALIGNMENT; SCOP DATABASE; SIMILARITIES; GROWTH;
D O I
10.1186/1471-2105-11-S1-S34
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm itself may be insufficient for effective discovering of structural relationships among tens of thousands of proteins. Due to the exponentially increasing amount of protein structural data, a fast and accurate structure alignment tool is necessary to access protein classification and protein similarity search; however, the complexity of current alignment algorithms are usually too high to make a fully alignment-based classification and search practical. Results: We have developed an efficient protein pairwise alignment algorithm and applied it to our protein search tool, which aligns a query protein structure in the pairwise manner with all protein structures in the Protein Data Bank (PDB) to output similar protein structures. The algorithm can align hundreds of pairs of protein structures in one second. Given a protein structure, the tool efficiently discovers similar structures from tens of thousands of structures stored in the PDB always in 2 minutes in a single machine and 20 seconds in our cluster of 6 machines. The algorithm has been fully implemented and is accessible online at our webserver, which is supported by a cluster of computers. Conclusion: Our algorithm can work out hundreds of pairs of protein alignments in one second. Therefore, it is very suitable for protein search. Our experimental results show that it is more accurate than other well known protein search systems in finding proteins which are structurally similar at SCOP family and superfamily levels, and its speed is also competitive with those systems. In terms of the pairwise alignment performance, it is as good as some well known alignment algorithms.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Protein alignment algorithms with an efficient backtracking routine on multiple GPUs
    Jacek Blazewicz
    Wojciech Frohmberg
    Michal Kierzynka
    Erwin Pesch
    Pawel Wojciechowski
    BMC Bioinformatics, 12
  • [42] Efficient Protein Structure Alignment Algorithms under the MapReduce Framework
    Hung, Che-Lun
    Lin, Yaw-Ling
    Hsieh, Chen-En
    Hua, Guan-Jie
    2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [43] A new alignment-independent algorithm for clustering protein sequences
    Kelil, Abdellali
    Wang, Shengrui
    Brzezinski, Ryszard
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 27 - +
  • [44] Protein alignment algorithms with an efficient backtracking routine on multiple GPUs
    Blazewicz, Jacek
    Frohmberg, Wojciech
    Kierzynka, Michal
    Pesch, Erwin
    Wojciechowski, Pawel
    BMC BIOINFORMATICS, 2011, 12
  • [45] An improved protein sequence alignment algorithm scores pairs of substitutions
    Kilinc, Mesih
    Jia, Kejue
    Jernigan, Robert L.
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 180A - 180A
  • [46] Feedback algorithm and web-server for protein structure alignment
    Zhao, Zhiyu
    Fu, Bin
    Alanis, Francisco J.
    Summa, Christopher M.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (05) : 505 - 524
  • [47] FLEXIBLE ALGORITHM FOR DIRECT MULTIPLE ALIGNMENT OF PROTEIN STRUCTURES AND SEQUENCES
    GODZIK, A
    SKOLNICK, J
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1994, 10 (06): : 587 - 596
  • [48] A parallel hybrid genetic algorithm for multiple protein sequence alignment
    Nguyen, HD
    Yoshihara, I
    Yamamori, K
    Yasunaga, M
    CEC'02: PROCEEDINGS OF THE 2002 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2002, : 309 - 314
  • [49] Protein sequence alignment based on fuzzy arithmetic and Genetic algorithm
    Chang, Ping-Teng
    Hung, Lung-Ting
    Lin, Kuo-Ping
    Lin, Chih-Sheng
    Hung, Kuo-Chen
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 1362 - +
  • [50] Algorithm engineering for optimal alignment of protein structure distance matrices
    Wohlers, Inken
    Andonov, Rumen
    Klau, Gunnar W.
    OPTIMIZATION LETTERS, 2011, 5 (03) : 421 - 433