Ab initio detection of fuzzy amino acid tandem repeats in protein sequences

Cited by: 15
Authors
Pellegrini, Marco [1]
Renda, Maria Elena [1]
Vecchio, Alessio [2]
Affiliations
[1] CNR, Ist Informat & Telemat, I-56124 Pisa, Italy
[2] Univ Pisa, Dipartimento Ingn Informaz, I-56122 Pisa, Italy
Source
BMC BIOINFORMATICS | 2012 / Vol. 13
Keywords
MULTIPLE SPACED SEEDS; IDENTIFICATION; ALGORITHM; ALIGNMENT;
DOI
10.1186/1471-2105-13-S3-S8
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function are often preserved even under considerable sequence divergence (fuzzy tandem repeats). Results: In this paper we present PTRStalker, a new algorithm for ab initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure. Conclusions: PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state of the art. Such a tool may be a valuable support for investigating protein structural properties when tertiary X-ray data is not available.
Pages: 13
Related papers
50 in total
  • [21] Low-complexity sequences and single amino acid repeats: not just "junk" peptide sequences
    Haerty, Wilfried
    Golding, G. Brian
    GENOME, 2010, 53 (10) : 753 - 762
  • [23] Natural selection drives the accumulation of amino acid tandem repeats in human proteins
    Mularoni, Loris
    Ledda, Alice
    Toll-Riera, Macarena
    Mar Alba, M.
    GENOME RESEARCH, 2010, 20 (06) : 745 - 754
  • [24] Comprehensive analysis of tandem amino acid repeats from ten angiosperm genomes
    Zhou, Yuan
    Liu, Jing
    Han, Lei
    Li, Zhi-Gang
    Zhang, Ziding
    BMC GENOMICS, 2011, 12
  • [25] Detection of protein coding sequences using a mixture model for local protein amino acid sequence
    Thayer, EC
    Bystroff, C
    Baker, D
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (1-2) : 317 - 327
  • [26] Detection of significant patterns by compression algorithms: The case of approximate tandem repeats in DNA sequences
    Rivals, E
    Delgrange, O
    Delahaye, JP
    Dauchet, M
    Delorme, MO
    Henaut, A
    Ollivier, E
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1997, 13 (02) : 131 - 136
  • [27] Tandem Repeats Detection in DNA Sequences Using P-Spectrum Based Algorithm
    Garg, Pardeep
    Sharma, Sunildatt
    Sharma, Sanjeev Narayan
    2017 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [28] REP2: A Web Server to Detect Common Tandem Repeats in Protein Sequences
    Kamel, Mohamed
    Kastano, Kristina
    Mier, Pablo
    Andrade-Navarro, Miguel A.
    JOURNAL OF MOLECULAR BIOLOGY, 2021, 433 (11)
  • [29] Highly constrained proteins contain an unexpectedly large number of amino acid tandem repeats
    Mularoni, Loris
    Veitia, Reiner A.
    Alba, M. Mar
    GENOMICS, 2007, 89 (03) : 316 - 325
  • [30] Detection of Tandem Repeats in DNA Sequences Using Short-Time Ramanujan Fourier Transform
    Yadav, Yashpal
    Sharma, Sanjeev Narayan
    Shakya, Devendra Kumar
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1583 - 1591