An Algorithm to Solve the Motif Alignment Problem for Approximate Nested Tandem Repeats in Biological Sequences

Cited by: 3
Authors
Matroud, Atheer A. [1 ]
Tuffley, Christopher P. [1 ]
Hendy, Michael D. [2 ]
Affiliations
[1] Massey Univ, Inst Fundamental Sci, Palmerston North 4442, Manawatu, New Zealand
[2] Univ Otago, Dept Math & Stat, Dunedin, New Zealand
Keywords
algorithms; alignment; molecular evolution; satellites; simple sequence repeats
DOI
10.1089/cmb.2011.0101
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
An approximate nested tandem repeat (NTR) in a string T is a complex repetitive structure consisting of many approximate copies of two substrings x and X ("motifs") interspersed with one another. NTRs fall into a class of repetitive structures broadly known as subrepeats. NTRs have been found in real DNA sequences and are expected to be important in evolutionary biology, both in understanding evolution of the ribosomal DNA (where NTRs can occur), and as a potential marker in population genetic and phylogenetic studies. This article describes an alignment algorithm for the verification phase of the software tool NTRFinder developed for database searches for NTRs. When the search algorithm has located a subsequence containing a possible NTR, with motifs X and x, a verification step aligns this subsequence against an exact NTR built from the templates X and x, to determine whether the subsequence contains an approximate NTR and its extent. This article describes an algorithm to solve this alignment problem in O(|T|(|X| + |x|)) space and time. The algorithm is based on Fischetti et al.'s wrap-around dynamic programming.
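The wrap-around dynamic programming idea named in the abstract can be illustrated on the simpler single-motif case. The sketch below is not the NTR alignment algorithm of the article: it handles one motif only, assumes unit edit costs, and the function name `wraparound_distance` is hypothetical. It computes the edit distance between a text and the closest substring of the motif repeated indefinitely, in O(|T||x|) time, by letting the last motif column feed back into the first and sweeping each row twice.

```python
def wraparound_distance(text: str, motif: str) -> int:
    """Edit distance from `text` to the nearest substring of motif repeated forever.

    Illustrative single-motif sketch with unit costs (an assumption);
    it is not the two-motif NTR algorithm described in the abstract.
    """
    if not motif:
        raise ValueError("motif must be non-empty")
    m = len(motif)
    # Row 0: the empty prefix of the text may start at any motif phase for free.
    prev = [0] * m
    for ch in text:
        cur = [0] * m
        for j in range(m):
            # Diagonal move: pair ch with motif[j]; prev[-1] wraps to column m-1.
            diag = prev[j - 1] + (ch != motif[j])
            # Vertical move: ch is an extra character in the text.
            up = prev[j] + 1
            cur[j] = min(diag, up)
        # Horizontal moves (motif characters missing from the text) are cyclic.
        # First sweep, then wrap column m-1 into column 0, then a second sweep.
        for j in range(1, m):
            cur[j] = min(cur[j], cur[j - 1] + 1)
        cur[0] = min(cur[0], cur[m - 1] + 1)
        for j in range(1, m):
            cur[j] = min(cur[j], cur[j - 1] + 1)
        prev = cur
    # The alignment may end at any motif phase.
    return min(prev)
```

Two sweeps are enough because, with a positive gap cost, a run of horizontal moves never needs to go all the way around the motif. Only two rows are kept here, so this sketch reports the score alone; recovering the alignment itself would require storing the full table.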
Pages: 1211-1218
Page count: 8
Related papers
42 in total
  • [1] An Algorithm to Solve the Motif Alignment Problem for Approximate Nested Tandem Repeats
    Matroud, Atheer A.
    Hendy, Michael D.
    Tuffley, Christopher P.
    COMPARATIVE GENOMICS, 2010, 6398 : 188 - +
  • [2] An algorithm for approximate tandem repeats
    Landau, GM
    Schmidt, JP
    Sokol, D
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (01) : 1 - 18
  • [3] Finding approximate tandem repeats in genomic sequences
    Wexler, Y
    Yakhini, Z
    Kashi, Y
    Geiger, D
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2005, 12 (07) : 928 - 942
  • [4] STAR: An algorithm to search for tandem approximate repeats
    Delgrange, O
    Rivals, E
    BIOINFORMATICS, 2004, 20 (16) : 2812 - 2820
  • [5] Locating Tandem Repeats in Weighted Biological Sequences
    Zhang, Hui
    Guo, Qing
    Iliopoulos, Costas S.
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 118 - +
  • [6] An Approximate Algorithm to Solve Facility Layout Problem
    Singh, S. P.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 187 - 192
  • [7] MGWT based Algorithm for Tandem Repeats Detection in DNA Sequences
    Garg, Pardeep
    Sharma, SunilDatt
    PROCEEDINGS OF 2019 5TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC 2K19), 2019, : 196 - 199
  • [8] AN APPROXIMATE ALGORITHM FOR THE MINIMAL VERTEX NESTED POLYGON PROBLEM
    DASGUPTA, B
    MADHAVAN, CEV
    INFORMATION PROCESSING LETTERS, 1989, 33 (01) : 35 - 44
  • [9] An Approximation Algorithm for Alignment of Multiple Sequences using Motif Discovery
    Laxmi Parida
    Aris Floratos
    Isidore Rigoutsos
    Journal of Combinatorial Optimization, 1999, 3 : 247 - 275
  • [10] An approximation algorithm for alignment of multiple sequences using motif discovery
    Parida, L
    Floratos, A
    Rigoutsos, I
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 1999, 3 (2-3) : 247 - 275