共 42 条
An Algorithm to Solve the Motif Alignment Problem for Approximate Nested Tandem Repeats in Biological Sequences
被引:3
|作者:
Matroud, Atheer A.
[1
]
Tuffley, Christopher P.
[1
]
Hendy, Michael D.
[2
]
机构:
[1] Massey Univ, Inst Fundamental Sci, Palmerston North 4442, Manawatu, New Zealand
[2] Univ Otago, Dept Math & Stat, Dunedin, New Zealand
关键词:
algorithms;
alignment;
molecular evolution;
satellites;
simple sequence repeats;
D O I:
10.1089/cmb.2011.0101
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
An approximate nested tandem repeat (NTR) in a string T is a complex repetitive structure consisting of many approximate copies of two substrings x and X ("motifs") interspersed with one another. NTRs fall into a class of repetitive structures broadly known as subrepeats. NTRs have been found in real DNA sequences and are expected to be important in evolutionary biology, both in understanding evolution of the ribosomal DNA (where NTRs can occur), and as a potential marker in population genetic and phylogenetic studies. This article describes an alignment algorithm for the verification phase of the software tool NTRFinder developed for database searches for NTRs. When the search algorithm has located a subsequence containing a possible NTR, with motifs X and x, a verification step aligns this subsequence against an exact NTR built from the templates X and x, to determine whether the subsequence contains an approximate NTR and its extent. This article describes an algorithm to solve this alignment problem in O(vertical bar T vertical bar(vertical bar X vertical bar + vertical bar x vertical bar)) space and time. The algorithm is based on Fischetti et al.'s wrap-around dynamic programming.
引用
收藏
页码:1211 / 1218
页数:8
相关论文