DNA sequences;
supermaximal repeats;
suffix tree;
performance;
D O I:
暂无
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
We study the problem of finding supermaximal repeats in large DNA sequences. For this, we propose an algorithm called SMR which uses an auxiliary index structure (POL), which is derived from and replaces the suffix tree index ST-FD64 [1]. The results of our numerous experiments using the 24 human chromosomes data indicate that SMR outperforms the solution provided as part of the Vmatch [2] software tool. In searching for supermaximal repeats of size at least 10 bases, SMR is twice faster than Vmatch; for a minimum length of 25 bases, SMR is 7 times faster; and for repeats of length at least 200, SMR is about 9 times faster. We also study the cost of POL in terms of time and space requirements.
机构:
University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C Dwarka
Molecular Biology and Genetic Engineering Laboratory, Defence Institute of Bio Energy Research, GoraparaoUniversity School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C Dwarka
Grover A.
Aishwarya V.
论文数: 0引用数: 0
h-index: 0
机构:
University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C Dwarka
Division of Hematology/Oncology, Department of Medicine, University of Pennsylvania School of Medicine, Philadelphia, PAUniversity School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C Dwarka
Aishwarya V.
Sharma P.C.
论文数: 0引用数: 0
h-index: 0
机构:
University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C DwarkaUniversity School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector 16C Dwarka