SimSearch: A New Variant of Dynamic Programming Based on Distance Series for Optimal and Near-Optimal Similarity Discovery in Biological Sequences

被引:0
|
作者
Deusdado, Sergio A. D. [1 ]
Carvalho, Paulo M. M. [2 ]
机构
[1] Polytech Inst Braganca, ESA, P-5300 Braganca, Portugal
[2] Univ Minho, Sch Engn, Dept Informat, P-4710 Braga, Portugal
来源
2ND INTERNATIONAL WORKSHOP ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (IWPACBB 2008) | 2009年 / 49卷
关键词
Similarity discovery; dynamic programming; distance series; EFFICIENT; SEARCH;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose SimSearch, an algorithm implementing a new variant of dynamic programming based on distance series for optimal and near-optimal similarity discovery in biological sequences. The initial phase of SimSearch is devoted to fulfil the binary similarity matrices by signalling the distances between occurrences of the same symbol. The scoring scheme is further applied, when analysed the maximal extension of the pattern. Employing bit parallelism to analyse the global similarity matrix's upper triangle, the new methodology searches the sequence(s) for all the exact and approximate patterns in regular or reverse order. The algorithm accepts parameterization to work with greater seeds for near-optimal results. Performance tests show significant efficiency improvement over traditional optimal methods based on dynamic programming. Comparing the new algorithm's efficiency against heuristic based methods, equalizing the required sensitivity, the proposed algorithm remains acceptable.
引用
收藏
页码:206 / +
页数:3
相关论文
共 31 条
  • [31] Optimal water allocation integrated with water supply, replenishment, and spill in the in-series reservoir based on an improved decomposition and dynamic programming aggregation method
    Xu, Zuping
    Gong, Zhihao
    Cheng, Haomiao
    Cheng, Jilin
    JOURNAL OF HYDROINFORMATICS, 2023, 25 (03) : 989 - 1003