Fast two-dimensional approximate pattern matching

Cited by: 0
Authors
Baeza-Yates, R [1]
Navarro, G [1]
Affiliations
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
DOI
10.1007/BFb0054334
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We address the problem of approximate string matching in two dimensions, that is, finding a pattern of size $m \times m$ in a text of size $n \times n$ with at most $k$ errors (substitutions, insertions and deletions). Although the problem can be solved using dynamic programming in time $O(m^2 n^2)$, this is in general too expensive for small $k$. We therefore design a filtering algorithm that avoids verifying most of the text with dynamic programming. This filter is based on a one-dimensional multi-pattern approximate search algorithm. The average complexity of our resulting algorithm is $O(n^2 k \log_\sigma m / m^2)$ for $k < m(m+1)/(5 \log_\sigma m)$, which is optimal and matches the best previous result, which allows only substitutions. For higher error levels, we present an algorithm with time complexity $O(n^2 k/(w\sqrt{\sigma}))$, where $w$ is the size in bits of the computer word and $\sigma$ is the alphabet size. This algorithm works for $k < m(m+1)(1 - e/\sqrt{\sigma})$, where $e = 2.718\ldots$, a limit that cannot be improved. These are the first good expected-case algorithms for the problem. Our algorithms also work for rectangular patterns and rectangular text, and can even be extended to the case where each row in the pattern and the text has a different length.
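The abstract does not spell out how the filter works, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that errors are counted row by row, so that a pattern of m rows matching with at most k errors in total must have at least one row matching with at most floor(k/m) errors (pigeonhole). The function names `approx_search` and `candidate_cells` are hypothetical, and the plain one-dimensional dynamic programming search stands in for the multi-pattern search the paper uses.

```python
def approx_search(pattern, text, k):
    """Return the 0-based end indices in `text` where `pattern`
    matches some substring with at most k edit errors."""
    m = len(pattern)
    # col[i] = fewest errors to match pattern[:i] against a text
    # substring ending at the current position (initially: empty text).
    col = list(range(m + 1))
    ends = []
    for j, c in enumerate(text):
        prev_diag = col[0]
        col[0] = 0  # a match may start at any text position
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == c else 1
            new = min(prev_diag + cost,  # match or substitution
                      col[i] + 1,        # extra text character
                      col[i - 1] + 1)    # missing pattern character
            prev_diag = col[i]
            col[i] = new
        if col[m] <= k:
            ends.append(j)
    return ends


def candidate_cells(pattern_rows, text_rows, k):
    """Filter step (assumed row-wise error model): report
    (text_row, end_column, pattern_row) triples where a single
    pattern row matches with at most k // m errors."""
    budget = k // len(pattern_rows)
    hits = set()
    for r, text_row in enumerate(text_rows):
        for i, pattern_row in enumerate(pattern_rows):
            for end in approx_search(pattern_row, text_row, budget):
                hits.add((r, end, i))
    return hits


pattern = ["ab", "cd"]
text = ["xabx", "xcdx", "zzzz"]
print(sorted(candidate_cells(pattern, text, 1)))
```

Only the text areas around the reported cells would then need to be checked with the full two-dimensional dynamic programming; everything else is discarded without verification.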
Pages: 341-351
Page count: 11