Online Pattern Matching for String Edit Distance with Moves

Cited by: 0
Authors
Takabatake, Yoshimasa [1]
Tabei, Yasuo [2]
Sakamoto, Hiroshi [1]
Affiliations
[1] Kyushu Inst Technol, Kitakyushu, Fukuoka, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Tokyo 1028666, Japan
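Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract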
Edit distance with moves (EDM) is a string-to-string distance measure that allows substring moves in addition to the ordinary edit operations for turning one string into the other. Although computing EDM exactly is intractable, it has many applications, especially in error detection. Edit sensitive parsing (ESP) is an efficient parsing algorithm that guarantees an upper bound on the parsing discrepancies between different occurrences of the same substring in a string. ESP can be used to compute an approximate EDM as the L1 distance between characteristic vectors built from the node labels of parse trees. However, ESP is not applicable to streaming text data, where the whole text is not known in advance. We present an online ESP (OESP) that enables online pattern matching for EDM. OESP builds a parse tree for a streaming text and computes the L1 distance between characteristic vectors in an online manner. For space-efficient computation of EDM, OESP directly encodes the parse tree into a succinct representation by leveraging the idea behind recent results on dynamic succinct trees. We experimentally test OESP's ability to compute EDM in an online manner on benchmark datasets and demonstrate its efficiency.
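The idea of approximating EDM by the L1 distance between characteristic vectors can be illustrated with a minimal Python sketch. This is not the ESP or OESP algorithm from the paper: the parse below simply pairs adjacent symbols from left to right, which is an assumption made for brevity, and the names `toy_parse_labels`, `l1_distance`, and `approx_distance` are hypothetical. Counting leaf symbols as part of the vector is also an assumption of this sketch.

```python
from collections import Counter


def toy_parse_labels(text, dictionary):
    """Count the node labels of a toy bottom-up pairing parse of `text`.

    Assumption: adjacent symbols are paired left to right and a trailing
    odd symbol is carried up unchanged. Real ESP chooses its grouping
    differently; this only shows how a characteristic vector arises.
    """
    counts = Counter(text)  # leaf symbols are counted as labels too
    level = list(text)
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level) - 1, 2):
            pair = (level[i], level[i + 1])
            # Identical pairs share one label, so repeated substrings
            # tend to yield repeated labels.
            label = dictionary.setdefault(pair, f"X{len(dictionary)}")
            counts[label] += 1
            next_level.append(label)
        if len(level) % 2 == 1:
            next_level.append(level[-1])  # carry the unpaired symbol up
        level = next_level
    return counts


def l1_distance(u, v):
    """L1 distance between two label-count vectors (Counters)."""
    return sum(abs(u[k] - v[k]) for k in set(u) | set(v))


def approx_distance(s, t):
    """Toy stand-in for an EDM estimate: both strings share one dictionary."""
    dictionary = {}
    return l1_distance(toy_parse_labels(s, dictionary),
                       toy_parse_labels(t, dictionary))


if __name__ == "__main__":
    print(approx_distance("abab", "abab"))
    print(approx_distance("abab", "abababab"))
```

Sharing one dictionary between the two strings is what makes the label counts comparable: equal substrings parsed the same way receive the same label in both vectors.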
Pages: 203-214
Page count: 12