Online Pattern Matching for String Edit Distance with Moves

Authors
Takabatake, Yoshimasa [1]
Tabei, Yasuo [2]
Sakamoto, Hiroshi [1]
Affiliations
[1] Kyushu Inst Technol, Kitakyushu, Fukuoka, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Tokyo 1028666, Japan
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Edit distance with moves (EDM) is a string-to-string distance measure that allows substring moves in addition to the ordinary editing operations for turning one string into the other. Although computing EDM exactly is intractable, it has many applications, especially in error detection. Edit sensitive parsing (ESP) is an efficient parsing algorithm that guarantees an upper bound on parsing discrepancies between different occurrences of the same substring in a string. ESP can be used to compute an approximate EDM as the L1 distance between characteristic vectors built from node labels in parse trees. However, ESP is not applicable to streaming text data, where the whole text is not known in advance. We present an online ESP (OESP) that enables online pattern matching for EDM. OESP builds a parse tree for a streaming text and computes the L1 distance between characteristic vectors in an online manner. For space-efficient computation of EDM, OESP directly encodes the parse tree into a succinct representation by leveraging ideas behind recent results on dynamic succinct trees. We experimentally evaluate OESP's ability to compute EDM in an online manner on benchmark datasets and demonstrate its efficiency.
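The idea of comparing strings through characteristic vectors of parse-tree node labels can be illustrated with a minimal sketch. It is not the ESP or OESP algorithm from the paper: it assumes a naive left-to-right pairing of adjacent symbols in place of ESP's parsing rules, so it carries none of ESP's approximation guarantees. The names `parse_labels` and `l1_distance`, and the shared `rules` dictionary, are hypothetical and chosen only for illustration.

```python
from collections import Counter


def parse_labels(text, rules):
    """Count node labels of a simplified binary parse tree of `text`.

    ASSUMPTION: adjacent symbols are paired naively from left to right.
    This is a stand-in for ESP's parsing rules, not ESP itself.
    `rules` maps a leaf or a pair of child labels to an integer label and
    is shared across strings so that equal subtrees get equal labels.
    """
    counts = Counter()
    level = []
    # Leaves: one label per distinct character.
    for ch in text:
        label = rules.setdefault(("leaf", ch), len(rules))
        level.append(label)
        counts[label] += 1
    # Internal nodes: pair neighbours until a single root remains.
    while len(level) > 1:
        nxt = []
        i = 0
        while i + 1 < len(level):
            key = (level[i], level[i + 1])
            label = rules.setdefault(key, len(rules))
            nxt.append(label)
            counts[label] += 1
            i += 2
        if i < len(level):
            # Odd node out: carried up unchanged, not counted again.
            nxt.append(level[i])
        level = nxt
    return counts


def l1_distance(u, v):
    """L1 distance between two characteristic vectors stored as Counters."""
    return sum(abs(u[k] - v[k]) for k in u.keys() | v.keys())


rules = {}
u = parse_labels("abab", rules)
v = parse_labels("abba", rules)
print(l1_distance(u, v))
```

Sharing one `rules` dictionary between both strings is the design choice that makes the vectors comparable: identical substrings parsed the same way map to the same label, so they cancel out in the L1 distance.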
Pages: 203-214
Page count: 12