Online Pattern Matching for String Edit Distance with Moves

被引:0
|
作者
Takabatake, Yoshimasa [1 ]
Tabei, Yasuo [2 ]
Sakamoto, Hiroshi [1 ]
机构
[1] Kyushu Inst Technol, Kitakyushu, Fukuoka, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Tokyo 1028666, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edit distance with moves (EDM) is a string-to-string distance measure that includes substring moves in addition to ordinal editing operations to turn one string to the other. Although optimizing EDM is intractable, it has many applications especially in error detections. Edit sensitive parsing (ESP) is an efficient parsing algorithm that guarantees an upper bound of parsing discrepancies between different appearances of the same substrings in a string. ESP can be used for computing an approximate EDM as the L-1 distance between characteristic vectors built by node labels in parsing trees. However, ESP is not applicable to a streaming text data where a whole text is unknown in advance. We present an online ESP (OESP) that enables an online pattern matching for EDM. OESP builds a parse tree for a streaming text and computes the L-1 distance between characteristic vectors in an online manner. For the space-efficient computation of EDM, OESP directly encodes the parse tree into a succinct representation by leveraging the idea behind recent results of a dynamic succinct tree. We experimentally test OESP on the ability to compute EDM in an online manner on benchmark datasets, and we show OESP's efficiency.
引用
收藏
页码:203 / 214
页数:12
相关论文
共 50 条
  • [1] The String Edit Distance Matching Problem With Moves
    Cormode, Graham
    Muthukrishnan, S.
    ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (01)
  • [2] The string edit distance matching problem with moves
    Cormode, G
    Muthukrishnan, S
    PROCEEDINGS OF THE THIRTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2002, : 667 - 676
  • [3] Privacy-Preserving String Edit Distance with Moves
    Nakagawa, Shunta
    Sakamoto, Tokio
    Takabatake, Yoshimasa
    Tomohiro, I
    Shin, Kilho
    Sakamoto, Hiroshi
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2018, 2018, 11223 : 226 - 240
  • [4] String edit distance, random walks and graph matching
    Robles-Kelly, A
    Hancock, ER
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (03) : 315 - 327
  • [5] siEDM: An Efficient String Index and Search Algorithm for Edit Distance with Moves
    Takabatake, Yoshimasa
    Nakashima, Kenta
    Kuboyama, Tetsuji
    Tabei, Yasuo
    Sakamoto, Hiroshi
    ALGORITHMS, 2016, 9 (02)
  • [6] Online signature verification based on string edit distance
    Riesen, Kaspar
    Schmidt, Roman
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (01) : 41 - 54
  • [7] Online signature verification based on string edit distance
    Kaspar Riesen
    Roman Schmidt
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 41 - 54
  • [8] Graph matching using spectral seriation and string edit distance
    Robles-Kelly, A
    Hancock, ER
    GRAPH BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2726 : 154 - 165
  • [9] Approximate Circular Pattern Matching Under Edit Distance
    Charalampopoulos, Panagiotis
    Pissis, Solon P.
    Radoszewski, Jakub
    Rytter, Wojciech
    Walen, Tomasz
    Zuba, Wiktor
    41ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE, STACS 2024, 2024, 289
  • [10] Using edit distance in point-pattern matching
    Mäkinen, V
    EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2001, : 153 - 161