Computing the Expected Edit Distance from a String to a PFA

Cited by: 1
Authors
Calvo-Zaragoza, Jorge [1]
de la Higuera, Colin [2]
Oncina, Jose [1]
Affiliations
[1] Univ Alicante, DLSI, Alicante, Spain
[2] Univ Nantes, LINA Lab, UMR 6241, Nantes, France
Keywords
Edit distance; Probabilistic finite state automata
DOI
10.1007/978-3-319-40946-7_4
Chinese Library Classification
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
In a number of fields, one needs to compare a witness string with a distribution. One possibility is to compute the probability of the string under that distribution. Another, which gives a more global view, is to compute the expected edit distance between the witness string and a string drawn at random from the distribution. This number is often used to measure the performance of a prediction, the goal then being to return the median string, i.e. the string with the smallest expected distance. Measuring this requires computing the distance between a hypothesis and the distribution. This paper proposes two solutions for computing this value when the distribution is defined by a probabilistic finite state automaton (PFA). The first is exact but has a cost that can be exponential in the length of the input string, whereas the second is a fully polynomial-time randomized approximation scheme (FPRAS).
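To make the quantity in the abstract concrete, the following is a minimal Python sketch that estimates the expected edit distance between a witness string and strings drawn from a PFA by plain Monte Carlo sampling. It is not the exact algorithm or the FPRAS proposed in the paper, whose details are not given in this record. The toy automaton `TOY_PFA`, its transition encoding, and the function names are all hypothetical and chosen only for illustration.

```python
import random


def edit_distance(a, b):
    """Levenshtein distance with unit costs, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]


# Hypothetical toy PFA (an assumption, not taken from the paper).
# Each state maps to a list of (probability, symbol, next_state);
# a symbol of None means "stop and emit the string built so far".
TOY_PFA = {
    0: [(0.5, "a", 0), (0.3, "b", 1), (0.2, None, None)],
    1: [(0.6, "b", 1), (0.4, None, None)],
}


def sample_string(pfa, rng, start=0):
    """Draw one string by following random transitions from the start state."""
    state, out = start, []
    while True:
        r, acc = rng.random(), 0.0
        for prob, sym, nxt in pfa[state]:
            acc += prob
            if r < acc:
                break
        # If rounding prevents a break, the last transition is used.
        if sym is None:
            return "".join(out)
        out.append(sym)
        state = nxt


def estimate_expected_distance(pfa, witness, n_samples=10000, seed=0):
    """Average the edit distance to the witness over n_samples random draws."""
    rng = random.Random(seed)
    total = sum(edit_distance(sample_string(pfa, rng), witness)
                for _ in range(n_samples))
    return total / n_samples


if __name__ == "__main__":
    print(estimate_expected_distance(TOY_PFA, "aab"))
```

The estimate is only as good as the number of samples allows; the sketch gives no accuracy guarantee of the kind an FPRAS provides.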
Pages: 39-50
Page count: 12
Related papers
50 in total
  • [21] Computing the edit distance of a regular language
    Konstantinidis, Stavros
    INFORMATION AND COMPUTATION, 2007, 205 (09) : 1307 - 1316
  • [22] Edit distance for a run-length-encoded string and an uncompressed string
    Liu, J. J.
    Huang, G. S.
    Wang, Y. L.
    Lee, R. C. T.
    INFORMATION PROCESSING LETTERS, 2007, 105 (01) : 12 - 16
  • [23] Online Pattern Matching for String Edit Distance with Moves
    Takabatake, Yoshimasa
    Tabei, Yasuo
    Sakamoto, Hiroshi
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 203 - 214
  • [24] Online signature verification based on string edit distance
    Riesen, Kaspar
    Schmidt, Roman
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (01) : 41 - 54
  • [25] Computing the cyclic edit distance for pattern classification by ranking edit paths
    Jiménez, VM
    Marzal, A
    Palazón, V
    Peris, G
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 125 - 133
  • [26] An algorithm for string edit distance allowing substring reversals
    Arslan, Abdullah N.
    BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 220 - +
  • [27] Compressed String Dictionary Search with Edit Distance One
    Belazzougui, Djamal
    Venturini, Rossano
    ALGORITHMICA, 2016, 74 (03) : 1099 - 1122
  • [28] Compressed String Dictionary Search with Edit Distance One
    Djamal Belazzougui
    Rossano Venturini
    Algorithmica, 2016, 74 : 1099 - 1122
  • [29] String edit distance, random walks and graph matching
    Robles-Kelly, A
    Hancock, ER
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (03) : 315 - 327
  • [30] Contour Regularity Extraction Based on String Edit Distance
    Salas, Jose Ignacio Abreu
    Ramon Rico-Juan, Juan
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 160 - +