An Optimal Decomposition Algorithm for Tree Edit Distance

被引:96
|
作者
Demaine, Erik D. [1 ]
Mozes, Shay [2 ]
Rossman, Benjamin
Weimann, Oren
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Brown Univ, Providence, RI 02912 USA
关键词
Decomposition strategy; dynamic programming; edit distance; ordered trees; tree edit distance;
D O I
10.1145/1644015.1644017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The edit distance between two ordered rooted trees with vertex labels is the minimum cost of transforming one tree into the other by a sequence of elementary operations consisting of deleting and relabeling existing nodes, as well as inserting new nodes. In this article, we present a worst-case O(n(3))-time algorithm for the problem when the two trees have size n, improving the previous best O(n(3) log n)-time algorithm. Our result requires a novel adaptive strategy for deciding how a dynamic program divides into subproblems, together with a deeper understanding of the previous algorithms for the problem. We prove the optimality of our algorithm among the family of decomposition strategy algorithms-which also includes the previous fastest algorithms-by tightening the known lower bound of Omega(n(2) log(2) n) to Omega(n(3)), matching our algorithm's running time. Furthermore, we obtain matching upper and lower bounds for decomposition strategy algorithms of Theta(nm(2)(1+ log n/m) when the two trees have sizes m and n and m < n.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Clustering of Synthetic Routes Using Tree Edit Distance
    Genheden, Samuel
    Engkvist, Ola
    Bjerrum, Esben
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (08) : 3899 - 3907
  • [42] Phylogenetic tree reconstruction based on normalized edit distance
    Li, Yu-Jian
    Wang, Fang-Yuan
    Beijing Gongye Daxue Xuebao / Journal of Beijing University of Technology, 2008, 34 (11): : 1211 - 1215
  • [43] Computing Text Similarity using Tree Edit Distance
    Sidorov, Grigori
    Gomez-Adorno, Helena
    Markov, Ilia
    Pinto, David
    Loya, Nahun
    2015 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY DIGIPEN NAFIPS 2015, 2015,
  • [44] TRACTABLE AND INTRACTABLE VARIATIONS OF UNORDERED TREE EDIT DISTANCE
    Yamamoto, Yoshiyuki
    Hirata, Kouichi
    Kuboyama, Tetsuji
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2014, 25 (03) : 307 - 329
  • [45] Breaking the Cubic Barrier for (Unweighted) Tree Edit Distance
    Mao, Xiao
    2021 IEEE 62ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2021), 2022, : 792 - 803
  • [46] Tree Edit Distance as a Baseline Approach for Paraphrase Representation
    Vila, Marta
    Dras, Mark
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (48): : 89 - 95
  • [47] Tree edit distance: Robust and memory-efficient
    Pawlik, Mateusz
    Augsten, Nikolaus
    INFORMATION SYSTEMS, 2016, 56 : 157 - 173
  • [48] Extend tree edit distance for effective object identification
    Yue Wang
    Hongzhi Wang
    Liyan Zhang
    Yang Wang
    Jianzhong Li
    Hong Gao
    Knowledge and Information Systems, 2016, 46 : 629 - 656
  • [49] Tree Edit Distance Problems: Algorithms and Applications to Bioinformatics
    Akutsu, Tatsuya
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (02): : 208 - 218
  • [50] Distributed algorithm for parallel edit distance computation
    Sadiq M.U.
    Yousaf M.M.
    Computing and Informatics, 2021, 39 (04) : 757 - 779