Efficiently mining frequent embedded unordered trees

被引:1
|
作者
Zaki, MJ [1 ]
机构
[1] Rensselaer Polytech Inst, CSCI, Dept Comp Sci, Troy, NY 12180 USA
关键词
tree mining; embedded trees; unordered trees;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mining frequent trees is very useful in domains like bioinformatics, web mining, mining semi-structured data, and so on. In this paper we introduce SLEUTH, an efficient algorithm for mining frequent, unordered, embedded subtrees in a database of labeled trees. The key contributions of our work are as follows: We give the first algorithm that enumerates all embedded, unordered trees. We propose a new equivalence class extension scheme to generate all candidate trees. We extend the notion of scope-list joins to compute frequency of unordered trees. We conduct performance evaluation on several synthetic and real datasets to show that SLEUTH is an efficient algorithm, which has performance comparable to TreeMiner, that mines only ordered trees.
引用
收藏
页码:33 / 52
页数:20
相关论文
共 50 条
  • [1] New Techniques for Mining Frequent Patterns in Unordered Trees
    Zhang, Sen
    Du, Zhihui
    Wang, Jason T. L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (06) : 1113 - 1125
  • [2] Mining frequent closed unordered trees through natural representations
    Balcazar, Jose L.
    Bifet, Albert
    Lozano, Antoni
    CONCEPTUAL STRUCTURES: KNOWLEDGE ARCHITECTURES FOR SMART APPLICATIONS, PROCEEDINGS, 2007, 4604 : 347 - +
  • [3] BEST: An Efficient Algorithm for Mining Frequent Unordered Embedded Subtrees
    Chowdhury, Israt Jahan
    Nayak, Richi
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 459 - 471
  • [4] BEST: An efficient algorithm for mining frequent unordered embedded subtrees
    Chowdhury, Israt Jahan, 1600, Springer Verlag (8862):
  • [5] Efficiently mining frequent trees in a forest: Algorithms and applications
    Zaki, MJ
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (08) : 1021 - 1035
  • [6] Efficiently Methods for Embedded Frequent Subtree Mining on Biological Data
    Liu, Wei
    Chen, Ling
    Zheng, Lan
    PROCEEDINGS OF INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2009), 2009, : 30 - 34
  • [7] Mining induced and embedded subtrees in ordered, unordered, and partially-ordered trees
    Jimenez, Aida
    Berzal, Fernando
    Cubero, Juan-Carlos
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 111 - 120
  • [8] Discovering frequent substructures in large unordered trees
    Asai, T
    Arimura, H
    Uno, T
    Nakano, S
    DISCOVERY SCIENCE, PROCEEDINGS, 2003, 2843 : 47 - 61
  • [9] Mining maximal embedded unordered tree patterns
    Chehreghani, Mostafa Haghir
    Rahgozar, Masoud
    Lucas, Caro
    Chehreghani, Morteza Haghir
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 437 - 443
  • [10] Efficiently mining maximal frequent itemsets
    Gouda, K
    Zaki, MJ
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 163 - 170