Efficiently mining frequent embedded unordered trees

被引:1
|
作者
Zaki, MJ [1 ]
机构
[1] Rensselaer Polytech Inst, CSCI, Dept Comp Sci, Troy, NY 12180 USA
关键词
tree mining; embedded trees; unordered trees;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mining frequent trees is very useful in domains like bioinformatics, web mining, mining semi-structured data, and so on. In this paper we introduce SLEUTH, an efficient algorithm for mining frequent, unordered, embedded subtrees in a database of labeled trees. The key contributions of our work are as follows: We give the first algorithm that enumerates all embedded, unordered trees. We propose a new equivalence class extension scheme to generate all candidate trees. We extend the notion of scope-list joins to compute frequency of unordered trees. We conduct performance evaluation on several synthetic and real datasets to show that SLEUTH is an efficient algorithm, which has performance comparable to TreeMiner, that mines only ordered trees.
引用
收藏
页码:33 / 52
页数:20
相关论文
共 50 条
  • [21] POTMiner: mining ordered, unordered, and partially-ordered trees
    Aída Jiménez
    Fernando Berzal
    Juan-Carlos Cubero
    Knowledge and Information Systems, 2010, 23 : 199 - 224
  • [22] Mining frequent closed rooted trees
    José L. Balcázar
    Albert Bifet
    Antoni Lozano
    Machine Learning, 2010, 78 : 1 - 33
  • [23] Mining frequent closed rooted trees
    Balcazar, Jose L.
    Bifet, Albert
    Lozano, Antoni
    MACHINE LEARNING, 2010, 78 (1-2) : 1 - 33
  • [24] Efficiently mining frequent itemsets applied for textual aggregation
    Mustapha Bouakkaz
    Youcef Ouinten
    Sabine Loudcher
    Philippe Fournier-Viger
    Applied Intelligence, 2018, 48 : 1013 - 1019
  • [25] Mining Frequent Rooted Ordered Tree Generators Efficiently
    Yi, Shengwei
    Xu, Jize
    Peng, Yong
    Xiong, Qi
    Wang, Ting
    Ma, Shilong
    2013 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2013, : 132 - 139
  • [26] Efficiently using dancing links in mining frequent itemsets
    Hu, Yong
    Zhang, Xiaotong
    Hu, Bin
    Journal of Computational Information Systems, 2015, 11 (17): : 6229 - 6241
  • [27] Efficiently mining maximal frequent itemsets based on digraph
    Ren, Zhibo
    Zhang, Qiang
    Ma, Xiujuan
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 140 - +
  • [28] Efficiently mining frequent itemsets with weight and recency constraints
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Han-Chieh Chao
    Tzung-Pei Hong
    Applied Intelligence, 2017, 47 : 769 - 792
  • [29] Efficiently mining maximal frequent mutually associated patterns
    Zhou, Zhongmei
    Wu, Zhaohui
    Wang, Chunshan
    Feng, Yi
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 110 - 117
  • [30] Efficiently Using Matrix in Mining Maximum Frequent Itemset
    Liu Zhen-yu
    Xu Wei-xiang
    Liu Xumin
    THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 50 - 54