EXiT-B: A new approach for extracting maximal frequent subtrees from XML data

被引:0
|
作者
Paik, J [1 ]
Won, D [1 ]
Fotouhi, F [1 ]
Kim, UM [1 ]
机构
[1] Sungkyunkwan Univ, Dept Comp Engn, Suwon 440746, Gyeonggi Do, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Along with the increasing amounts of XML data available, the data mining community has been motivated to discover the useful information from the collections of XML documents. One of the most popular approaches to find the information is to extract frequent subtrees from a set of XML trees. In this paper, we propose a novel algorithm, EXiT-B, for efficiently extracting maximal frequent subtrees from a set of XML documents. The main contribution of our algorithm is that there is no need to perform tree join operation during the phase of generating maximal frequent subtrees. Thus, the task of finding maximal frequent subtrees can be significantly simplified comparing to the previous approaches.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [1] Simple yet efficient approach for maximal frequent subtrees extraction from a collection of XML documents
    Paik, Juryon
    Kim, Ung Mo
    WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 94 - 103
  • [2] A novel method for mining frequent subtrees from XML data
    Zhang, WS
    Liu, DX
    Zhang, JP
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 300 - 305
  • [3] Efficient data mining for maximal frequent subtrees
    Xiao, YQ
    Yao, JF
    Li, ZG
    Dunham, MH
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 379 - +
  • [5] Discovering frequent subtrees from XML data using neural networks
    College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
    Wuhan Univ J Nat Sci, 2006, 1 (117-121):
  • [6] Mining frequent rooted subtrees in XML data with Me-tree
    Zhang, WS
    Liu, DX
    Zhang, JP
    2004 IEEE SYSTEMS & INFORMATION ENGINEERING DESIGN SYMPOSIUM, 2004, : 209 - 214
  • [7] Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach
    Kutty, Sangeetha
    Tran, Tien
    Nayak, Richi
    Li, Yuefeng
    FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 183 - 194
  • [8] Discovering frequent agreement subtrees from phylogenetic data
    Zhang, Sen
    Wang, Jason T. L.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (01) : 68 - 82
  • [9] Mining closed and maximal frequent subtrees from databases of labeled rooted trees
    Chi, Y
    Xia, Y
    Yang, YR
    Muntz, RR
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) : 190 - 202
  • [10] Mining frequent patterns from XML data
    Win, Chit Nilar
    Hla, Khin Haymar Saw
    APSITT 2005: 6th Asia-Pacific Symposium on Information and Telecommunication Technologies, Proceedings, 2005, : 208 - 212