A novel method for mining frequent subtrees from XML data

被引:0
|
作者
Zhang, WS [1 ]
Liu, DX [1 ]
Zhang, JP [1 ]
机构
[1] Harbin Engn Univ, Dept Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the problem of finding frequent subtrees in a large collection of XML data, where both of the patterns and the data are modeled by labeled ordered trees. We present an efficient algorithm RSTMiner that computes all rooted subtrees appearing in a collection of XML data trees with frequent above a user-specified threshold using a special structure Me-tree. In this algorithm, Me-tree is used as a merging tree to supply scheme information for efficient pruning and mining frequent sub-trees. The keys of the algorithm are efficient pruning candidates with Me-Tree structure and incrementally enumerating all rooted sub-trees in canonical form based on a extended right most expansion technique. Experiment results show that RSTMiner algorithm is efficient and scalable.
引用
收藏
页码:300 / 305
页数:6
相关论文
共 50 条
  • [41] GP-Growth: A new algorithm for mining frequent embedded subtrees
    Hussein, Marwa M. A.
    Soliman, Taysir H. A.
    Karam, Omar H.
    2007 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1-3, 2007, : 524 - 531
  • [42] A Distributed Method for Fast Mining Frequent Patterns From Big Data
    Huang, Peng-Yu
    Cheng, Wan-Shu
    Chen, Ju-Chin
    Chung, Wen-Yu
    Chen, Young-Lin
    Lin, Kawuu W.
    IEEE ACCESS, 2021, 9 : 135144 - 135159
  • [43] A New Approximate Method For Mining Frequent Itemsets From Big Data *
    Valiullin, Timur
    Huang, Zhexue
    Wei, Chenghao
    Yin, Jianfei
    Wu, Dingming
    Egorova, Iuliia
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2021, 18 (03) : 641 - 656
  • [44] A Novel Top-down Algorithm of Frequent XML Query Pattern Mining
    Chang, Tsui-Ping
    Chen, Shih-Ying
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 276 - 280
  • [45] Incremental mining of frequent XML query patterns
    Chen, Y
    Yang, LH
    Wang, YG
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 343 - 346
  • [46] Mining Frequent User Query Patterns from XML Query Streams
    Chang, Tsui-Ping
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2014, 11 (05) : 452 - 458
  • [47] Incremental mining of frequent query patterns from XML queries for caching
    Li, Guoliang
    Feng, Jianhua
    Wang, Jianyong
    Zhang, Yong
    Zhou, Lizhu
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 350 - +
  • [48] Novel algorithm for frequent itemset mining in data warehouses
    Xu L.-J.
    Xie K.-L.
    Journal of Zhejiang University-SCIENCE A, 2006, 7 (2): : 216 - 224
  • [49] Weighted Frequent Itemset Mining Using Weighted Subtrees: WST-WFIM
    Nalousi, Saeed
    Farhang, Yousef
    Sangar, Amin Babazadeh
    IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2021, 44 (02): : 206 - 215
  • [50] A novel algorithm for frequent itemset mining in data warehouses
    徐利军
    谢康林
    Journal of Zhejiang University Science A(Science in Engineering), 2006, (02) : 216 - 224