EXiT-B: A new approach for extracting maximal frequent subtrees from XML data

Cited by: 0
Authors
Paik, J [1]
Won, D [1]
Fotouhi, F [1]
Kim, UM [1]
Affiliations
[1] Sungkyunkwan Univ, Dept Comp Engn, Suwon 440746, Gyeonggi Do, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the growing volume of XML data, the data mining community has been motivated to discover useful information in collections of XML documents. One of the most popular approaches to finding this information is to extract frequent subtrees from a set of XML trees. In this paper, we propose a novel algorithm, EXiT-B, for efficiently extracting maximal frequent subtrees from a set of XML documents. The main contribution of our algorithm is that it requires no tree join operations during the phase that generates maximal frequent subtrees. Thus, the task of finding maximal frequent subtrees is significantly simplified compared with previous approaches.
Pages: 1 / 8
Number of pages: 8