EXiT-B: A new approach for extracting maximal frequent subtrees from XML data

被引:0
|
作者
Paik, J [1 ]
Won, D [1 ]
Fotouhi, F [1 ]
Kim, UM [1 ]
机构
[1] Sungkyunkwan Univ, Dept Comp Engn, Suwon 440746, Gyeonggi Do, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Along with the increasing amounts of XML data available, the data mining community has been motivated to discover the useful information from the collections of XML documents. One of the most popular approaches to find the information is to extract frequent subtrees from a set of XML trees. In this paper, we propose a novel algorithm, EXiT-B, for efficiently extracting maximal frequent subtrees from a set of XML documents. The main contribution of our algorithm is that there is no need to perform tree join operation during the phase of generating maximal frequent subtrees. Thus, the task of finding maximal frequent subtrees can be significantly simplified comparing to the previous approaches.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [31] Incremental mining maximal frequent patterns from univariate uncertain data
    Fasihy, Hanieh
    Shahraki, Mohammad Hossein Nadimi
    KNOWLEDGE-BASED SYSTEMS, 2018, 152 : 40 - 50
  • [32] A new path expression computing approach for XML data
    Lv, JH
    Wang, GR
    Yu, JX
    Yu, G
    Lu, HJ
    Sun, B
    EFFICIENCY AND EFFECTIVENESS OF XML TOOLS AND TECHNIQUES AND DATA INTEGRATION OVER THE WEB, 2003, 2590 : 35 - 46
  • [33] DOM-based algorithm of mining frequent patterns from XML data
    Department of Computer, Nanjing Normal University, Nanjing 210097, China
    Nanjing Hangkong Hangtian Daxue Xuebao, 2006, 2 (206-211):
  • [34] Extracting useful knowledge from event logs: A frequent itemset mining approach
    Djenouri, Youcef
    Belhadi, Asma
    Fournier-Viger, Philippe
    KNOWLEDGE-BASED SYSTEMS, 2018, 139 : 132 - 148
  • [35] Extracting data from WSNs: A data-oriented approach
    Schreiber, Fabio A.
    Camplani, Romolo
    Rota, Guido
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2012, 7200 LNCS : 357 - 373
  • [36] A New Approach for Mining Frequent Items in Data Stream
    Tu, Li
    Chen, Ling
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 225 - 228
  • [37] Extracting non-Gaussian governing laws from data on mean exit time
    Zhang, Yanxia
    Duan, Jinqiao
    Jin, Yanfei
    Li, Yang
    CHAOS, 2020, 30 (11)
  • [38] Extracting knowledge from XML document repository: a semantic Web-based approach
    Kim, Henry M.
    Sengupta, Arijit
    INFORMATION TECHNOLOGY & MANAGEMENT, 2007, 8 (03): : 205 - 221
  • [39] Extracting knowledge from XML document repository: a semantic Web-based approach
    Henry M. Kim
    Arijit Sengupta
    Information Technology and Management, 2007, 8 : 205 - 221
  • [40] An Approach to Extracting Sub-schema Similarities from Semantically Heterogeneous XML Schemas
    De Meo, Pasquale
    Quattrone, Giovanni
    Terracina, Giorgio
    Ursino, Domenico
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2008, 32 (04): : 397 - 420