Efficient Mining of Frequent Closed XML Query Pattern

被引:0
|
作者
Jian-Hua Feng
Qian Qian
Jian-Yong Wang
Li-Zhu Zhou
机构
[1] Tsinghua University,Department of Computer Science and Technology
关键词
computer software; frequent closed pattern; data mining; XML; XPath;
D O I
暂无
中图分类号
学科分类号
摘要
Previous research works have presented convincing arguments that a frequent pattern mining algorithm should not mine all frequent but only the closed ones because the latter leads to not only more compact yet complete result set but also better efficiency. Upon discovery of frequent closed XML query patterns, indexing and caching can be effectively adopted for query performance enhancement. Most of the previous algorithms for finding frequent patterns basically introduced a straightforward generate-and-test strategy. In this paper, we present SOLARIA*, an efficient algorithm for mining frequent closed XML query patterns without candidate maintenance and costly tree-containment checking. Efficient algorithm of sequence mining is involved in discovering frequent tree-structured patterns, which aims at replacing expensive containment testing with cheap parent-child checking in sequences. SOLARIA* deeply prunes unrelated search space for frequent pattern enumeration by parent-child relationship constraint. By a thorough experimental study on various real-life data, we demonstrate the efficiency and scalability of SOLARIA* over the previous known alternative. SOLARIA* is also linearly scalable in terms of XML queries’ size.
引用
收藏
页码:725 / 735
页数:10
相关论文
共 50 条
  • [31] Structural joins: A primitive for efficient XML query pattern matching
    Al-Khalifa, S
    Jagadish, HV
    Koudas, N
    Patel, JM
    Srivastava, D
    Wu, YQ
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 141 - 152
  • [32] An efficient algorithm for incrementally mining frequent closed itemsets
    Yen, Show-Jane
    Lee, Yue-Shi
    Wang, Chiu-Kuang
    APPLIED INTELLIGENCE, 2014, 40 (04) : 649 - 668
  • [33] An Efficient Mining Model for Global Frequent Closed Itemsets
    Lin, Jianming
    Ju, Chunhua
    Liu, Dongsheng
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL II, 2009, : 278 - 282
  • [34] MFCC: An efficient algorithm for frequent closed cube mining
    Xu, Guangyu
    Zhao, Yuhai
    Wang, Guoren
    Mo, Xiaojing
    Yin, Ying
    Journal of Computational Information Systems, 2007, 3 (05): : 2007 - 2012
  • [35] Accelerated Frequent Closed Sequential Pattern Mining for uncertain data
    You, Tao
    Sun, Yue
    Zhang, Ying
    Chen, Jinchao
    Zhang, Peng
    Yang, Mei
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [36] Compact transaction database for efficient frequent pattern mining
    Wan, Q
    An, AJ
    2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 652 - 659
  • [37] An efficient mining algorithm for frequent pattern in intrusion detection
    Li, QH
    Xiong, JJ
    Yang, HB
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 138 - 142
  • [38] Constructing Projection Frequent Pattern Tree for Efficient Mining
    Xiang Jian-wen
    State Key Lab of Software Engineering
    School of Information Science
    WuhanUniversityJournalofNaturalSciences, 2003, (02) : 351 - 357
  • [39] An Efficient Frequent Pattern Mining Algorithm for Data Stream
    Liu Hualei
    Lin Shukuan
    Qiao Jianzhong
    Yu Ge
    Lu Kaifu
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 757 - 761
  • [40] Efficient pattern-growth methods for frequent tree pattern mining
    Wang, C
    Hong, MS
    Pei, J
    Zhou, HF
    Wang, W
    Shi, BL
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 441 - 451