Efficient Mining of Frequent Closed XML Query Pattern

Authors
Jian-Hua Feng
Qian Qian
Jian-Yong Wang
Li-Zhu Zhou
Affiliation
[1] Department of Computer Science and Technology, Tsinghua University
Keywords
computer software; frequent closed pattern; data mining; XML; XPath
DOI
Not available
Abstract
Previous research has argued convincingly that a frequent pattern mining algorithm should mine only the closed frequent patterns rather than all frequent patterns, because the closed set is more compact yet still complete and can be mined more efficiently. Once frequent closed XML query patterns have been discovered, indexing and caching can be applied effectively to improve query performance. Most previous algorithms for finding frequent patterns adopt a straightforward generate-and-test strategy. In this paper, we present SOLARIA*, an efficient algorithm for mining frequent closed XML query patterns without candidate maintenance or costly tree-containment checking. SOLARIA* uses an efficient sequence mining algorithm to discover frequent tree-structured patterns, replacing expensive containment testing with cheap parent-child checking in sequences. It also uses the parent-child relationship constraint to prune unrelated parts of the search space during frequent pattern enumeration. Through a thorough experimental study on various real-life data sets, we demonstrate the efficiency and scalability of SOLARIA* over the previously known alternative. SOLARIA* also scales linearly with the size of the XML queries.
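To make the notion of a closed pattern concrete, here is a minimal Python sketch. It is not SOLARIA*: it uses plain itemsets rather than XML query trees, and it relies on exactly the naive generate-and-test enumeration that the abstract says SOLARIA* avoids. The function names `frequent_itemsets` and `closed_only` and the toy data are assumptions made for illustration. A frequent pattern is kept as closed only if no proper superset has the same support.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Brute-force generate-and-test: count the support of every candidate itemset."""
    items = sorted(set().union(*transactions))
    result = {}
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            cand = frozenset(candidate)
            support = sum(1 for t in transactions if cand <= t)
            if support >= min_support:
                result[cand] = support
    return result


def closed_only(frequent):
    """Keep a pattern only if no proper superset has the same support."""
    return {
        p: s
        for p, s in frequent.items()
        if not any(p < q and s == frequent[q] for q in frequent)
    }


# Toy data (hypothetical): four transactions over the items a, b, c.
transactions = [frozenset("abc"), frozenset("abc"), frozenset("ab"), frozenset("a")]
frequent = frequent_itemsets(transactions, min_support=2)
closed = closed_only(frequent)

print(len(frequent), "frequent patterns,", len(closed), "closed patterns")
```

On this toy input the closed set is smaller than the full frequent set, yet the support of any frequent pattern can still be recovered from its smallest closed superset. This is the sense in which the closed result is compact but complete.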
Pages: 725-735
Page count: 10