A Schema Feature Based Frequent Pattern Mining Algorithm for Semi-structured Data Stream

被引:0
|
作者
Fu, Weiqi [1 ]
Liao, Husheng [1 ]
Jin, Xueyun [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
frequent pattern mining; semi-structured data stream; schema feature;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data mining is used to find useful information from massive data. Frequent pattern mining is one important task of data mining. Recently, the researches on frequent pattern mining for semi-structured data have made some progresses, and it also have a lot of focuses for data stream. However, only a few studies focus on both semi-structured data and data stream. This paper proposes an algorithm named SPrefixTreeISpan. We segment the semi-structured data stream first, and then uses the pattern-growth method to mine each segment. In the end, we maintain all the results on a structure called patternTree. At the same time, the mining algorithm is optimized by the inevitable parent-child relationship and the inevitable child-parent relationship extracted from XML schema. Experiment shows that SPrefixTreeISpan has better performance.
引用
收藏
页码:1329 / 1336
页数:8
相关论文
共 50 条
  • [21] SCHEMADRILL: Interactive Semi-Structured Schema Design
    Spoth, William
    Xie, Ting
    Kennedy, Oliver
    Yang, Ying
    Hammerschmidt, Beda
    Liu, Zhen Hua
    Gawlick, Dieter
    HILDA'18: PROCEEDINGS OF THE WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, 2018,
  • [22] An Algorithm of Semi-structured Data Scheme Extraction Based on OEM Model
    Gong, An
    Yang, Xue-wei
    ADVANCED RESEARCH ON ELECTRONIC COMMERCE, WEB APPLICATION, AND COMMUNICATION, PT 1, 2011, 143 : 315 - 319
  • [23] Schema Discovery of Semi-structured Hierarchical Data Based on OEM Model and Hierarchical Transactional Database
    Lv, Cheng
    Wei, Chu-yuan
    Hao, Ying
    ICMECG: 2009 INTERNATIONAL CONFERENCE ON MANAGEMENT OF E-COMMERCE AND E-GOVERNMENT, PROCEEDINGS, 2009, : 172 - 175
  • [24] Algorithm based on counting for mining frequent items over data stream
    Zhu, Ranwei
    Wang, Peng
    Liu, Majin
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2011, 48 (10): : 1803 - 1811
  • [25] A Novel Frequent Pattern Mining Algorithm for Real-time Radar Data Stream
    Huang, Fang
    Zheng, Ningning
    TRAITEMENT DU SIGNAL, 2019, 36 (01) : 23 - 30
  • [26] Research of frequent pattern mining from XML data based on heterogeneous XML schema
    College of Computer Science, Chongqing University, Chongqing 400044, China
    不详
    J. Comput. Inf. Syst., 2008, 3 (787-794):
  • [27] An Algorithm for Mining Frequent Closed Itemsets in Data Stream
    Dai, Caiyan
    Chen, Ling
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL I, 2010, : 281 - 284
  • [28] An Efficient Algorithm for Mining Frequent Patterns in Data Stream
    Zhang Guang-lu
    Lei Jing-sheng
    INTERNATIONAL CONFERENCE OF CHINA COMMUNICATION (ICCC2010), 2010, : 160 - +
  • [29] An Algorithm for Mining Frequent Closed Itemsets in Data Stream
    Dai, Caiyan
    Chen, Ling
    INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT C, 2012, 24 : 1722 - 1728
  • [30] Data Analysis for Gathering and Analysis of no structured and semi-structured information using Data Mining Technique
    Papavlasopoulos, Sozon
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 293 - +