A Schema Feature Based Frequent Pattern Mining Algorithm for Semi-structured Data Stream

被引:0
|
作者
Fu, Weiqi [1 ]
Liao, Husheng [1 ]
Jin, Xueyun [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
frequent pattern mining; semi-structured data stream; schema feature;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data mining is used to find useful information from massive data. Frequent pattern mining is one important task of data mining. Recently, the researches on frequent pattern mining for semi-structured data have made some progresses, and it also have a lot of focuses for data stream. However, only a few studies focus on both semi-structured data and data stream. This paper proposes an algorithm named SPrefixTreeISpan. We segment the semi-structured data stream first, and then uses the pattern-growth method to mine each segment. In the end, we maintain all the results on a structure called patternTree. At the same time, the mining algorithm is optimized by the inevitable parent-child relationship and the inevitable child-parent relationship extracted from XML schema. Experiment shows that SPrefixTreeISpan has better performance.
引用
收藏
页码:1329 / 1336
页数:8
相关论文
共 50 条
  • [31] Querying semi-structured data
    Abiteboul, S
    DATABASE THEORY - ICDT'97, 1997, 1186 : 1 - 18
  • [32] Efficient algorithms for finding frequent substructures from semi-structured data streams
    Asai, Tatsuya
    Abe, Kenji
    Kawasoe, Shinji
    Arimura, Hiroki
    Arikawa, Setsuo
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 3609 : 29 - +
  • [33] Mining schemas in semi-structured data using fuzzy decision trees
    Wei, S
    Da-xin, L
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, VOL 4, PROCEEDINGS, 2005, 3483 : 753 - 761
  • [34] WICCAO: From semi-structured data to structured data
    Li, Z
    Ng, WK
    11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOP ON THE ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 2004, : 86 - 93
  • [35] A frequent pattern mining algorithm for feature extraction of customer reviews
    Ibrahim, R., 1600, International Journal of Computer Science Issues (IJCSI) (09): : 4 - 1
  • [36] Keyword Search on Structured and Semi-Structured Data
    Chen, Yi
    Wang, Wei
    Liu, Ziyang
    Lin, Xuemin
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 1005 - 1009
  • [37] Data Warehouse Based Approach to the Integration of Semi-structured Data
    Ahmad, Houda
    Kermanshahani, Shokoh
    Simonet, Ana
    Simonet, Michel
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 88 - 99
  • [38] A semi-structured document model for text mining
    Yang, JW
    Chen, XO
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (05) : 603 - 610
  • [39] A semi-structured document model for text mining
    Jianwu Yang
    Xiaoou Chen
    Journal of Computer Science and Technology, 2002, 17 : 603 - 610
  • [40] Conceptual Graphs Based Modeling of Semi-structured Data
    Varga, Viorica
    Sacarea, Christian
    Molnar, Andrea Eva
    GRAPH-BASED REPRESENTATION AND REASONING (ICCS 2018), 2018, 10872 : 167 - 175