An Algorithm of Semi-structured Data Scheme Extraction Based on OEM Model

Cited by: 0
Authors
Gong, An [1 ]
Yang, Xue-wei [1 ]
Affiliations
[1] China Univ Petr E China, Coll Comp & Commun Engn, Dongying 257061, Peoples R China
Keywords
Semi-structured data; frequent patterns mining; OEM; the longest frequent label path;
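DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code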
0812 ;
Abstract
To obtain the target model of semi-structured data rapidly, effectively, and accurately, this paper uses the properties of label paths to propose an algorithm that extracts the target model directly from the OEM model of the data. The basic idea of the algorithm is as follows. A depth-first search collects all label path expressions. Property 2 of the paper then reduces the number of path-matching operations, so that all frequent label path expressions can be generated level by level. Finally, a deletion strategy yields all of the longest frequent label path expressions. Theoretical analysis and experimental results show that the algorithm improves the accuracy of the target model and reduces the size of the candidate sets in pattern extraction.
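The three steps named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation, and it rests on several assumptions that the abstract does not state: an OEM object is modelled as a Python dict mapping each label to a list of child objects, the graph is acyclic, the support of a label path is the number of top-level records that contain it, and the pruning rule standing in for the paper's Property 2 is the usual one that a path can only be frequent if its prefix is frequent. The names `label_paths`, `frequent_label_paths`, `longest_frequent_paths`, and `min_support` are hypothetical.

```python
from collections import defaultdict


def label_paths(obj, prefix=()):
    """Depth-first search yielding every label path that starts at obj.

    Assumption: an OEM object is a dict {label: [child, ...]};
    anything that is not a dict is treated as an atomic value.
    """
    if not isinstance(obj, dict):
        return
    for label, children in obj.items():
        path = prefix + (label,)
        yield path
        for child in children:
            yield from label_paths(child, path)


def frequent_label_paths(records, min_support):
    """Generate frequent label paths level by level (length 1, 2, ...).

    Assumed pruning rule: a path of length k is counted only if its
    prefix of length k - 1 was found frequent at the previous level.
    """
    per_record = [set(label_paths(r)) for r in records]
    frequent = set()
    k = 1
    while True:
        counts = defaultdict(int)
        for paths in per_record:
            for p in paths:
                if len(p) == k and (k == 1 or p[:-1] in frequent):
                    counts[p] += 1
        level = {p for p, c in counts.items() if c >= min_support}
        if not level:
            return frequent
        frequent |= level
        k += 1


def longest_frequent_paths(frequent):
    """Deletion step: drop every path that is a proper prefix of another."""
    prefixes = {p[:i] for p in frequent for i in range(1, len(p))}
    return frequent - prefixes


# Hypothetical toy data: three records, each a small OEM-like tree.
records = [
    {"person": [{"name": ["Ann"], "email": ["a@example.org"]}]},
    {"person": [{"name": ["Bob"]}]},
    {"person": [{"name": ["Cy"], "phone": ["555"]}]},
]

frequent = frequent_label_paths(records, min_support=2)
longest = longest_frequent_paths(frequent)
```

With `min_support=2`, only `person` and `person.name` occur in at least two records, so the deletion step removes the prefix `person` and keeps `person.name` as the single longest frequent label path.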
Pages: 315-319
Page count: 5